landing-page-intel

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Landing Page Intel

落地页情报提取

Extract GTM-relevant intelligence from any company's landing page by scraping its HTML source.
通过抓取任意公司落地页的HTML源码,提取与GTM相关的情报。

Quick Start

快速开始

Only dependency is
pip install requests
. No API key needed.
bash
undefined
唯一依赖是
pip install requests
,无需API密钥。
bash
undefined

Basic scan of a single URL

Basic scan of a single URL

python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com"
python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com"

Scan multiple pages of the same site

Scan multiple pages of the same site

python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com" --pages "/,/pricing,/about"
python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com" --pages "/,/pricing,/about"

Output as summary table instead of JSON

Output as summary table instead of JSON

python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com" --output summary
python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com" --output summary

Save full report to file

Save full report to file

python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com" --output json > report.json
undefined
python3 skills/landing-page-intel/scripts/scrape_landing_page.py
--url "https://example.com" --output json > report.json
undefined

What It Extracts

可提取的内容

CategoryDetails
Tech StackAnalytics (GA4, Mixpanel, Amplitude, PostHog, Heap), marketing automation (HubSpot, Marketo, Pardot), chat widgets (Intercom, Drift, Crisp, Zendesk), A/B testing (Optimizely, VWO, LaunchDarkly), session recording (Hotjar, FullStory, LogRocket), CDPs (Segment, Clearbit, 6sense)
Ad PixelsMeta Pixel, Google Ads, LinkedIn Insight Tag, TikTok pixel, Twitter pixel
Customer LogosImage URLs from "trusted by" / logo carousel sections, grouped by directory
SEO MetadataTitle, meta description, Open Graph tags, Twitter Cards, canonical URL, structured data (JSON-LD), hreflang tags
CTAs & Sales MotionAll CTA button text and links — reveals PLG vs sales-led motion
Social ProofTestimonials, customer counts, case study links, badge images
IntegrationsLinks to integration/partner pages, embedded third-party widgets
Hidden ElementsContent in
display:none
,
hidden
, or HTML comments that may reveal upcoming features
InfrastructureCMS platform (Webflow, WordPress, Next.js, etc.), detected from HTML signatures
分类详情
技术栈分析工具(GA4、Mixpanel、Amplitude、PostHog、Heap)、营销自动化工具(HubSpot、Marketo、Pardot)、聊天组件(Intercom、Drift、Crisp、Zendesk)、A/B测试工具(Optimizely、VWO、LaunchDarkly)、会话录制工具(Hotjar、FullStory、LogRocket)、客户数据平台(Segment、Clearbit、6sense)
广告像素Meta Pixel、Google Ads、LinkedIn Insight Tag、TikTok pixel、Twitter pixel
客户标识来自“客户信任”/标识轮播区域的图片URL,按目录分组
SEO元数据页面标题、元描述、Open Graph标签、Twitter Cards、规范URL、结构化数据(JSON-LD)、hreflang标签
CTA与销售流程所有CTA按钮的文本和链接——可揭示产品是PLG(产品主导型增长)还是销售主导型增长模式
社交证明客户证言、客户数量、案例研究链接、徽章图片
集成信息集成/合作伙伴页面链接、嵌入的第三方组件
隐藏元素
display:none
hidden
属性或HTML注释中的内容,可能揭示即将推出的功能
基础设施内容管理系统平台(Webflow、WordPress、Next.js等),通过HTML特征识别

CLI Reference

CLI参考

FlagDefaultDescription
--url
requiredTarget website URL
--pages
/
Comma-separated paths to scan (e.g.,
/,/pricing,/about
)
--output
json
Output format:
json
or
summary
--timeout
15
Request timeout in seconds
参数默认值描述
--url
必填目标网站URL
--pages
/
逗号分隔的待扫描路径(例如:
/,/pricing,/about
--output
json
输出格式:
json
summary
--timeout
15
请求超时时间(秒)

GTM Use Cases

GTM使用场景

  • Competitive intel: See what tools competitors use, how they position, who their customers are
  • Prospect research: Before a sales call, scan a prospect's site to understand their stack and maturity
  • Market mapping: Scan multiple competitors to compare positioning, customer segments, and GTM motions
  • Customer discovery: Extract competitor customer logos as potential prospects for your own product
  • 竞品情报:了解竞品使用的工具、定位方式以及客户群体
  • 潜在客户调研:销售沟通前,扫描潜在客户的网站以了解其技术栈和成熟度
  • 市场图谱构建:扫描多个竞品以对比其定位、客户细分和GTM策略
  • 客户挖掘:提取竞品的客户标识,作为自身产品的潜在客户

Cost

成本

Free. No API keys required. Uses only HTTP requests to fetch public HTML.
完全免费。无需API密钥,仅通过HTTP请求获取公开的HTML内容。