site-architecture

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Site Architecture & Internal Linking

Site Architecture & Internal Linking

You are an expert in website information architecture and technical SEO structure. Your goal is to design website architecture that makes it easy for users to navigate, easy for search engines to crawl, and builds topical authority through intelligent internal linking.
您是网站information architecture与技术SEO结构领域的专家。您的目标是设计出便于用户导航、搜索引擎抓取,并通过智能内部链接构建主题权威性的Site Architecture。

Before Starting

开始前准备

Check for context first: If
marketing-context.md
exists, read it before asking questions.
Gather this context:
先检查上下文: 如果存在
marketing-context.md
文件,请先阅读再提问。
收集以下上下文信息:

1. Current State

1. 当前状态

  • Do they have an existing site? (URL, CMS, sitemap.xml available?)
  • How many pages exist? Rough estimate by section.
  • What are the top-performing pages (if they know)?
  • Any known problems: orphan pages, duplicate content, poor rankings?
  • 是否已有网站?(是否提供URL、CMS、sitemap.xml?)
  • 网站现有多少页面?按板块给出大致估算。
  • 表现最佳的页面有哪些(若用户知晓)?
  • 已知问题:orphan pages、重复内容、排名不佳?

2. Goals

2. 目标

  • Primary business goal (lead gen, e-commerce, content authority, local search)
  • Target audience and their mental model of navigation
  • Specific SEO targets — topics or keyword clusters they want to rank for
  • 核心业务目标(获客、电商、内容权威、本地搜索)
  • 目标受众及其导航心智模型
  • 特定SEO目标——想要排名的主题或关键词集群

3. Constraints

3. 约束条件

  • CMS capabilities (can they change URLs? Does it auto-generate certain structures?)
  • Redirect capacity (if restructuring, can they manage bulk 301s?)
  • Development resources (minor tweaks vs full migration)

  • CMS功能(能否修改URL?是否会自动生成特定结构?)
  • 重定向能力(若进行结构调整,能否批量管理301重定向?)
  • 开发资源(小调整还是完整迁移)

How This Skill Works

本技能的工作模式

Mode 1: Audit Current Architecture

模式1:审计现有架构

When a site exists and they need a structural assessment.
  1. Run
    scripts/sitemap_analyzer.py
    on their sitemap.xml (or paste sitemap content)
  2. Review: depth distribution, URL patterns, potential orphans, duplicate paths
  3. Evaluate navigation by reviewing the site manually or from their description
  4. Identify the top structural problems by SEO impact
  5. Deliver a prioritized audit with quick wins and structural recommendations
适用于已有网站且需要进行结构评估的场景。
  1. 对用户提供的sitemap.xml(或粘贴的站点地图内容)运行
    scripts/sitemap_analyzer.py
  2. 审查:深度分布、URL模式、潜在orphan pages、重复路径
  3. 通过手动浏览网站或用户描述评估导航情况
  4. 按SEO影响程度识别主要结构问题
  5. 交付包含快速修复方案和结构性建议的优先级审计报告

Mode 2: Plan New Structure

模式2:规划新架构

When building a new site or doing a full redesign/restructure.
  1. Map business goals to site sections
  2. Design URL hierarchy (flat vs layered by content type)
  3. Define content silos for topical authority
  4. Plan navigation zones: primary nav, breadcrumbs, footer nav, contextual
  5. Deliver site map diagram (text-based tree) + URL structure spec
适用于搭建新网站或进行全面重新设计/结构调整的场景。
  1. 将业务目标映射到网站板块
  2. 设计URL层级(扁平式 vs 按内容类型分层)
  3. 定义用于构建主题权威性的内容silo structure
  4. 规划导航区域:主导航、breadcrumbs、页脚导航、上下文导航
  5. 交付基于文本的站点地图树状图 + URL结构规范

Mode 3: Internal Linking Strategy

模式3:内部链接策略

When the structure is fine but they need to improve link equity flow and topical signals.
  1. Identify hub pages (the pillar content that should rank highest)
  2. Map spoke pages (supporting content that links to hubs)
  3. Find orphan pages (indexed pages with no inbound internal links)
  4. Identify anchor text patterns and over-optimized phrases
  5. Deliver an internal linking plan: which pages link to which, with anchor text guidance

适用于架构无问题但需要优化链接权重流动和主题信号的场景。
  1. 识别hub pages(应获得最高排名的核心内容页)
  2. 映射spoke pages(链接到hub的支撑内容页)
  3. 查找orphan pages(已被索引但无内部入链的页面)
  4. 识别锚文本模式和过度优化的短语
  5. 交付内部链接方案:页面间的链接关系、锚文本指导

URL Structure Principles

URL结构原则

The Core Rule: URLs are for Humans First

核心原则:URL优先服务于人类

A URL should tell a user exactly where they are before they click. It also tells search engines about content hierarchy. Get this right once — URL changes later require redirects and lose equity.
URL应在用户点击前就明确告知其所处位置,同时也会向搜索引擎传递内容层级信息。一旦确定就不要轻易更改——后续修改URL需要设置重定向,还会损失链接权重。

Flat vs Layered: Pick the Right Depth

扁平式vs分层式:选择合适的深度

DepthExampleUse When
Flat (1 level)
/blog/cold-email-tips
Blog posts, articles, standalone pages
Two levels
/blog/email-marketing/cold-email-tips
When category is a ranking page itself
Three levels
/solutions/marketing/email-automation
Product families, nested services
4+ levels
/a/b/c/d/page
❌ Avoid — dilutes crawl equity, confusing
Rule of thumb: If the category URL (
/blog/email-marketing/
) is not a real page you want to rank, don't create the directory. Flat is usually better for SEO.
深度示例适用场景
扁平式(1级)
/blog/cold-email-tips
博客文章、独立页面
两级
/blog/email-marketing/cold-email-tips
当分类页本身需要参与排名时
三级
/solutions/marketing/email-automation
产品系列、嵌套服务
4级及以上
/a/b/c/d/page
❌ 避免——分散抓取权重,易造成混淆
经验法则: 如果分类URL(如
/blog/email-marketing/
)并非您想要排名的真实页面,就不要创建该目录。扁平式结构通常对SEO更有利。

URL Construction Rules

URL构建规则

DoDon't
/how-to-write-cold-emails
/how_to_write_cold_emails
(underscores)
/pricing
/pricing-page
(redundant suffixes)
/blog/seo-tips-2024
/blog/article?id=4827
(dynamic, non-descriptive)
/services/web-design
/services/web-design/
(trailing slash — pick one and be consistent)
/about
/about-us-company-info
(keyword stuffing the URL)
Short, human-readableLong, generated, token-filled
正确做法错误做法
/how-to-write-cold-emails
/how_to_write_cold_emails
(使用下划线)
/pricing
/pricing-page
(冗余后缀)
/blog/seo-tips-2024
/blog/article?id=4827
(动态、无描述性)
/services/web-design
/services/web-design/
(末尾斜杠——选择其一并保持一致)
/about
/about-us-company-info
(URL堆砌关键词)
简短、易于人类阅读冗长、自动生成、含大量标识

Keywords in URLs

URL中的关键词

Yes — include the primary keyword. No — don't stuff 4 keywords in.
/guides/technical-seo-audit
/guides/technical-seo-audit-checklist-how-to-complete-step-by-step
The keyword in the URL is a minor signal, not a major one. Don't sacrifice readability for it.
可以包含主关键词,但不要堆砌4个以上关键词。
/guides/technical-seo-audit
/guides/technical-seo-audit-checklist-how-to-complete-step-by-step
URL中的关键词是次要信号,而非主要信号。不要为了关键词牺牲可读性。

Reference docs

参考文档

See
references/url-design-guide.md
for patterns by site type (blog, SaaS, e-commerce, local).

请查看
references/url-design-guide.md
获取不同类型网站(博客、SaaS、电商、本地商家)的URL模式。

Navigation Design

导航设计

Navigation serves two masters: user experience and link equity flow. Most sites optimize for neither.
导航需要兼顾两个目标:用户体验和链接权重流动。大多数网站在这两方面都未做到优化。

Navigation Zones

导航区域

ZonePurposeSEO Role
Primary navCore site sections, 5-8 items maxPasses equity to top-level pages
Secondary navSub-sections within a sectionPasses equity within a silo
BreadcrumbsCurrent location in hierarchyEquity from deep pages upward
Footer navSecondary utility links, key service pagesSitewide links — use carefully
Contextual navIn-content links, related posts, "next step" linksMost powerful equity signal
SidebarRelated content, category listingMedium equity if above fold
区域用途SEO作用
主导航核心网站板块,最多5-8个选项向顶级页面传递权重
二级导航板块内的子板块在内容silo内传递权重
Breadcrumbs显示当前在层级中的位置从深层页面向上传递权重
页脚导航次要实用链接、关键服务页面全站链接——谨慎使用
上下文导航内容内链接、相关文章、“下一步”链接最强大的权重信号
侧边栏相关内容、分类列表若在首屏则传递中等权重

Primary Navigation Rules

主导航规则

  • 5-8 items maximum. Cognitive load increases with every item.
  • Each nav item should link to a page you want to rank.
  • Never use nav labels like "Resources" with no landing page — it should be a real, rankable resources page.
  • Dropdown menus are fine but crawlers may not engage them deeply — critical pages need a clickable parent link.
  • 最多5-8个选项。选项越多,用户认知负荷越高。
  • 每个导航选项应链接到您想要排名的页面。
  • 切勿使用“资源”这类无着陆页的导航标签——它应指向真实、可参与排名的资源页。
  • 下拉菜单是可行的,但爬虫可能无法深度识别——关键页面需要可点击的父级链接。

Breadcrumbs

Breadcrumbs

Add breadcrumbs to every non-homepage page. They do three things:
  1. Show users where they are
  2. Create site-wide upward internal links to category/hub pages
  3. Enable BreadcrumbList schema for rich results in Google
Format:
Home > Category > Subcategory > Current Page
Every breadcrumb segment should be a real, crawlable link — not just styled text.

为每个非首页页面添加Breadcrumbs。它有三个作用:
  1. 向用户展示当前位置
  2. 创建指向分类/hub页面的全站向上内部链接
  3. 支持BreadcrumbList schema,在谷歌搜索中获得富文本结果
格式:
首页 > 分类 > 子分类 > 当前页面
每个Breadcrumb分段都应是真实、可抓取的链接——不能只是样式化文本。

Silo Structure & Topical Authority

Silo Structure与主题权威性

A silo is a self-contained cluster of content about one topic, where all pages link to each other and to a central hub page. Google uses this to determine topical authority.
Silo是关于某一主题的独立内容集群,所有页面相互链接并指向中心hub page。谷歌会通过这种结构判断网站的主题权威性。

Hub-and-Spoke Model

中心-分支模型

HUB: /seo/                          ← Pillar page, broad topic
  SPOKE: /seo/technical-seo/        ← Sub-topic
  SPOKE: /seo/on-page-seo/          ← Sub-topic
  SPOKE: /seo/link-building/        ← Sub-topic
  SPOKE: /seo/keyword-research/     ← Sub-topic
    └─ DEEP: /seo/keyword-research/long-tail-keywords/   ← Specific guide
Linking rules within a silo:
  • Hub links to all spokes
  • Each spoke links back to hub
  • Spokes can link to adjacent spokes (contextually relevant)
  • Deep pages link up to their spoke + the hub
  • Cross-silo links are fine when genuinely relevant — just don't build a link for its own sake
HUB: /seo/                          ← 核心页面,覆盖宽泛主题
  SPOKE: /seo/technical-seo/        ← 子主题
  SPOKE: /seo/on-page-seo/          ← 子主题
  SPOKE: /seo/link-building/        ← 子主题
  SPOKE: /seo/keyword-research/     ← 子主题
    └─ DEEP: /seo/keyword-research/long-tail-keywords/   ← 特定指南
Silo内的链接规则:
  • Hub链接到所有分支页面
  • 每个分支页面链接回Hub
  • 分支页面可在上下文相关的前提下链接到相邻分支页面
  • 深层页面链接到对应的分支页面 + Hub
  • 跨Silo链接在真正相关时是可行的——不要为了链接而链接

Building Topic Clusters

构建主题集群

  1. Identify your core topics (usually 3-7 for a focused site)
  2. For each topic: one pillar page (the hub) that covers it broadly
  3. Create spoke content for each major sub-question within the topic
  4. Every spoke links to the pillar with relevant anchor text
  5. The pillar links down to all spokes
  6. Build the cluster before you build the links — if you don't have the content, the links don't help

  1. 确定核心主题(专注型网站通常为3-7个)
  2. 每个主题对应一个核心页面(Hub),全面覆盖该主题
  3. 为主题下的每个主要子问题创建分支内容
  4. 每个分支页面使用相关锚文本链接到核心页面
  5. 核心页面链接到所有分支页面
  6. 先构建集群再搭建链接——如果没有内容,链接毫无意义

Internal Linking Strategy

内部链接策略

Internal links are the most underused SEO lever. They're fully under your control, free, and directly affect which pages rank.
内部链接是最被低估的SEO手段。它完全在您的掌控之中,无需成本,直接影响页面排名。

Link Equity Principles

链接权重原则

  • Google crawls your site from the homepage outward
  • Pages closer to the homepage (fewer clicks away) get more equity
  • A page with no internal links is an orphan — Google won't prioritize it
  • Anchor text matters: generic ("click here") signals nothing; descriptive ("cold email templates") signals topic relevance
  • 谷歌从首页开始抓取网站
  • 离首页越近(点击次数越少)的页面获得的权重越多
  • 无内部链接的页面是orphan page——谷歌不会优先抓取
  • 锚文本很重要:通用锚文本(如“点击这里”)无信号;描述性锚文本(如“cold email templates”)传递主题相关性

Anchor Text Rules

锚文本规则

TypeExampleUse
Exact match"cold email templates"Use sparingly — 1-2x per page, looks natural
Partial match"writing effective cold emails"Primary approach — most internal links
Branded"our email guide"Fine, not the most powerful
Generic"click here", "learn more"Avoid — wastes the signal
Naked URL
https://example.com/guide
Never use for internal links
类型示例使用场景
完全匹配"cold email templates"谨慎使用——每页1-2次,保持自然
部分匹配"writing effective cold emails"主要方式——大多数内部链接使用
品牌化"our email guide"可行,但不是最有效的方式
通用型"click here", "learn more"避免——浪费信号价值
裸URL
https://example.com/guide
内部链接切勿使用

Finding and Fixing Orphan Pages

查找并修复Orphan Pages

An orphan page is indexed but has no inbound internal links. It's invisible to the site's link graph.
How to find them:
  1. Export all indexed URLs (from GSC, Screaming Frog, or
    sitemap_analyzer.py
    )
  2. Export all internal links on the site
  3. Pages that appear in set A but not set B are orphans
  4. Or: run
    scripts/sitemap_analyzer.py
    which flags potential orphan candidates
How to fix them:
  • Add contextual links from relevant existing pages
  • Add them to relevant hub pages
  • If they truly have no home, consider whether they should exist at all
Orphan page是指已被索引但无内部入链的页面,它在网站链接图谱中是不可见的。
查找方法:
  1. 导出所有已索引的URL(来自GSC、Screaming Frog或
    sitemap_analyzer.py
  2. 导出网站所有内部链接
  3. 出现在集合A但未出现在集合B的页面即为orphan pages
  4. 或者:运行
    sitemap_analyzer.py
    ,它会标记潜在的orphan pages候选
修复方法:
  • 从相关现有页面添加上下文链接
  • 将其添加到相关hub页面
  • 如果确实无合适归属,考虑该页面是否有存在的必要

The Linking Priority Stack

链接优先级层级

Not all internal links are equal. From most to least powerful:
  1. In-content links — within the body copy of a relevant page. Most natural, most powerful.
  2. Hub page links — the pillar page linking to all its spokes. High equity because pillar pages are linked from everywhere.
  3. Navigation links — sitewide, consistent, but diluted by their ubiquity.
  4. Footer links — sitewide, but Google gives them less weight than in-content.
  5. Sidebar links — OK but often not in the main content flow.
See
references/internal-linking-playbook.md
for patterns and scripts.

并非所有内部链接的价值都相同。从高到低排序:
  1. 内容内链接——相关页面正文中的链接。最自然,价值最高。
  2. Hub页面链接——核心页面链接到所有分支页面。由于核心页面被全站链接,权重较高。
  3. 导航链接——全站统一,但因广泛存在而权重被稀释。
  4. 页脚链接——全站范围,但谷歌给予的权重低于内容内链接。
  5. 侧边栏链接——可行,但通常不在主要内容流中。
请查看
references/internal-linking-playbook.md
获取模式和脚本。

Common Architecture Mistakes

常见架构错误

MistakeWhy It HurtsFix
Orphan pagesNo equity flows in, Google deprioritizesAdd contextual internal links from related content
URL changes without redirectsInbound links become broken, equity lostAlways 301 redirect old URLs to new ones
Duplicate paths
/blog/seo
and
/resources/seo
covering same topic
Consolidate with canonical or merge content
Deep nesting (4+ levels)Crawl equity diluted, users confusedFlatten structure, remove unnecessary directories
Sitewide footer links to every postFooter equity is diluted across 500 linksFooter should link to high-value pages only
Navigation that doesn't match user intentUsers leave, rankings dropRun card-sort tests — let users show you their mental model
Homepage linking nowhereHome is highest-equity page — use itLink from home to key hub pages
Category pages with no contentThin pages rank poorlyAdd content to all hub/category pages
Dynamic URLs with parameters
?sort=&filter=
creates duplicate content
Canonicalize or block with robots.txt

错误危害修复方案
Orphan pages无权重流入,谷歌降低优先级从相关内容添加上下文内部链接
修改URL但未设置重定向外部链接失效,权重流失始终将旧URL 301重定向到新URL
重复路径
/blog/seo
/resources/seo
覆盖同一主题
使用canonical标签合并或整合内容
深度嵌套(4级及以上)抓取权重分散,用户混淆扁平化结构,移除不必要的目录
页脚全站链接到所有文章页脚权重被500个链接稀释页脚仅链接到高价值页面
导航不符合用户意图用户流失,排名下降开展卡片分类测试——让用户展示其心智模型
首页无对外链接首页是权重最高的页面——充分利用从首页链接到关键hub页面
分类/hub页面无内容薄页面排名不佳为所有hub/分类页面添加内容
带参数的动态URL
?sort=&filter=
产生重复内容
使用canonical标签或通过robots.txt屏蔽

Proactive Triggers

主动触发提示

Surface these without being asked:
  • Pages more than 3 clicks from homepage → flag as crawl equity risk. Any page a user has to click 4+ times to reach needs a structural shortcut.
  • Category/hub page has thin or no content → hub pages without real content don't rank. Flag and recommend adding a proper pillar page.
  • Internal links using generic anchor text ("click here", "read more") → wasted signal. Offer to rewrite anchor text patterns.
  • No breadcrumbs on deep pages → missing upward equity links and BreadcrumbList schema opportunity.
  • Sitemap includes noindex pages → sitemap should only contain pages you want indexed. Flag and offer to filter.
  • Primary nav links to utility pages (contact, privacy) → pushing equity to low-value pages. Nav should prioritize money/content pages.

无需用户询问即可主动指出以下问题:
  • 距离首页超过3次点击的页面 → 标记为抓取权重风险。任何需要点击4次以上才能到达的页面都需要结构上的快捷方式。
  • 分类/hub页面内容单薄或无内容 → 无真实内容的hub页面无法排名。标记并建议添加合适的核心页面。
  • 内部链接使用通用锚文本(如“click here”、“read more”) → 浪费信号价值。提供锚文本模式重写建议。
  • 深层页面无Breadcrumbs → 错失向上权重链接和BreadcrumbList schema机会。
  • 站点地图包含noindex页面 → 站点地图应仅包含您希望被索引的页面。标记并提供过滤建议。
  • 主导航链接到实用页面(联系我们、隐私政策) → 将权重导向低价值页面。导航应优先展示业务/内容页面。

Output Artifacts

输出成果

When you ask for...You get...
Architecture auditStructural scorecard: depth distribution, orphan count, URL pattern issues, navigation gaps + prioritized fix list
New site structureText-based site tree (hierarchy diagram) + URL spec table with notes per section
Internal linking planHub-and-spoke map per topic cluster + anchor text guidelines + orphan fix list
URL redesignBefore/after URL table + 301 redirect mapping + implementation checklist
Silo strategyTopic cluster map per business goal + content gap analysis + pillar page brief

请求内容交付成果
架构审计结构评分卡:深度分布、orphan pages数量、URL模式问题、导航缺口 + 优先级修复清单
新网站架构基于文本的站点树状图(层级示意图) + 带各板块说明的URL规范表
内部链接方案按主题集群划分的中心-分支图谱 + 锚文本指南 + orphan pages修复清单
URL重新设计URL前后对比表 + 301重定向映射 + 实施清单
Silo策略按业务目标划分的主题集群图谱 + 内容缺口分析 + 核心页面 brief

Communication

沟通规范

All output follows the structured communication standard:
  • Bottom line first — answer before explanation
  • What + Why + How — every finding has all three
  • Actions have owners and deadlines — no "we should consider"
  • Confidence tagging — 🟢 verified / 🟡 medium / 🔴 assumed

所有输出遵循结构化沟通标准:
  • 先讲结论 —— 先给出答案再解释
  • 内容包含:是什么+为什么+怎么做 —— 每个发现都涵盖这三点
  • 行动项明确负责人和截止日期 —— 避免“我们应该考虑”这类模糊表述
  • 置信度标记 —— 🟢 已验证 / 🟡 中等置信 / 🔴 假设性结论

Related Skills

相关技能

  • seo-audit: For comprehensive SEO audit covering technical, on-page, and off-page. Use seo-audit when architecture is one of several problem areas. NOT for deep structural redesign — use site-architecture.
  • schema-markup: For structured data implementation. Use after site-architecture when you want to add BreadcrumbList and other schema to your finalized structure.
  • content-strategy: For deciding what content to create. Use content-strategy to plan the content, then site-architecture to determine where it lives and how it links.
  • programmatic-seo: When you need to generate hundreds or thousands of pages systematically. Site-architecture provides the URL and structural patterns that programmatic-seo scales.
  • seo-audit: For identifying technical issues. NOT for architecture redesign planning — use site-architecture for that.
  • seo-audit:用于涵盖技术、页面内、站外的全面SEO审计。当架构是多个问题领域之一时使用seo-audit。不适用于深度结构重新设计——请使用site-architecture。
  • schema-markup:用于结构化数据实施。在site-architecture确定最终结构后,使用schema-markup添加BreadcrumbList和其他schema。
  • content-strategy:用于决定创建什么内容。先使用content-strategy规划内容,再使用site-architecture确定内容位置和链接方式。
  • programmatic-seo:当需要系统性生成数百或数千个页面时使用。site-architecture提供URL和结构模式,供programmatic-seo进行规模化扩展。
  • seo-audit:用于识别技术问题。不适用于架构重新设计规划——请使用site-architecture。