build-a-brand

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Build a Brand

构建品牌

Take any input — an idea, a website, a list of reference brands, product photos, or an existing brand to refresh — and produce a complete brand identity, ending in a 15-page brand guidelines PDF.
This is a standalone brand-building workflow focused on strategy, identity, design, and copy. Output is one brand guidelines PDF for the chosen identity.
基于任意输入——创意想法、网站、参考品牌列表、产品照片,或是需要焕新的现有品牌——生成完整的品牌标识,最终输出一份15页的品牌指南PDF。
这是一个专注于策略、标识、设计和文案的独立品牌构建工作流,输出为所选品牌标识对应的一份品牌指南PDF。

Full Workflow

完整工作流

Stage 0 — Intake (empty-args menu)

阶段0 — 需求收集(无参数菜单)

If invoked with no input (no idea, no URL, no photos, no reference brands, and no relevant prior context in the conversation), print this menu verbatim as your full response and stop. Do not call any tool. Wait for the user's next message.
What are we branding? Paste any of:
  • An idea / description — e.g. "a streetwear label for cat people"
  • A website URL — e.g. "rebrand my existing site at example.com"
  • Product photos — drop them in the chat
  • Reference brands — e.g. "I want something that feels like Aesop + Patagonia"
  • An existing brand to refresh — name + what's working / what isn't
If the user already dropped one of the above, skip the menu and proceed straight to Step 1.
如果调用时无任何输入(无创意、无URL、无照片、无参考品牌,对话中也无相关上下文),请完整打印以下菜单作为回复并停止操作,不要调用任何工具,等待用户的下一条消息。
我们要打造什么品牌? 请粘贴以下任意内容:
  • 创意想法/描述 — 例如:「面向猫奴的街头服饰品牌」
  • 网站URL — 例如:「为我现有的example.com网站重塑品牌」
  • 产品照片 — 直接在聊天中上传
  • 参考品牌 — 例如:「我想要类似Aesop + Patagonia风格的品牌」
  • 需要焕新的现有品牌 — 品牌名称 + 当前优势/待改进点
如果用户已提供上述内容之一,请跳过菜单直接进入步骤1。

Stage 0.5 — Non-interactive fast lane

阶段0.5 — 非交互式快速通道

Use this path when the caller passes
--quick
or
--config <path>
, or when the caller states they are running from CI, a subagent, a batch job, or any other non-interactive harness.
This section has precedence over the interactive ask/wait instructions below. When it applies, use this fast lane and do not fall through to the multi-turn intake unless a required input is truly missing.
  • --config <path>
    points to a JSON file that pre-bakes intake answers:
    input
    ,
    photos
    ,
    reference_brands
    ,
    audience
    ,
    positioning
    ,
    assets_to_keep
    ,
    references
    ,
    chosen_direction
    ,
    chosen_identity
    , and
    export_kit
    .
  • --quick
    means use model judgment for all confirmation gates. Ask only if the original input is missing entirely; otherwise infer reasonable defaults, choose the strongest strategy direction and identity option, and continue.
  • For
    --quick
    or
    --config
    , do not stop for confirmation at the deliverable preview, strategy-direction choice, identity-option choice, or brand-kit export gate. Record the assumption inline, then proceed.
  • If a required asset is unavailable and cannot be inferred from the input, stop once with a single compact missing-fields list instead of starting a multi-turn Q&A loop.
  • Do not deliver a condensed or partial brand output just because the caller is non-interactive or the run is short on wall-clock time. A condensed 6-page deck is not an acceptable substitute for the required 15-page guidelines. If the run is out of wall-clock budget, save a resumable checkpoint with the pages/assets already completed and stop; do not mark the workflow complete.
当调用者传入
--quick
--config <path>
参数,或是调用者说明正在从CI、子Agent、批处理作业或其他非交互式环境运行时,使用此路径。
此部分优先级高于下方的交互式询问/等待指令。适用时请使用快速通道,除非确实缺少必要输入,否则不要进入多轮需求收集流程。
  • --config <path>
    指向一个预填充需求答案的JSON文件,包含:
    input
    photos
    reference_brands
    audience
    positioning
    assets_to_keep
    references
    chosen_direction
    chosen_identity
    export_kit
  • --quick
    表示对所有确认环节使用模型判断。仅当原始输入完全缺失时才询问,否则推断合理默认值,选择最有力的策略方向和标识选项并继续推进。
  • 对于
    --quick
    --config
    模式,在交付预览、策略方向选择、标识选项选择或品牌工具包导出环节均无需停止等待确认。需在流程中记录假设,然后继续推进。
  • 如果必要资产不可用且无法从输入中推断,仅需一次性列出缺失字段,无需启动多轮问答循环。
  • 不要因为调用者处于非交互式环境或运行时间有限就交付精简版或不完整的品牌输出。6页的精简演示文稿不能替代要求的15页指南。如果运行时间耗尽,请保存包含已完成页面/资产URL以及下一步渲染步骤的可恢复检查点,然后停止操作,不要标记工作流已完成。

Step 1 — Read the Input

步骤1 — 读取输入

Inputs vary. Before asking any questions, open with a brief agenda so the user knows what's coming:
here's how this works — 5 steps:
1. **Read the input** — i ask a few questions, you answer, i play back what i'm hearing
2. **Strategy directions** — 2-3 distinct positioning angles to choose from
3. **Identity options** — 3 full brand identities (name, colors, voice, brand board PDF preview) within your chosen direction
4. **Build the guidelines** — full 15-page brand book PDF for the chosen identity
5. **Export the brand kit** — once you're happy with the guidelines, i'll bundle a `brand.md` spec + logo assets (transparent PNG + PDF wrapper for every mark; SVG when vector-authored) in each brand color as a zip you can use anywhere

let's start. [questions follow]
Then ask 3-5 targeted questions in a single message. Adapt to the input type:
If they dropped an idea / description:
  • What does this brand sell or do? (product / service / app / community / something else)
  • Who is this for — describe the 2-3 audience segments this brand should serve, plus one vivid anchor persona inside the primary segment
  • Why does this exist? what's broken about the alternatives, or what feeling are you trying to deliver?
  • Do you have a name in mind, or is naming part of what you want help with?
If they dropped a website / existing brand URL:
  • Are we refreshing this brand or rebuilding it from scratch?
  • What's working about it today, and what isn't?
  • Who's the current customer vs. who you wish were the customer?
If they dropped product photos:
  • Ask how the product is made, who has bought or used it, price point, current sales/channel context, and any direction they already have in mind.
  • Keep the output scoped to guidelines and a brand kit, not a commerce launch.
If they dropped reference brands only ("I want a brand that feels like Aesop + Patagonia"):
  • What's the product, service, or thing this brand will be attached to?
  • What about each reference brand specifically do you love? (the photography? the tone? the restraint?)
  • Who buys this — describe the 2-3 audience segments this brand should serve, plus one vivid anchor persona inside the primary segment.
  • Any constraints? (industry, regulation, location, price tier?)
Always also ask (regardless of input type):
  • Do you have any existing brand assets you want to keep or incorporate? (Logo, wordmark, symbol, name, colors, fonts, photography, packaging — anything you don't want to lose.)
  • Any specific references, inspirations, or moodboards you'd want this to draw from?
These two are essential — they prevent you from generating things the user already has, and they anchor the work in references the user actually likes. Always include them.
Keep it to a single message. Aim for 5-7 questions total (input-specific + the 2 universal), conversational not clinical. Wait for answers before proceeding.
After answers, analyze the input + answers together and read back:
  • Aesthetic territory: what visual world does this live in?
  • Audience segments: primary segment, secondary segment(s), and one vivid anchor persona inside the primary segment. Do not collapse the audience into one over-specific individual.
  • Positioning: what's the wedge — what does this stand for that competitors don't?
  • Price tier / category fit: where on the market shelf does this sit?
  • Story hook: what's the emotional reason someone cares?
  • Assets being kept: explicitly list what the user said they want to preserve (existing wordmark, name, colors, etc.)
  • References anchoring the work: list the user-named inspirations.
Then preview the deliverable and invite specific guidance — before moving to brand directions, show the user what'll be in the final guidelines so they can flag anything to add, change, or call out:
here's what i'll build into the 15-page brand guidelines:

1. Cover (brand name, tagline, hero mood)
2. Strategy & positioning (primary/secondary audience segments + anchor persona)
3. Brand foundation (mission, values, story)
4. Logo (wordmark + symbol + variants)
5. Logo don'ts
6. Color palette
7. Typography
8. Icons (UI icon system + library guidance)
9. Voice & tone
10. Imagery rules (photography and/or illustration, adapted to brand medium)
11. Visual world / lifestyle imagery
12. Touchpoints (real photos showing the brand in use)
13. Brand applications (mockups: business card, app icon, favicon, etc.)
14. Digital + social
15. Do & don't

plus a brand kit zip at the end with: `brand.md` spec, logo assets (transparent PNG + PDF wrapper for every mark; SVG when vector-authored), design tokens (CSS / JSON / Tailwind), AI prompts (system prompt + task-specific starters), and the icon SVGs.

anything you want to add, change, call out specifically, or want me to handle differently? if not, i'll move on to brand directions.
Wait for response. Incorporate any specific user guidance (add a page, swap something, special focus on a particular section, exclude something) before moving to Step 2. This catches scope mismatches early — much cheaper than discovering them after the PDF is built.
输入形式多样。在提问前,先简要说明流程,让用户了解后续步骤:
here's how this works — 5 steps:
1. **Read the input** — i ask a few questions, you answer, i play back what i'm hearing
2. **Strategy directions** — 2-3 distinct positioning angles to choose from
3. **Identity options** — 3 full brand identities (name, colors, voice, brand board PDF preview) within your chosen direction
4. **Build the guidelines** — full 15-page brand book PDF for the chosen identity
5. **Export the brand kit** — once you're happy with the guidelines, i'll bundle a `brand.md` spec + logo assets (transparent PNG + PDF wrapper for every mark; SVG when vector-authored) in each brand color as a zip you can use anywhere

let's start. [questions follow]
然后在一条消息中提出3-5个针对性问题,根据输入类型调整:
如果用户提供创意想法/描述:
  • 该品牌销售或提供什么?(产品/服务/应用/社区/其他)
  • 目标受众是谁?请描述2-3个受众细分群体,以及核心细分群体中的一个典型人物画像
  • 品牌创立的初衷是什么?现有竞品存在哪些不足,或是你希望传递何种感受?
  • 你是否已有品牌名称构想,还是需要我们协助命名?
如果用户提供网站/现有品牌URL:
  • 我们是要焕新该品牌还是从零开始重建?
  • 当前品牌的优势是什么,待改进点是什么?
  • 当前客户群体是谁,你期望的客户群体是谁?
如果用户上传产品照片:
  • 询问产品的制作方式、已购买/使用人群、价格区间、当前销售渠道,以及用户已有的方向构想。
  • 输出范围仅限品牌指南和品牌工具包,不包含电商上线内容。
如果用户仅提供参考品牌(例如「我想要类似Aesop + Patagonia风格的品牌」):
  • 该品牌将依附于什么产品、服务或事物?
  • 你喜欢每个参考品牌的哪些具体点?(摄影风格?语调?简约风格?)
  • 目标客户是谁?请描述2-3个受众细分群体,以及核心细分群体中的一个典型人物画像。
  • 是否有任何限制条件?(行业、法规、地域、价格层级?)
无论输入类型如何,始终询问:
  • 你是否有想要保留或融入的现有品牌资产?(Logo、文字标志、符号、名称、配色、字体、摄影素材、包装——任何你不想舍弃的内容。)
  • 是否有特定的参考素材、灵感来源或情绪板希望我们参考?
这两个问题至关重要——它们可以避免生成用户已有的内容,并确保工作基于用户真正喜欢的参考素材。务必包含这两个问题。
将所有问题整合为一条消息,总共5-7个问题(针对输入类型的问题+2个通用问题),语气要口语化而非生硬。等待用户回答后再继续。
收到回答后,结合输入和回答进行分析并反馈:
  • 美学领域:该品牌属于何种视觉风格范畴?
  • 受众细分:核心群体、次要群体,以及核心群体中的一个典型人物画像。不要将受众简化为一个过于具体的个体。
  • 品牌定位:差异化优势——该品牌主张的是竞争对手没有的什么?
  • 价格层级/品类适配:在市场中处于什么位置?
  • 故事切入点:用户关注该品牌的情感动因是什么?
  • 保留的资产:明确列出用户希望保留的内容(现有文字标志、名称、配色等)
  • 参考素材:列出用户指定的灵感来源
然后预览交付内容并邀请用户给出具体指导——在进入品牌方向阶段前,向用户展示最终指南的内容,以便他们标记需要添加、修改或特别关注的部分:
here's what i'll build into the 15-page brand guidelines:

1. Cover (brand name, tagline, hero mood)
2. Strategy & positioning (primary/secondary audience segments + anchor persona)
3. Brand foundation (mission, values, story)
4. Logo (wordmark + symbol + variants)
5. Logo don'ts
6. Color palette
7. Typography
8. Icons (UI icon system + library guidance)
9. Voice & tone
10. Imagery rules (photography and/or illustration, adapted to brand medium)
11. Visual world / lifestyle imagery
12. Touchpoints (real photos showing the brand in use)
13. Brand applications (mockups: business card, app icon, favicon, etc.)
14. Digital + social
15. Do & don't

plus a brand kit zip at the end with: `brand.md` spec, logo assets (transparent PNG + PDF wrapper for every mark; SVG when vector-authored), design tokens (CSS / JSON / Tailwind), AI prompts (system prompt + task-specific starters), and the icon SVGs.

anything you want to add, change, call out specifically, or want me to handle differently? if not, i'll move on to brand directions.
等待用户回复。在进入步骤2前,整合用户的所有具体指导(添加页面、替换内容、特别关注某一章节、排除某部分)。这可以提前发现范围偏差——比PDF生成后再修改成本低得多。

Step 2 — Present 2-3 Brand Directions

步骤2 — 呈现2-3个品牌方向

Based on your read, present 2-3 distinct brand directions. See
references/brand-directions.md
for structure.
Each direction must be a genuinely different business answer — not aesthetic variations. Differentiate on WHO and WHY, not WHAT.
Ask the user to pick one direction before proceeding.
基于你的分析,呈现2-3个截然不同的品牌方向。结构参考
references/brand-directions.md
每个方向必须是真正不同的商业定位,而非美学变体。差异应体现在目标人群和品牌价值主张上,而非视觉表现。
请用户选择一个方向后再继续推进。

Step 3 — Generate 3 Brand Identity Options

步骤3 — 生成3个品牌标识选项

Once they choose a direction, generate 3 complete brand identity options within that direction. See
references/brand-identity.md
for structure.
Each option includes: name + tagline, color palette, typography direction, voice & tone, logo concept, brand story, photography direction (product + lifestyle/mood), UI/website direction, example brands.
After presenting all 3 in text, build a 3-page brand board PDF (one page per option) so the user can see each identity before committing.
Each option must include:
  • Wordmark in the brand's display font — use the user's existing wordmark if they have one they like; propose a new one if they need a logo or don't like their current one. A new wordmark must have custom letter treatment: adjusted spacing, ligature, cut, terminal, case, underline, or other ownable detail. It is not just a Google Font typed in a color.
  • Symbol/mark — a standalone graphic that lives without the wordmark. Use the user's existing symbol if they have one they like; propose a new one otherwise. Even if the user keeps their wordmark, propose a symbol if they don't have one — favicons and app icons need a non-typographic mark. For new symbols, default to a generated PNG via
    mcp__pika__generate_image
    with
    provider="gpt-image-2"
    when visual quality, texture, detail, or originality matters. Ask for a clean isolated mark on transparent background, no baked-in letters, no watermark, no mockup, centered in a square. After generating, run the PNG through
    mcp__pika__remove_background
    with
    mode="logo"
    to guarantee a real transparent background before compositing it onto any colored or photo background — generated symbols often come back with a baked-in white background that becomes a white box at composite time. (Background removal strips only the outer background; design any interior negative space to be open at generation time, since the cleanup pass won't carve it out.) Use inline SVG only if the mark is intentionally simple, can be drawn cleanly by hand, and passes small-size QA. Must work at 16×16 AND 512×512.
  • Seal / badge — if the option uses a seal, stamp, badge, or monogram, it must be readable and ownable at small and medium sizes. It cannot be a generic circular font lockup, clip-art crest, or low-contrast decorative filler.
  • Tagline (8 words max)
  • Voice sample with visible "VOICE" label (one quoted sentence, 14 words max)
  • Compact board story (~35 words max, min 2 sentences)
  • Lifestyle world description with visible "WORLD" label (~22 words max, min 1 full sentence)
  • Lifestyle mood image (generated via gpt-image-2)
  • 4-color palette with hex + role labels
  • Display + body type specimens with named fonts
Brand board pages should look different enough that the user can tell which identity they are seeing before reading the labels. Don't use the same template recolored 3 times; each board's layout should embody the option's design philosophy. A magazine-cover option should look like a magazine cover (full-bleed photo, masthead-style); a soft consumer option should look like a homepage hero (rounded shapes, soft circles for swatches); an editorial option should look like a literary spread (huge italic centered, inset photo). See
references/brand-guidelines.md
"Brand Board Layout — Differentiate per Option" for examples.
If you can't physically tell which brand you're looking at without reading the labels — regenerate.
Page size: 1200×850px. Renderer: Pika MCP
mcp__pika__html_to_pdf
for PDF output and
mcp__pika__html_to_png
for PNG QA previews. Both use server-side Chromium; do not run local WeasyPrint or Chrome headless by default. The delivered PDF and QA PNGs must come from the MCP render path. If a local/sandbox/browser fallback is used for debugging, discard that output, rerender through MCP, and QA the final MCP PNGs. If the MCP and fallback renders disagree, fix the HTML/CSS; do not ship the fallback render.
Board quality gate: the 3-page preview must pass visual QA, not just render QA.
  1. Inspect each PNG preview before sending the PDF. Fail ugly density, weak hierarchy, muddy one-note palette, unreadable small text, empty mockup/image slots, clipped text, and body copy or non-masthead text intersecting icons, swatches, seals, photos, phone mockups, or decorative rules. Also inspect the symbol at 16×16 and 512×512; if the small-size read is illegible, muddy, too generic, or collapses into noise, regenerate or simplify before delivery. Website/social/app mockups are optional on brand boards; do not add them unless they contain real content. If included, flat color rectangles count as empty placeholders unless the section is explicitly a palette specimen.
  2. Run
    mcp__pika__analyze_media
    on each board PNG using the exact PASS/FAIL prompt in
    references/brand-guidelines.md
    "Full-Deck Visual QA" — its low-contrast clause is load-bearing (without it the model rates low-contrast text "legible" and PASSes a real defect). Ask about all issues listed above in addition to that prompt's checks. Masthead wordmark/tagline/issue metadata overlays on photos are allowed only when they use deliberate negative space or a contrast scrim and pass contrast QA as defined in
    references/brand-guidelines.md
    Rule 5. Body copy on images is still forbidden.
  3. Handle tool state before interpreting the result. If
    mcp__pika__analyze_media
    returns
    {task_id, status: "running"}
    , poll
    mcp__pika__task_status({task_id})
    until terminal. Treat the QA as unavailable and halt with a manual-review warning if the tool is missing, raises
    tool_not_found
    ,
    provider_unavailable
    ,
    unsupported_media_type
    ,
    rate_limited
    ,
    quota_exceeded
    ,
    auth_error
    , any HTTP 4xx/5xx error envelope, or a transport error, says it cannot analyze the image, or returns final text that does not match
    /^\s*[`*]{0,2}(PASS|FAIL)\b/
    .
  4. Interpret the result with that regex only. Fix every captured FAIL before delivery. If the captured result is PASS but the explanation lists a blocking collision, clipping, unreadable text, or missing required board content, treat it as FAIL. Do not proceed silently.
Font rule: fresh fonts per brand AND fonts must have character. Never default to Inter / Karla / Outfit / DM Sans / Lato — they have no point of view as a display face. Explore the full Google Fonts library. See
references/brand-guidelines.md
"Must Have Character — Don't Default to Safe Fonts" for approved high-character options by vibe (Fraunces, Instrument Serif, Bricolage Grotesque, Funnel Display, Bodoni Moda, Reddit Mono, etc.). For MCP rendering, use HTTPS font URLs or inline
data:font/...
sources; local
file://
font paths are not available server-side.
Do not build the full 15-page guidelines PDF until the user picks an option — that wastes time on rejected identities.
Present all 3 clearly. Ask the user to pick one, or mix elements from different options.
用户选定方向后,在该方向下生成3个完整的品牌标识选项。结构参考
references/brand-identity.md
每个选项包含:品牌名称+标语、配色方案、字体方向、语气语调、Logo概念、品牌故事、摄影方向(产品+生活方式/情绪)、UI/网站方向、参考品牌。
以文字形式呈现所有3个选项后,制作一份3页的品牌看板PDF(每个选项一页),让用户在确认前直观看到每个标识。
每个选项必须包含:
  • 文字标志:使用品牌的展示字体——如果用户已有喜欢的文字标志则沿用;如果用户需要Logo或不满意现有标志则提出新方案。新文字标志必须有自定义字母处理:调整间距、连字、切割、末端样式、大小写、下划线或其他独特细节,不能只是用Google Fonts输入的彩色文字。
  • 符号/标志图形:可独立于文字标志使用的图形。如果用户已有喜欢的符号则沿用;否则提出新方案。即使用户保留文字标志,如果没有符号也需提出新方案——网站图标和应用图标需要非文字类标志。对于新符号,当视觉质量、纹理、细节或原创性很重要时,默认通过
    mcp__pika__generate_image
    工具,使用
    provider="gpt-image-2"
    生成PNG图片。要求生成透明背景的干净独立标志,无内置文字、无水印、无样机,居中放置在正方形画布中。生成后,将PNG通过
    mcp__pika__remove_background
    工具,使用
    mode="logo"
    处理,确保背景真正透明后再合成到彩色或照片背景上——生成的符号通常会带有内置白色背景,合成时会变成白框。(背景移除仅去除外部背景;生成时需设计开放的内部负空间,因为清理步骤不会雕刻内部空间。)仅当标志故意设计得简单、可手绘且在小尺寸下清晰可读时,才使用内嵌SVG。标志必须在16×16和512×512尺寸下均清晰可读。
  • 印章/徽章:如果选项使用印章、图章、徽章或字母组合,必须在中小尺寸下清晰可读且具有独特性,不能是通用的圆形字体组合、剪贴画纹章或低对比度装饰元素。
  • 标语(最多8个单词)
  • 语气示例(标注「VOICE」标签,引用一句不超过14个单词的句子)
  • 简洁的看板说明(最多约35个单词,至少2句话)
  • 生活方式风格描述(标注「WORLD」标签,最多约22个单词,至少1个完整句子)
  • 生活方式情绪图片(通过gpt-image-2生成)
  • 4色配色方案,包含十六进制代码+角色标签
  • 展示字体和正文字体样本,标注字体名称
品牌看板页面应差异足够大,用户无需阅读标签就能区分不同标识。不要使用同一模板仅更换配色;每个看板的布局应体现对应选项的设计理念。杂志封面风格的选项应看起来像杂志封面(全屏 bleed 照片、报头样式);柔和消费品牌选项应看起来像主页 hero 区(圆角形状、柔和圆形色卡);编辑风格选项应看起来像文学排版(超大斜体居中、嵌入照片)。布局示例参考
references/brand-guidelines.md
中的「Brand Board Layout — Differentiate per Option」部分。
如果不阅读标签就无法区分是哪个品牌,请重新生成。
页面尺寸:1200×850px。渲染工具:使用Pika MCP的
mcp__pika__html_to_pdf
生成PDF输出,使用
mcp__pika__html_to_png
生成PNG预览用于QA。两者均使用服务器端Chromium;默认不使用本地WeasyPrint或无头Chrome。交付的PDF和QA PNG必须来自MCP渲染路径。如果使用本地/沙箱/浏览器进行调试,需丢弃该输出,通过MCP重新渲染,并对最终MCP生成的PNG进行QA。如果MCP和备用渲染结果不一致,需修复HTML/CSS;不要交付备用渲染结果。
看板质量检查:3页预览必须通过视觉QA,而非仅渲染QA。
  1. 在发送PDF前检查每个PNG预览。拒绝低分辨率、层次薄弱、单调浑浊配色、小文本不可读、空样机/图片槽、文字截断、正文或非报头文本与图标、色卡、印章、照片、手机样机或装饰线条相交的情况。同时检查符号在16×16和512×512尺寸下的效果;如果小尺寸下不可读、模糊、过于通用或变成噪点,需重新生成或简化后再交付。品牌看板上的网站/社交/应用样机为可选;仅当包含真实内容时才添加。如果添加,纯色矩形仅在明确作为色卡样本时才有效,否则视为空占位符。
  2. 使用
    references/brand-guidelines.md
    中「Full-Deck Visual QA」部分的精确PASS/FAIL提示,通过
    mcp__pika__analyze_media
    工具分析每个看板PNG——其中的低对比度条款至关重要(没有它,模型会将低对比度文本判定为「可读」并通过PASS,而这实际是缺陷)。除了提示中的检查项,还要检查上述所有问题。仅当使用刻意的负空间或对比遮罩并通过
    references/brand-guidelines.md
    规则5中定义的对比度QA时,才允许照片上叠加报头文字标志/标语/元数据。图片上的正文仍然禁止使用。
  3. 解释结果前先处理工具状态。如果
    mcp__pika__analyze_media
    返回
    {task_id, status: "running"}
    ,需轮询
    mcp__pika__task_status({task_id})
    直到任务完成。如果工具缺失、返回
    tool_not_found
    provider_unavailable
    unsupported_media_type
    rate_limited
    quota_exceeded
    auth_error
    、任何HTTP 4xx/5xx错误、传输错误,表示无法分析图片,或返回的最终文本不匹配正则表达式
    /^\s*[`*]{0,2}(PASS|FAIL)\b/
    ,则视为QA不可用,停止操作并发出人工审核警告。
  4. 仅使用上述正则表达式解释结果。交付前修复所有捕获到的FAIL项。如果捕获结果为PASS,但说明中列出了阻塞性冲突、截断、不可读文本或缺失看板必要内容,仍视为FAIL。不要静默继续。
字体规则:每个品牌使用全新字体,且字体必须具有特色。绝不要默认使用Inter / Karla / Outfit / DM Sans / Lato——这些字体作为展示字体没有独特性。探索完整的Google Fonts库。按风格划分的高特色可选字体参考
references/brand-guidelines.md
中的「Must Have Character — Don't Default to Safe Fonts」部分(如Fraunces、Instrument Serif、Bricolage Grotesque、Funnel Display、Bodoni Moda、Reddit Mono等)。对于MCP渲染,使用HTTPS字体URL或内嵌
data:font/...
资源;服务器端无法访问本地
file://
字体路径。
在用户选定选项前不要制作完整的15页指南PDF——这会浪费时间在被拒绝的标识上。
清晰呈现所有3个选项,请用户选择一个,或混合不同选项的元素。

Step 4 — Build the Full Brand Guidelines PDF

步骤4 — 制作完整品牌指南PDF

Once user confirms an identity, build the 15-page brand guidelines PDF.
Page structure (15 pages; becomes 16 if hybrid imagery split):
  1. Cover — Brand name, tagline, hero mood image. Full-bleed.
  2. Strategy & Positioning — Direction name. Positioning statement (one punchy sentence). Target audience system: primary segment, secondary segment(s), and one vivid anchor persona inside the primary segment. Do not describe only one over-specific customer. 3-4 reference brands with "borrow this" notes.
  3. Brand Foundation — Mission. Brand values (3-5). The "why this exists" story (2-3 paragraphs of real copy in brand voice — not a template).
  4. Logo — Primary mark + all variants (horizontal, icon-only, reversed), usage rules (on dark / on light / on color), logo mark explanation, clear space rule.
  5. Logo Don'ts — Explicit misuse rendered in CSS: never stretch, never rotate, never wrong background, never recolor, never use drop shadow. Show each violation visually with a ✗ label.
  6. Color — All swatches with hex + RGB + CMYK, primary pairings, accessibility/contrast note, never-do combinations. Full-bleed color columns, not swatches floating on white.
  7. Typography — Full hierarchy (H1 through caption with exact px sizes), display/accent/body fonts, usage rules per context, type on color backgrounds, minimum sizes.
  8. Icons — UI icon system: 8-12 essential icons (arrow-right, check, close, plus, settings, search, user, bell, menu, info, etc.) rendered in the brand's geometric style + stroke/corner/grid rules + library recommendation for icons beyond the set. See
    references/brand-guidelines.md
    "Icons Page — Structure & Rules" section.
  9. Voice & Tone — Tone adjectives, copy examples by context (headline, body, button, error state, social caption), forbidden words/phrases. Show actual brand copy, not generic example copy.
  10. Imagery Rules — adapts to the brand's medium. Photography-led → photography rules (subject/light/color/cast/texture/forbidden + 1 example photo). Illustration-led → illustration rules (style/color/line/character/composition/forbidden + 1 example illustration). Hybrid (both equally) → split into two pages, guidelines becomes 16 pages. See
    references/brand-guidelines.md
    "Imagery Rules Page — Adapts per Brand."
  11. Visual World — Full-bleed 4-column grid of 4 images matching the brand's medium mix (all photos, all illustrations, or mixed). Cast must be racially diverse for any people-featuring images.
  12. Touchpoints — Real Photos — A 2×2 grid of 4 REAL GENERATED PHOTOGRAPHS showing the brand in physical context. Adapt to the brand type:
    • Physical product brand: hang tag on garment, woven label macro, kraft mailer with tissue, flat lay of product + packaging
    • Digital / app brand: phone in hand showing the app, laptop on desk showing the site, sticker on water bottle, tote bag in a real scene
    • Service brand: business card in hand, branded notebook on desk, signage on a building, swag in context No CSS vector mockups on this page — without real generated photos the touchpoints look like a Figma exercise, not a brand. Real images prove the brand can survive contact with the physical world.
  13. Brand Applications — CSS Mockups — CSS-rendered mockups of secondary applications, each labeled with specs: business card (with dimensions), social avatar (circle crop), sticker/app icon (rounded square), email signature, presentation cover slide. For product brands also include: hang tag spec, woven label spec, shopping bag spec.
  14. Digital / Social — Website hero aesthetic (colors, fonts, layout feel), Instagram grid style (3×3 mockup with color palette + caption tone), story template (brand colors + logo placement), link-in-bio layout.
  15. Do & Don't — 5 dos and 5 don'ts, brand-specific and actionable. Not generic ("do use the logo correctly") — brand-specific ("do leave a full em-dash of space around the wordmark in social posts; never crop our tagline mid-word").
Keep Page 2 (strategy) before Page 3 (foundation), and keep both before logo/color/type. Strategy frames every visual decision that follows.
Include every page in the structure. If the brand has no packaging, adapt the touchpoints page to the brand type instead of skipping it; the guidelines should still show how the identity survives in real contexts.
Completion gate: do not deliver a condensed or partial guidelines PDF. A condensed 6-page deck, missing Visual World page, missing Touchpoints page, or missing Brand Applications page is a failed checkpoint, not a final deliverable. If time runs out, stop with a resumable checkpoint that lists completed pages, missing pages, generated asset URLs, and the next render step. Do not present the deck as done until all mandatory pages have rendered and passed QA.
All build rules in
references/brand-guidelines.md
apply: server-side Chromium render contract, explicit 1200×850 page dimensions, HTTPS/data-URI assets, no load-bearing text on generated images, no duplicate generated images across deck, text contrast thresholds on dark backgrounds, and mandatory pre-send QA previews for every page.
Deliver as PDF.
mcp__pika__html_to_pdf
returns a CDN
file_url
; give the user that URL. If exporting a local kit later, download that
file_url
into
~/Desktop/[brand-name]-brand-guidelines.pdf
or the kit folder as
brand-guidelines.pdf
. Do not use
mcp__pika__upload_asset
for PDFs; that tool still only accepts images/audio/video.
用户确认标识后,制作15页的品牌指南PDF。
页面结构(15页;如果是混合图像则分为两页,变为16页):
  1. 封面 — 品牌名称、标语、主情绪图片,全屏bleed。
  2. 策略与定位 — 方向名称。定位声明(一句有力的话)。目标受众体系:核心群体、次要群体,以及核心群体中的一个典型人物画像。不要仅描述一个过于具体的客户。3-4个参考品牌,附带「可借鉴点」说明。
  3. 品牌基础 — 使命。品牌价值观(3-5个)。「品牌创立初衷」故事(2-3段符合品牌语气的真实文案,而非模板)。
  4. Logo — 主标志+所有变体(横向、仅图标、反向),使用规则(深色背景/浅色背景/彩色背景),标志图形说明,留白规则。
  5. Logo禁用规范 — 使用CSS渲染明确的错误用法:绝不要拉伸、旋转、使用错误背景、重新着色、添加阴影。每个违规示例都标注✗。
  6. 配色 — 所有色卡,包含十六进制+RGB+CMYK代码,主色搭配方案,可访问性/对比度说明,禁用组合。全屏bleed配色栏,而非白色背景上的浮动色卡。
  7. 字体 — 完整层级(H1到说明文字,含精确像素尺寸),展示字体/强调字体/正文字体,不同场景的使用规则,彩色背景上的字体效果,最小尺寸。
  8. 图标 — UI图标系统:8-12个必备图标(右箭头、勾选、关闭、加号、设置、搜索、用户、铃铛、菜单、信息等),采用品牌的几何风格,附带线条/圆角/网格规则,以及超出该图标集的图标库推荐。结构和规则参考
    references/brand-guidelines.md
    中的「Icons Page — Structure & Rules」部分。
  9. 语气语调 — 语气形容词,不同场景的文案示例(标题、正文、按钮、错误状态、社交文案),禁用词汇/短语。展示真实的品牌文案,而非通用示例文案。
  10. 图像规则 — 适配品牌媒介。摄影主导→摄影规则(主题/光线/色彩/色调/纹理/禁用内容+1张示例照片)。插画主导→插画规则(风格/色彩/线条/角色/构图/禁用内容+1张示例插画)。混合风格(两者同等重要)→分为两页,指南变为16页。参考
    references/brand-guidelines.md
    中的「Imagery Rules Page — Adapts per Brand」部分。
  11. 视觉风格 — 全屏bleed的4列网格,包含4张匹配品牌媒介组合的图片(全照片、全插画或混合)。如果图片包含人物,必须体现种族多样性。
  12. 触点——真实照片 — 2×2网格,包含4张真实生成的照片,展示品牌在真实场景中的应用。适配品牌类型:
    • 实体产品品牌:服装吊牌、织唛特写、牛皮纸快递袋+纸巾、产品+包装平铺
    • 数字/应用品牌:手持手机展示应用、桌面笔记本电脑展示网站、水瓶贴纸、真实场景中的手提袋
    • 服务品牌:手持名片、桌面品牌笔记本、建筑标识、场景中的周边产品 此页禁止使用CSS矢量样机——没有真实生成的照片,触点页面会看起来像Figma练习,而非真实品牌。真实图片证明品牌能在现实世界中落地。
  13. 品牌应用——CSS样机 — CSS渲染的次要应用样机,每个标注规格:名片(含尺寸)、社交头像(圆形裁剪)、贴纸/应用图标(圆角方形)、邮件签名、演示文稿封面。对于产品品牌,还需包含:吊牌规格、织唛规格、购物袋规格。
  14. 数字与社交 — 网站hero区风格(配色、字体、布局感觉)、Instagram网格风格(3×3样机+配色方案+文案语气)、故事模板(品牌配色+Logo位置)、链接简介页布局。
  15. 规范与禁忌 — 5条规范和5条禁忌,必须针对品牌且具有可操作性,不能是通用内容(如「正确使用Logo」),而要具体到品牌(如「社交帖子中文字标志周围需保留一个全破折号的留白;绝不要截断我们的标语」)。
保持第2页(策略)在第3页(基础)之前,且两者都在Logo/配色/字体页面之前。策略是后续所有视觉决策的框架。
包含结构中的所有页面。如果品牌没有包装,需根据品牌类型调整触点页面,而非跳过;指南仍需展示标识如何在真实场景中落地。
完成检查:不要交付精简版或不完整的指南PDF。6页的精简演示文稿、缺失视觉风格页、缺失触点页或缺失品牌应用页均视为检查失败,而非最终交付物。如果时间耗尽,请停止操作并保存可恢复检查点,列出已完成页面、缺失页面、生成资产URL以及下一步渲染步骤。所有必填页面都已渲染并通过QA后,才能将演示文稿标记为完成。
references/brand-guidelines.md
中的所有构建规则均适用:服务器端Chromium渲染协议、明确的1200×850页面尺寸、HTTPS/data-URI资产、生成图像上无关键文本、演示文稿中无重复生成图像、深色背景上的文本对比度阈值、发送前每个页面必须进行QA预览。
交付格式为PDF
mcp__pika__html_to_pdf
返回CDN
file_url
;将该URL提供给用户。如果后续导出本地工具包,需将该
file_url
下载到
~/Desktop/[brand-name]-brand-guidelines.pdf
或工具包文件夹中,命名为
brand-guidelines.pdf
。不要使用
mcp__pika__upload_asset
上传PDF;该工具目前仅支持图片/音频/视频。

Step 5 — Export the Brand Kit (after user confirms guidelines)

步骤5 — 导出品牌工具包(用户确认指南后)

After delivering the 15-page guidelines PDF, wait for explicit user confirmation that they're happy with the brand. Don't auto-export — the kit codifies the final brand, so only build it once the brand is locked.
Then build a comprehensive brand kit zip that lets the user produce on-brand work anywhere — in Claude, GPT, Figma, with a designer, with a developer.
Kit structure:
[brand-name]-brand-kit.zip
├── brand.md                       # comprehensive machine-readable spec
├── brand-guidelines.pdf           # full 15-page guidelines PDF (the visual deliverable)
├── README.md                      # 1-page how-to-use guide
├── logo/
│   ├── symbol/                    # standalone mark, one set per color variant
│   │   ├── symbol-[color].png     # transparent background, 1024×1024+
│   │   ├── symbol-[color].svg     # only when vector-authored
│   │   └── symbol-[color].pdf     # vector PDF or raster PDF wrapper
│   ├── wordmark/                  # the brand name styled
│   │   └── wordmark-[color].{svg,png,pdf}
│   └── lockup/                    # symbol + wordmark together
│       ├── horizontal/
│       │   └── lockup-h-[color].{png,pdf} + svg when fully vector-authored
│       └── stacked/
│           └── lockup-s-[color].{png,pdf} + svg when fully vector-authored
├── icons/                         # the 12 UI icons from page 8 as SVGs
│   ├── arrow-right.svg
│   ├── check.svg
│   ├── close.svg
│   ├── plus.svg
│   ├── search.svg
│   ├── user.svg
│   ├── settings.svg
│   ├── bell.svg
│   ├── menu.svg
│   ├── info.svg
│   └── [+ any brand-specific icons]
├── fonts/                         # actual TTF font files (OFL-licensed Google Fonts)
│   ├── [display-font]-Variable.ttf
│   ├── [body-font]-Variable.ttf
│   └── README.md                  # license + install instructions
├── tokens/                        # design tokens for devs
│   ├── tokens.css                 # CSS custom properties — paste into :root
│   ├── tokens.json                # same content in JSON — for AI tools / CI
│   └── tailwind.config.snippet.js # paste into tailwind.config.js extend block
└── prompts/                       # AI prompts for downstream brand use
    ├── system-prompt.md           # paste at the top of a Claude/GPT thread for brand voice
    ├── tweet.md                   # task-specific starter: write a tweet
    ├── landing-hero.md            # task-specific starter: landing page hero copy
    ├── email.md                   # task-specific starter: marketing/transactional email
    ├── error-message.md           # task-specific starter: write a friendly error
    ├── photography.md             # task starter: generate brand-style photography (with cliché guardrails + brand-photography rules embedded)
    └── illustration.md            # task starter: generate brand-style illustration (only if the brand uses illustration as a medium)
Color variants to export (per logo): primary-on-light, primary-on-dark, neutral-on-light (ink), neutral-on-dark (cream), and one accent-on-color combination. Usually 4-5 color sets per logo type.
brand.md — see
references/brand-md-template.md
for the full structure. It must include:
  • Quick reference block (name, tagline, primary color, fonts, voice in one scannable section)
  • Positioning + audience segments
  • Mission, values, story
  • Voice & tone (adjectives, copy examples by context, forbidden words)
  • Colors (table with hex / RGB / CMYK / Pantone / role)
  • Typography (display + body + Google Fonts URLs + full hierarchy)
  • Logo (wordmark description + symbol description + lockup specs + file list with paths)
  • Photography rules
  • Visual world description
  • Touchpoint specs
  • Do & don't list
  • Reference brands with "borrow this" notes
  • How-to-use section telling downstream tools/people how to apply the spec
Asset generation pipeline:
  1. Symbol master asset — for new marks, prefer the generated PNG route: call
    mcp__pika__generate_image
    with
    provider="gpt-image-2"
    for a clean isolated mark on transparent background, centered, no text, no watermark, no mockup, no shadows. Use SVG only if the mark is deliberately simple and still reads at 16×16. Keep the best generated PNG as the source of truth when it is stronger than SVG.
  2. Logo SVGs — write SVG variants for wordmarks, lockups, and any symbol that is actually vector-authored. Do not trace a rich generated PNG into a weak SVG just to satisfy a vector preference. For raster-generated symbols, note in
    brand.md
    that the symbol master is PNG.
  3. Logo PNGs — export every symbol / wordmark / lockup as PNG on transparent background at
    1024×1024+
    for symbols and enough width for wordmarks. For SVG-derived assets, render SVG → PNG via
    mcp__pika__html_to_png
    : HTML wrapper with
    <body style="margin:0;background:transparent;">
    containing just the SVG,
    raster_options.viewport_px:1024x1024
    ,
    transparent_background:true
    when available. For generated PNG symbols, preserve the original high-res transparent PNG and create color variants only when they remain crisp.
  4. Logo PDFs — render SVG logo wrappers via
    mcp__pika__html_to_pdf
    when vector source exists. For raster-generated symbols, create a PDF wrapper that embeds the high-res PNG at full resolution and label it as raster-source in the README; do not pretend it is vector.
  5. Icon SVGs — write each of the 12 icons as a standalone SVG with
    stroke="currentColor"
    ,
    viewBox="0 0 24 24"
    , and the brand's chosen stroke weight + corner style applied consistently. See
    references/brand-guidelines.md
    "Icons Page — Structure & Rules" for which icons to include.
  6. Design tokens — generate all three files from the brand spec:
    • tokens.css
      :root
      block with
      --color-*
      ,
      --font-*
      ,
      --font-size-*
      ,
      --line-height-*
      ,
      --space-*
      ,
      --radius-*
      ,
      --shadow-*
      custom properties
    • tokens.json
      — same content as JSON object with sections:
      color
      ,
      font
      ,
      fontSize
      ,
      lineHeight
      ,
      spacing
      ,
      radius
      ,
      shadow
    • tailwind.config.snippet.js
      — JavaScript snippet to paste inside
      module.exports.theme.extend
      covering colors, fontFamily, fontSize, borderRadius, boxShadow
  7. AI prompts — generate each prompt file with brand specifics interpolated:
    • system-prompt.md
      — a system prompt to paste at the top of any Claude/GPT thread. Includes: brand voice adjectives, forbidden words, copy rules, photography direction, color/font specs, sample voice examples. End with "Always apply this brand voice unless explicitly instructed otherwise."
    • tweet.md
      — task starter: max 280 chars, voice constraints, sample target tweets, then "Task: [USER FILLS IN]"
    • landing-hero.md
      — task starter: hero copy structure (headline + subheadline + CTA), brand voice rules, examples from the guidelines
    • email.md
      — task starter: email tone, subject line guidance, body structure, sign-off conventions
    • error-message.md
      — task starter: how the brand handles error/empty/loading states in voice (warm not robotic, specific not vague)
    • photography.md
      — task starter for generating brand-style photography (gpt-image-2 etc.). Must include: master prompt template tailored to the brand's photo direction (subject, light, color grade, cast diversity, texture); explicit "what to AVOID in the prompt" list (studio strobes, stock terms, glass coworking spaces, "engineers at laptops," "professional," "premium," etc.); banned cliché concepts list (hourglasses, lightbulbs, handshakes, network nodes, glowing brains, etc.); subject substitutes for "person doing X"; quality requirements (butter accent, diversity, film grain, documentary); explicit no-text guardrail string; note about never naming real publications.
    • illustration.md
      — only if the brand uses illustration as a medium. Task starter for generating brand-style illustrations. Master prompt template with strict palette + style rules (flat vector / line art / etc), banned elements (gradients, drop shadows, 3D, photographic textures), when to use illustration vs photography. Skip this file entirely if the brand has no illustration in its visual world.
  8. Brand fonts — local export step. Download the actual font files from Google Fonts (or wherever the brand fonts live) and include in
    fonts/
    :
    • Variable font files when available:
      [FontName]-Variable.ttf
      (single file, supports all weights)
    • Or static weights at the levels the brand uses
    • GitHub mirror pattern:
      https://github.com/google/fonts/raw/main/ofl/[fontname]/[FontName][wght].ttf
    • Add a
      fonts/README.md
      noting the license (OFL is common, allows redistribution) + Google Fonts URL for online installation
  9. Brand guidelines PDF — download the
    html_to_pdf.file_url
    from Step 4 into the kit as
    brand-guidelines.pdf
    . The kit is incomplete without it.
  10. README.md — 1-page guide telling the user: what's in the kit, how to use brand.md with AI tools, which logo file for which context, where to install fonts (local TTFs or Google Fonts URLs), how to use the photography/illustration prompts.
  11. Zip everything:
    zip -r [brand]-brand-kit.zip brand.md brand-guidelines.pdf README.md logo/ icons/ fonts/ tokens/ prompts/
Brand-kit completion gate: when the user confirms export, or when
export_kit
is set in
--config
, the brand kit zip is a required deliverable. Do not mark the brand kit complete until the zip exists and contains
brand.md
,
brand-guidelines.pdf
, README, logo assets, icons, fonts, tokens, and prompts. If any required file cannot be produced, stop with a resumable checkpoint and list the missing files instead of shipping a partial zip.
README.md — 1-page guide telling the user:
  • What's in the kit
  • How to use
    brand.md
    with AI tools (paste into Claude/GPT to generate on-brand work)
  • Which logo file to use for which context (web favicon → symbol PNG; print collateral → PDF; web header → wordmark SVG; etc.)
  • Font installation links (Google Fonts URLs)
Delivery:
  • Save zip to
    ~/Desktop/[brand-name]-brand-kit.zip
    for local Mac users.
  • If the environment cannot download fonts or write a local zip, do not ship a partial kit. Stop with a resumable blocked checkpoint that lists the completed artifacts (
    file_url
    ,
    brand.md
    , tokens, logo assets), the missing files, and the exact filesystem/network blocker.
  • Upload the completed zip to a CDN if needed.
  • Tell the user what's in the completed zip and link to the
    brand.md
    so they can preview without unzipping.
交付15页指南PDF后,等待用户明确确认他们对品牌满意。不要自动导出——工具包是品牌最终版本的固化,因此仅在品牌锁定后再制作。
然后制作一个全面的品牌工具包压缩包,让用户可以在任何地方生成符合品牌规范的内容——在Claude、GPT、Figma中,与设计师、开发者协作时均可使用。
工具包结构:
[brand-name]-brand-kit.zip
├── brand.md                       # 全面的机器可读规范
├── brand-guidelines.pdf           # 完整的15页品牌指南PDF(视觉交付物)
├── README.md                      # 1页使用指南
├── logo/
│   ├── symbol/                    # 独立标志图形,每个颜色变体一套
│   │   ├── symbol-[color].png     # 透明背景,1024×1024+
│   │   ├── symbol-[color].svg     # 仅当矢量绘制时提供
│   │   └── symbol-[color].pdf     # 矢量PDF或栅格PDF封装
│   ├── wordmark/                  # 品牌名称样式
│   │   └── wordmark-[color].{svg,png,pdf}
│   └── lockup/                    # 标志图形+文字标志组合
│       ├── horizontal/
│       │   └── lockup-h-[color].{png,pdf} + svg(仅当完全矢量绘制时)
│       └── stacked/
│           └── lockup-s-[color].{png,pdf} + svg(仅当完全矢量绘制时)
├── icons/                         # 第8页中的12个UI图标,格式为SVG
│   ├── arrow-right.svg
│   ├── check.svg
│   ├── close.svg
│   ├── plus.svg
│   ├── search.svg
│   ├── user.svg
│   ├── settings.svg
│   ├── bell.svg
│   ├── menu.svg
│   ├── info.svg
│   └── [+ 任何品牌特定图标]
├── fonts/                         # 实际TTF字体文件(OFL授权的Google Fonts)
│   ├── [display-font]-Variable.ttf
│   ├── [body-font]-Variable.ttf
│   └── README.md                  # 授权说明+安装指南
├── tokens/                        # 面向开发者的设计令牌
│   ├── tokens.css                 # CSS自定义属性——粘贴到:root中
│   ├── tokens.json                # 相同内容的JSON格式——用于AI工具/CI
│   └── tailwind.config.snippet.js # 粘贴到tailwind.config.js的extend块中
└── prompts/                       # 用于后续品牌内容生成的AI提示词
    ├── system-prompt.md           # 粘贴到Claude/GPT对话顶部,确保品牌语气
    ├── tweet.md                   # 任务特定起始提示:撰写推文
    ├── landing-hero.md            # 任务特定起始提示:着陆页hero区文案
    ├── email.md                   # 任务特定起始提示:营销/事务性邮件
    ├── error-message.md           # 任务特定起始提示:撰写友好的错误提示
    ├── photography.md             # 任务起始提示:生成符合品牌风格的照片(包含陈词滥调防护+品牌摄影规则)
    └── illustration.md            # 任务起始提示:生成符合品牌风格的插画(仅当品牌使用插画作为媒介时)
需导出的颜色变体(每个Logo):浅色背景主色、深色背景主色、浅色背景中性色(墨色)、深色背景中性色(奶油色),以及一个强调色+底色组合。通常每个Logo类型有4-5套颜色。
brand.md — 完整结构参考
references/brand-md-template.md
。必须包含:
  • 快速参考块(名称、标语、主色、字体、语气,集中展示便于快速查看)
  • 定位+受众细分
  • 使命、价值观、故事
  • 语气语调(形容词、不同场景的文案示例、禁用词汇)
  • 配色(表格包含十六进制/RGB/CMYK/Pantone/角色)
  • 字体(展示字体+正文字体+Google Fonts URL+完整层级)
  • Logo(文字标志说明+图形标志说明+组合标志规格+文件列表及路径)
  • 摄影规则
  • 视觉风格描述
  • 触点规格
  • 规范与禁忌列表
  • 参考品牌及「可借鉴点」说明
  • 使用指南部分,告知下游工具/人员如何应用该规范
资产生成流程:
  1. 标志图形主资产 — 对于新标志,优先选择生成PNG的方式:调用
    mcp__pika__generate_image
    工具,使用
    provider="gpt-image-2"
    生成透明背景的干净独立标志,居中放置,无文字、无水印、无样机、无阴影。仅当标志故意设计得简单且在16×16尺寸下仍清晰可读时,才使用SVG。当生成的PNG比SVG效果更好时,保留最佳生成PNG作为源文件。
  2. Logo SVG — 为文字标志、组合标志以及任何矢量绘制的标志图形编写SVG变体。不要将高质量生成PNG追踪为低质量SVG以满足矢量需求。对于栅格生成的标志,需在
    brand.md
    中注明标志主资产为PNG。
  3. Logo PNG — 将每个标志图形/文字标志/组合标志导出为透明背景的PNG,标志图形尺寸为
    1024×1024+
    ,文字标志宽度足够。对于SVG衍生资产,通过
    mcp__pika__html_to_png
    将SVG渲染为PNG:HTML包装器包含
    <body style="margin:0;background:transparent;">
    ,仅嵌入SVG,
    raster_options.viewport_px:1024x1024
    ,如果支持则设置
    transparent_background:true
    。对于生成的PNG标志,保留原始高分辨率透明PNG,仅当颜色变体仍清晰时才创建。
  4. Logo PDF — 当存在矢量源文件时,通过
    mcp__pika__html_to_pdf
    渲染SVG Logo封装。对于栅格生成的标志,创建嵌入高分辨率PNG的PDF封装,并在README中注明为栅格源;不要伪装成矢量文件。
  5. 图标SVG — 将12个图标分别编写为独立SVG,设置
    stroke="currentColor"
    viewBox="0 0 24 24"
    ,并统一应用品牌选定的线条粗细+圆角样式。需包含的图标参考
    references/brand-guidelines.md
    中的「Icons Page — Structure & Rules」部分。
  6. 设计令牌 — 根据品牌规范生成三个文件:
    • tokens.css
      :root
      块,包含
      --color-*
      --font-*
      --font-size-*
      --line-height-*
      --space-*
      --radius-*
      --shadow-*
      自定义属性
    • tokens.json
      — 相同内容的JSON对象,包含:
      color
      font
      fontSize
      lineHeight
      spacing
      radius
      shadow
      章节
    • tailwind.config.snippet.js
      — JavaScript代码片段,粘贴到
      module.exports.theme.extend
      中,涵盖颜色、字体族、字体大小、边框圆角、阴影
  7. AI提示词 — 生成每个提示词文件,插入品牌特定内容:
    • system-prompt.md
      — 粘贴到任何Claude/GPT对话顶部的系统提示词。包含:品牌语气形容词、禁用词汇、文案规则、摄影方向、配色/字体规格、语气示例。结尾添加「除非明确指示,否则始终遵循此品牌语气。」
    • tweet.md
      — 任务起始提示:最多280字符,语气限制,目标推文示例,然后是「任务:[用户填写]」
    • landing-hero.md
      — 任务起始提示:hero区文案结构(标题+副标题+CTA)、品牌语气规则、指南中的示例
    • email.md
      — 任务起始提示:邮件语气、主题行指导、正文结构、签名惯例
    • error-message.md
      — 任务起始提示:品牌如何处理错误/空/加载状态的语气(友好而非机械,具体而非模糊)
    • photography.md
      — 生成符合品牌风格照片的任务起始提示(gpt-image-2等工具)。必须包含:适配品牌摄影方向的主提示模板(主题、光线、色彩分级、演员多样性、纹理);明确的「提示词中需避免的内容」列表(影室闪光灯、 stock 术语、玻璃 coworking 空间、「工程师在笔记本电脑前」、「专业」、「高端」等);禁用陈词滥调概念列表(沙漏、灯泡、握手、网络节点、发光大脑等);「人物做X」的替代主题;质量要求(柔和高光、多样性、胶片颗粒、纪实风格);明确的无文字防护语句;不要提及真实出版物的说明。
    • illustration.md
      — 仅当品牌使用插画作为媒介时提供。生成符合品牌风格插画的任务起始提示。主提示模板包含严格的配色+风格规则(扁平化矢量/线条艺术等)、禁用元素(渐变、阴影、3D、摄影纹理)、何时使用插画vs摄影。如果品牌视觉风格中没有插画,完全跳过此文件。
  8. 品牌字体 — 本地导出步骤。从Google Fonts(或品牌字体所在位置)下载实际字体文件,放入
    fonts/
    • 如果有可变字体文件:
      [FontName]-Variable.ttf
      (单个文件,支持所有字重)
    • 否则,下载品牌使用的静态字重
    • GitHub镜像路径:
      https://github.com/google/fonts/raw/main/ofl/[fontname]/[FontName][wght].ttf
    • 添加
      fonts/README.md
      ,注明授权(OFL常见,允许重新分发)+ Google Fonts在线安装URL
  9. 品牌指南PDF — 将步骤4中
    html_to_pdf.file_url
    下载到工具包中,命名为
    brand-guidelines.pdf
    。工具包中必须包含此文件。
  10. README.md — 1页指南,告知用户:工具包内容、如何将brand.md用于AI工具、不同场景使用哪个Logo文件、字体安装位置(本地TTF或Google Fonts URL)、如何使用摄影/插画提示词。
  11. 压缩所有内容
    zip -r [brand]-brand-kit.zip brand.md brand-guidelines.pdf README.md logo/ icons/ fonts/ tokens/ prompts/
品牌工具包完成检查:当用户确认导出,或
--config
中设置了
export_kit
时,品牌工具包压缩包是必填交付物。仅当压缩包存在且包含
brand.md
brand-guidelines.pdf
、README、Logo资产、图标、字体、令牌和提示词时,才能标记品牌工具包已完成。如果任何必填文件无法生成,需停止操作并保存可恢复检查点,列出缺失文件,而非交付不完整的压缩包。
README.md — 1页指南,告知用户:
  • 工具包内容
  • 如何将
    brand.md
    用于AI工具(粘贴到Claude/GPT中生成符合品牌规范的内容)
  • 不同场景使用哪个Logo文件(网站图标→标志图形PNG;印刷物料→PDF;网站页眉→文字标志SVG等)
  • 字体安装链接(Google Fonts URL)
交付方式:
  • 对于本地Mac用户,将压缩包保存到
    ~/Desktop/[brand-name]-brand-kit.zip
  • 如果环境无法下载字体或写入本地压缩包,不要交付不完整的工具包。停止操作并保存可恢复阻塞检查点,列出已完成的工件(
    file_url
    brand.md
    、令牌、Logo资产)、缺失文件以及确切的文件系统/网络阻塞原因。
  • 如果需要,将完整压缩包上传到CDN。
  • 告知用户完整压缩包的内容,并提供
    brand.md
    的链接,方便用户无需解压即可预览。

Key Principles

核心原则

  • The input is the brief. Don't ask for lengthy intake forms. Read what's in front of you and ask 3-5 precise questions.
  • Be specific about customers without narrowing the brand to one person. Vague audiences = weak brands, but one hyper-specific individual can make the output unusably narrow. Define audience segments first: a primary segment, 1-2 secondary segments, and one anchor persona that makes the primary segment feel concrete.
  • 3 options at each choice point. Direction (step 2), then identity (step 3). Always 3.
  • Opinionated but collaborative. Present your read confidently. They can push back.
  • Generate actual copy — don't give templates with [BRACKETS]. Write real words in the brand voice.
  • All images must look real and crafted. Generated lifestyle/touchpoint images need film grain, natural light, slight imperfections, editorial composition. Banned: perfect symmetry, gradient backgrounds, studio strobes, stock-photo energy, AI-smooth surfaces, floating objects on white. If it looks fake — regenerate.
  • Single deliverable. One brand guidelines PDF. Not a press kit. Not a launch package. Not a social calendar. Just the brand.

  • 输入即需求 brief。不要要求冗长的需求收集表单。解读眼前的输入,提出3-5个精准问题。
  • 明确客户群体但不要将品牌局限于某个人。模糊的受众=薄弱的品牌,但过于具体的个体可能导致输出过于狭窄无法使用。先定义受众细分:核心群体、1-2个次要群体,以及一个让核心群体更具象的典型人物画像。
  • 每个选择节点提供3个选项。方向(步骤2),然后是标识(步骤3)。始终提供3个选项。
  • 有主见但保持协作。自信地呈现你的解读,用户可以提出异议。
  • 生成真实文案——不要提供带[方括号]的模板。用品牌语气撰写真实内容。
  • 所有图片必须看起来真实且经过精心设计。生成的生活方式/触点图片需要胶片颗粒、自然光、轻微瑕疵、编辑级构图。禁止:完美对称、渐变背景、影室闪光灯、 stock 照片感、AI平滑表面、白色背景上的悬浮物体。如果看起来虚假——重新生成。
  • 单一交付物。一份品牌指南PDF。不是新闻稿包,不是上线套餐,不是社交日历。仅专注于品牌本身。

Brand Quality Standards

品牌质量标准

Every brand produced by this skill should meet the following standards. Generic output is a failure state because the deliverable is meant to guide real design decisions, not decorate a template.
此技能生成的每个品牌都应符合以下标准。通用化输出是失败状态,因为交付物旨在指导真实的设计决策,而非装饰模板。

The Anti-Generic Test

反通用测试

Before delivering anything, ask: Could this be a brand for literally anything else? If yes — it's not done.
Strong brand = specific product/service + clear audience model + specific point of view. Weak brand = vibes + aesthetic mood board + empty tagline. Never deliver the second.
交付任何内容前,问自己:这个品牌可以用于任何其他事物吗? 如果答案是肯定的——说明还未完成。
强大的品牌=特定产品/服务+清晰的受众模型+明确的立场。薄弱的品牌=氛围+美学情绪板+空洞的标语。绝不要交付后者。

Copy Standards

文案标准

What good brand copy sounds like:
  • It makes a specific claim: "Heavy wool. Made to last a decade." / "Built for one quiet hour a day."
  • It has a point of view: "Not trend-led. Not mass-made."
  • It can speak concretely to a reader inside a segment: "The app you reach for before checking your phone." This is copy style, not audience strategy; do not collapse the brand's audience model to only that reader.
  • It creates tension or contrast: "Handmade. Overused. On purpose."
  • It trusts the reader: no over-explaining, no "perfect for any occasion", no "cozy vibes"
What bad brand copy sounds like:
  • "Crafted with love" / "Made with care" / "Designed with passion"
  • "Perfect for any occasion" / "A timeless addition"
  • "Quality you can feel" / "Designed to inspire"
  • Generic taglines: "Where quality meets style" / "Wear your story"
  • Hollow superlatives: "premium", "luxury", "elevated", "curated", "artisanal"
  • Anything that could describe 500 other brands without changing a word
Tagline test: A great tagline could only belong to this brand. "Handmade. Overused. On purpose." is WORN's. "Just do it." is Nike's. If your tagline could appear on any random Etsy shop or Squarespace site without anyone noticing — rewrite it.
优秀品牌文案的特点:
  • 提出具体主张:「厚重羊毛。可穿十年。」/「专为每天一小时的宁静时光打造。」
  • 有明确立场:「不追潮流。不批量生产。」
  • 能与细分群体中的读者进行具体沟通:「你查看手机前会打开的应用。」这是文案风格,而非受众策略;不要将品牌的受众模型简化为仅针对该读者。
  • 制造张力或对比:「手工制作。刻意磨损。」
  • 信任读者:不过度解释,不说「适合任何场合」,不说「温馨氛围」
糟糕品牌文案的特点:
  • 「用心制作」/「精心打造」/「满怀热情设计」
  • 「适合任何场合」/「永恒之选」
  • 「品质触手可及」/「设计激发灵感」
  • 通用标语:「品质与风格兼具」/「彰显你的故事」
  • 空洞的最高级:「高端」、「奢华」、「升级」、「精选」、「手工」
  • 无需修改即可描述500个其他品牌的内容
标语测试: 优秀的标语只能属于这个品牌。「手工制作。刻意磨损。」是WORN品牌的。「Just do it.」是Nike的。如果你的标语可以出现在任何Etsy店铺或Squarespace网站上而不被注意——重写它。

Design Standards

设计标准

What editorial brand design looks like:
  • Strong typographic hierarchy — one thing is clearly the most important
  • Color used with conviction — large fields, not accent dots
  • Photography bleeds to edges — no floating images with shadow drops
  • Scale contrast — one element dominates, others recede
  • Pages feel designed, not assembled
  • Whitespace is intentional, not default padding
What generic brand design looks like:
  • Equal-sized boxes arranged in a grid
  • Body copy the same size as everything else
  • Centered everything
  • White background with a few colored boxes
  • Photos floating in white space with rounded corners
  • Font specimens that say "Font Name Here" or "Sample Text"
  • Color swatches that look like a paint store brochure
Layout rule: If a page could have been made in Canva or PowerPoint in 10 minutes — it's not good enough. Every page should require design decisions only someone with taste would make.
编辑级品牌设计的特点:
  • 清晰的字体层级——有一个明确的视觉焦点
  • 配色大胆——大面积使用,而非点缀
  • 照片全屏bleed——无带阴影的浮动图片
  • 比例对比——一个元素主导,其他元素退居次要
  • 页面看起来是设计出来的,而非拼凑的
  • 留白是刻意设计的,而非默认内边距
通用品牌设计的特点:
  • 等大的盒子排列成网格
  • 正文字体与其他内容大小相同
  • 所有内容居中
  • 白色背景+几个彩色盒子
  • 照片悬浮在白色空间中,带圆角
  • 字体样本显示「Font Name Here」或「Sample Text」
  • 色卡看起来像油漆店手册
布局规则: 如果一个页面可以在10分钟内用Canva或PowerPoint制作出来——说明不够好。每个页面都需要有品味的人做出设计决策。

Photography & Diversity Standards

摄影与多样性标准

Generated image sets featuring people should show racial diversity. This avoids defaulting every brand world to the same narrow cast.
  • Default to a mixed cast across all 4+ lifestyle images: include Black, Asian, Latina, South Asian, Middle Eastern, or mixed-race subjects
  • Vary body types, not just skin tone
  • If only one person is shown, make a deliberate choice about who that person is — don't default to white/light-skinned
  • Diversity is not a checkbox. It's a design choice that makes the brand more resonant and more honest
Photography must feel found, not staged:
  • Real rooms with real lives in them (papers, plants, worn furniture)
  • Imperfect light (window light, overcast, early morning)
  • Film grain always — even a little
  • Subjects not looking at camera unless it's a strong choice
包含人物的生成图片集应体现种族多样性。这避免了将每个品牌的视觉世界默认局限于单一群体。
  • 默认在所有4+张生活方式图片中使用混合演员阵容:包含黑人、亚洲人、拉丁裔、南亚人、中东人或混血人群
  • 体型多样,不仅是肤色多样
  • 如果仅展示一个人,需刻意选择人物身份——不要默认选择白人/浅肤色
  • 多样性不是 checkbox。这是让品牌更有共鸣、更真实的设计选择
摄影必须看起来是真实捕捉的,而非摆拍:
  • 真实房间,真实生活场景(文件、植物、旧家具)
  • 不完美的光线(窗户光、阴天、清晨)
  • 始终添加胶片颗粒——即使只是一点点
  • 除非是刻意设计,否则人物不要直视镜头

Deck / Guidelines Design Standards

演示文稿/指南设计标准

  • Typography must load. Use HTTPS font URLs or inline
    data:font/...
    sources in MCP-rendered HTML; local
    file://
    paths are not available to server-side Chromium. Always verify loaded fonts before signing off on a render. If fonts fall back to system defaults — the deck is broken, not deliverable.
  • See
    references/brand-guidelines.md
    for the full MCP render contract and QA rules.
  • Every page must have a clear visual hierarchy — one thing to look at first.
  • Full-bleed photography pages should feel like magazine spreads, not slideshow slides.
  • Color palette pages: full-bleed color columns, not swatches floating on white.
  • Logo page: logo dramatically large, with clear variants, not timid or small.
  • Voice page: show actual brand copy, not generic example copy.
  • Touchpoints page: must include generated photographs of actual touchpoints — never CSS boxes.
Deliver the guidelines as one PDF, not as individual page images. Return the
mcp__pika__html_to_pdf
CDN URL and save a local copy when practical for the brand-kit zip.
  • 字体必须加载成功。在MCP渲染的HTML中使用HTTPS字体URL或内嵌
    data:font/...
    资源;服务器端Chromium无法访问本地
    file://
    路径。签署渲染前始终验证字体是否加载成功。如果字体回退到系统默认字体——说明演示文稿有问题,不能交付。
  • 完整的MCP渲染协议和QA规则参考
    references/brand-guidelines.md
  • 每个页面必须有清晰的视觉层级——有一个首要视觉焦点。
  • 全屏bleed摄影页面应看起来像杂志排版,而非幻灯片。
  • 配色页面:全屏bleed配色栏,而非白色背景上的浮动色卡。
  • Logo页面:Logo要醒目放大,展示清晰的变体,不要小巧拘谨。
  • 语气页面:展示真实的品牌文案,而非通用示例文案。
  • 触点页面:必须包含生成的真实触点照片——绝不要使用CSS盒子。
将指南作为一份PDF交付,而非单独的页面图片。返回
mcp__pika__html_to_pdf
的CDN URL,并在可行时保存本地副本用于品牌工具包压缩包。

The Taste Check

品味检查

Before delivering any brand output, ask yourself:
  1. Would a 25-year-old with good taste want to buy from / use / work for this brand?
  2. Does the copy sound like a real person wrote it?
  3. Does the design look like a real designer made it?
  4. Are the photos diverse and real-looking?
  5. Is there a specific point of view — something this brand stands for that another brand doesn't?
If any answer is "not sure" — improve it before delivering. Strong and specific beats safe and generic every time.

交付任何品牌输出前,问自己:
  1. 有品味的25岁年轻人是否愿意购买/使用/为这个品牌工作?
  2. 文案听起来像真人写的吗?
  3. 设计看起来像真正的设计师做的吗?
  4. 照片是否多样且真实?
  5. 是否有明确的立场——这个品牌主张的是其他品牌没有的什么?
如果任何一个答案是「不确定」——交付前改进。明确且独特的品牌永远比安全且通用的品牌更好。

Load-bearing phrases

关键短语

These are the anchors that keep this skill from drifting into generic brand-book output:
PhraseWhereWhy load-bearing
different business answer — not aesthetic variations
Step 2 directionsForces positioning variety before visual variety.
fonts must have character
Step 3 identity optionsPrevents safe-font defaults from making every brand feel interchangeable.
no load-bearing text on generated images
Guidelines build rulesKeeps brand claims editable and legible in deterministic HTML/PDF.
film grain, natural light, slight imperfections
Image quality standardsPushes lifestyle/touchpoint images away from stock-photo smoothness.
Name a specific ethnicity per prompt
Diverse-cast recoveryFixes the model tendency toward all-white casts more reliably than generic diversity language.

这些短语是防止此技能输出通用品牌手册的核心:
短语位置核心作用
different business answer — not aesthetic variations
步骤2品牌方向强制在视觉变化前先实现定位差异。
fonts must have character
步骤3标识选项避免安全字体默认值导致所有品牌感觉雷同。
no load-bearing text on generated images
指南构建规则确保品牌主张在确定性HTML/PDF中可编辑且清晰可读。
film grain, natural light, slight imperfections
图片质量标准推动生活方式/触点图片远离 stock 照片的平滑感。
Name a specific ethnicity per prompt
多元化演员阵容修复比通用的多样性语言更可靠地解决模型默认白人阵容的问题。

Engine choice: gpt-image-2 (with caveats)

引擎选择:gpt-image-2(附带注意事项)

Default to
gpt-image-2
at
quality: "medium"
for all brand imagery. Why:
  • Best instruction-following for cast-diversity prompts (nano-banana-pro tends to drift toward a white default unless heavily prompted).
  • Strongest no-text guardrail adherence — critical for touchpoint shots (hang tag / woven label / sticker) where any baked-in text would ruin the mockup.
  • Native 3:4 / 4:3 / 9:16 ratios crop cleanly on sharp subjects without weird stretching.
Avoid
nano-banana-pro
for this skill — it bakes magazine-cover-style text into product shots when prompts mention "editorial." 1K from gpt-image-2 is plenty for a 1200×850 PDF page; bump to gpt-image-2's 2K tier (or escalate to
seedream
for higher) only if a specific touchpoint genuinely needs print-tier resolution. (4K on gpt-image-2 is 16:9 / 9:16 only — this skill's 3:4 / 4:3 ratios route to
seedream
if 4K is required.)
所有品牌图片默认使用
gpt-image-2
,设置
quality: "medium"
。原因:
  • 对演员多样性提示词的指令遵循最佳(nano-banana-pro如果不大量提示,往往会默认白人阵容)。
  • 最严格遵守无文字规则——这对触点照片(吊牌/织唛/贴纸)至关重要,任何内置文字都会破坏样机。
  • 原生3:4 / 4:3 / 9:16比例在清晰主体上裁剪干净,不会出现奇怪的拉伸。
此技能避免使用
nano-banana-pro
——当提示词提到「editorial」时,它会在产品照片中添加杂志封面风格的文字。gpt-image-2的1K分辨率对于1200×850 PDF页面已足够;仅当特定触点确实需要印刷级分辨率时,才升级到gpt-image-2的2K层级(或更高的
seedream
)。(gpt-image-2的4K仅支持16:9 / 9:16比例——此技能的3:4 / 4:3比例如果需要4K,将使用
seedream
。)

Runtime expectations

运行时间预期

Tell the user the rough total up front — long stages without status updates feel broken.
StageTimeNotes
Stage 0 → Step 1 (Q&A loop)5–15 minUser-paced; questions in one message
Step 2 (3 directions, text)1–2 minPure model output
Step 3 (3 identities + brand board PDF)5–7 min3 brand boards rendered via
mcp__pika__html_to_pdf
/
mcp__pika__html_to_png
QA
Step 4 image gen (8 photos via gpt-image-2 in 2 parallel batches of 4)8–12 minThe longest stage; each batch ≈ 4–6 min
Step 4 page build (15-page HTML + MCP render)2–5 min
mcp__pika__html_to_pdf
async;
mcp__pika__html_to_png
previews for QA
Step 5 brand kit zip3–5 min4 colors × 3 logo types × 3 formats + 12 icons + tokens + fonts + prompts
Total: ~25–45 min wall-clock excluding user response time.

提前告知用户大致总时长——长时间无状态更新会让用户觉得流程中断。
阶段时间说明
阶段0 → 步骤1(问答循环)5–15分钟用户主导;问题整合为一条消息
步骤2(3个方向,文字)1–2分钟纯模型输出
步骤3(3个标识+品牌看板PDF)5–7分钟3个品牌看板通过
mcp__pika__html_to_pdf
/
mcp__pika__html_to_png
渲染并QA
步骤4图片生成(通过gpt-image-2生成8张照片,分2组并行,每组4张)8–12分钟最长阶段;每组≈4–6分钟
步骤4页面构建(15页HTML + MCP渲染)2–5分钟
mcp__pika__html_to_pdf
异步;
mcp__pika__html_to_png
预览用于QA
步骤5品牌工具包压缩3–5分钟4种颜色×3种Logo类型×3种格式 + 12个图标 + 令牌 + 字体 + 提示词
总时长:约25–45分钟(不含用户回复时间)。

Failure modes

故障模式

SymptomCauseFix
Fonts render as Times / Arial in the PDFFont URL not reachable by server-side Chromium, or
@font-face
points to a local path
Use HTTPS raw font URLs or inline
data:font/...
sources; render a one-page
mcp__pika__html_to_png
preview before the full PDF
Generated image has baked-in magazine title or watermarkPrompt mentioned "magazine cover," "Vogue," "TIME," "Bloomberg," or any real publicationStrip publication names from prompt; append the verbatim no-text guardrail; regenerate. Describe visual qualities, not publications
Touchpoint / lifestyle photo shows only forehead / hand-only crop9:16 portrait source got cropped to a landscape cellRegen with
aspect_ratio: "4:3"
or
"16:9"
to match the cell aspect, OR change the layout to a portrait cell
Page overflows the 850px ceilingHeadline > 60px combined with > 3 body paragraphs on the same pageCut content, drop headline to 48px, or split across two pages. Re-render and verify with a screenshot
Board technically fits but looks uglyToo much decorative styling, tiny text, muddy one-note palette, empty mockups, or weak hierarchyRewrite board copy to fit the budgets in
brand-identity.md
, remove decorative microtype, increase body text to 18px+, add negative space/contrast, and rerun PNG + visual QA
Text overlaps icons/swatches/seals/mockupsDecorative or absolute-positioned elements share the same reading area as copyGive text a clean reading column/card, move graphics behind non-text areas only, and rerender. Passing
scrollHeight
is not enough if a sibling graphic occludes text
Brand board pages feel like recolored templatesSame template reused with palette swapsRebuild from
references/brand-guidelines.md
"Brand Board Layout — Differentiate per Option" — each board's layout must physically embody its design philosophy
Multi-page merge failsTrying to stitch local page PDFs instead of using MCPPrefer one
mcp__pika__html_to_pdf
call with native
@page
, or use
body_pages
+
shared_head
so the server merges pages
User picks a hybrid identity ("02's palette + 01's voice")Skill assumes single-option pickBuild a hybrid spec brief before Step 4, confirm with user before rendering 15 pages
PDF upload to pika MCP returns "Unsupported file type"
mcp__pika__upload_asset
allowlist is images/audio/video only — no PDFs
Don't use
mcp__pika__upload_asset
for PDFs. Use the
file_url
returned by
mcp__pika__html_to_pdf
; optionally save a local copy
Lifestyle grid all-white-cast despite diverse-cast rulegpt-image-2 defaults to lighter skin tone when ethnicity isn't named explicitly per promptName a specific ethnicity per prompt (Black, mixed-race East-Asian-and-white, East Asian, Latina, South Asian, Middle Eastern) — vary across the 4 grid prompts
症状原因修复方案
PDF中字体显示为Times / Arial服务器端Chromium无法访问字体URL,或
@font-face
指向本地路径
使用HTTPS原始字体URL或内嵌
data:font/...
资源;在生成完整PDF前,渲染单页
mcp__pika__html_to_png
预览验证字体加载
生成图片包含内置杂志标题或水印提示词提到「magazine cover」「Vogue」「TIME」「Bloomberg」或任何真实出版物从提示词中删除出版物名称;添加明确的无文字防护语句;重新生成。描述视觉特征,而非出版物名称
触点/生活方式照片仅显示额头/手部特写9:16竖版原图被裁剪为横版单元格使用
aspect_ratio: "4:3"
"16:9"
重新生成以匹配单元格比例,或调整布局为竖版单元格
页面超出850px高度限制标题>60px且同一页面包含>3段正文精简内容,将标题缩小到48px,或拆分到两个页面。重新渲染并通过截图验证
看板技术上符合尺寸但外观丑陋装饰性样式过多、文字过小、配色单调浑浊、空样机、或层级薄弱按照
brand-identity.md
中的篇幅限制重写看板文案,移除装饰性微排版,将正文字号增加到18px+,增加留白/对比度,重新生成PNG并进行视觉QA
文字与图标/色卡/印章/样机重叠装饰性或绝对定位元素与文案共享同一阅读区域为文案设置干净的阅读栏/卡片,仅将图形移至非文字区域后方,重新渲染。即使
scrollHeight
符合要求,如果兄弟图形遮挡文字也不行
品牌看板页面看起来像换色模板重复使用同一模板仅更换配色参考
references/brand-guidelines.md
中的「Brand Board Layout — Differentiate per Option」重新构建——每个看板的布局必须体现其设计理念
多页合并失败尝试拼接本地页面PDF而非使用MCP优先使用一次
mcp__pika__html_to_pdf
调用,使用原生
@page
,或使用
body_pages
+
shared_head
让服务器合并页面
用户选择混合标识(「02的配色+01的语气」)技能默认用户选择单一选项在步骤4前构建混合规范brief,渲染15页前与用户确认
PDF上传到pika MCP返回「Unsupported file type」
mcp__pika__upload_asset
仅允许图片/音频/视频——不支持PDF
不要使用
mcp__pika__upload_asset
上传PDF。使用
mcp__pika__html_to_pdf
返回的
file_url
;必要时保存本地副本
生活方式网格全为白人阵容,违反多样性规则gpt-image-2如果提示词中未明确指定种族,默认浅肤色每个提示词指定一个特定种族(黑人、混血东亚白人、东亚人、拉丁裔、南亚人、中东人)——在4张网格图片的提示词中轮换