youtube-thumbnail

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

YouTube Thumbnail

YouTube缩略图

CRITICAL: Auto-start on load

重要提示:加载时自动启动

When this skill triggers, go straight to Step 1.
当触发此技能时,直接进入步骤1。

Step 1. Gather inputs

步骤1:收集输入信息

Check the project for a reference photo config. Look in this order:
  1. thumbnail-config.md
    in the project root
  2. brand-kit.md
    — look for a reference image path and brand colours
  3. about-me.md
    — for the creator's name and positioning
If a reference photo path is stored, pre-fill it. Otherwise ask:
Upload or provide the path to the reference photo of yourself you want used in the thumbnail. Ideally a clear headshot with distinctive lighting and expression you plan to reuse across videos for brand consistency.
Then call AskUserQuestion:
json
[
  {
    "question": "What is the video title?",
    "header": "Title",
    "multiSelect": false,
    "options": [
      {"label": "I will type the title", "description": "Type the full working title"},
      {"label": "Suggest one", "description": "Given the topic, propose 3 click-worthy titles first"}
    ]
  },
  {
    "question": "Emotional tone?",
    "header": "Tone",
    "multiSelect": false,
    "options": [
      {"label": "Shock / surprise", "description": "Wide eyes, open mouth, bold reaction"},
      {"label": "Curious / thinking", "description": "Slight smirk, raised eyebrow, gaze off-frame"},
      {"label": "Confident / direct", "description": "Eye contact, calm, assertive"},
      {"label": "Frustrated / strong take", "description": "Intense gaze, hand gesture, tension"}
    ]
  }
]
检查项目中是否有参考照片配置,按以下顺序查找:
  1. 项目根目录下的
    thumbnail-config.md
  2. brand-kit.md
    —— 查找参考图片路径和品牌色彩
  3. about-me.md
    —— 获取创作者姓名和定位
如果已存储参考照片路径,则自动填充。否则询问用户:
上传或提供你想要用于缩略图的个人参考照片路径。理想情况下应为清晰的头像照片,带有独特的光线和表情,便于你在多个视频中重复使用以保持品牌一致性。
然后调用AskUserQuestion:
json
[
  {
    "question": "What is the video title?",
    "header": "Title",
    "multiSelect": false,
    "options": [
      {"label": "I will type the title", "description": "Type the full working title"},
      {"label": "Suggest one", "description": "Given the topic, propose 3 click-worthy titles first"}
    ]
  },
  {
    "question": "Emotional tone?",
    "header": "Tone",
    "multiSelect": false,
    "options": [
      {"label": "Shock / surprise", "description": "Wide eyes, open mouth, bold reaction"},
      {"label": "Curious / thinking", "description": "Slight smirk, raised eyebrow, gaze off-frame"},
      {"label": "Confident / direct", "description": "Eye contact, calm, assertive"},
      {"label": "Frustrated / strong take", "description": "Intense gaze, hand gesture, tension"}
    ]
  }
]

Step 2. Apply thumbnail best practices

步骤2:应用缩略图最佳实践

Every thumbnail must follow these rules:
  • Face fills 30 to 50 percent of the frame. Readable at small sizes.
  • 3 to 5 words maximum of large text. 6 if absolutely necessary.
  • Two colours dominate. Brand primary + one high-contrast accent (yellow, red, cyan work well).
  • One clear focal element besides the face. Tool logo, bold number, arrow, or prop.
  • High contrast between face, text, and background. Test by squinting.
  • Text is not a sentence. It is a hook phrase. Examples: "I fired my team", "Claude can now...", "Don't do this".
  • No small text, no logos bottom-right (watch time icon sits there).
所有缩略图必须遵循以下规则:
  • 面部占画面30%-50%:在小尺寸下仍清晰可读。
  • 最多3-5个大字:必要时可增加到6个。
  • 两种主色调:品牌主色 + 一种高对比度强调色(黄色、红色、青色效果较好)。
  • 除面部外有一个清晰的焦点元素:工具logo、醒目数字、箭头或道具。
  • 面部、文字与背景之间高对比度:可通过眯眼测试验证。
  • 文字非完整句子:而是吸引点击的短语。例如:"我解雇了团队"、"Claude现在可以..."、"别这么做"。
  • 无小号文字,右下角无logo(观看时长图标会显示在该位置)。

Step 3. Build the thumbnail brief

步骤3:撰写缩略图brief

Output a concise brief the user can review:
THUMBNAIL BRIEF: [video title]

Composition: [face position, % of frame, direction of gaze]
Text: "[hook phrase, 3-5 words]"
Text placement: [left, right, top, wraps around face]
Colour palette: [primary hex], [accent hex], [background hex]
Supporting element: [logo / prop / arrow / number]
Emotional tone: [tone from Step 1]
Then ask:
Here's the brief. Say "generate" to output the image prompt or tell me what to change.
输出一份简洁的brief供用户审核:
THUMBNAIL BRIEF: [视频标题]

构图:[面部位置、画面占比、视线方向]
文字:"[吸引点击的短语,3-5个词]"
文字位置:[左、右、上、环绕面部]
调色板:[主色十六进制码]、[强调色十六进制码]、[背景色十六进制码]
辅助元素:[logo/道具/箭头/数字]
情感基调:[步骤1中选择的基调]
然后询问:
这是生成brief。说“生成”即可输出图像提示词,或告诉我需要修改的内容。

Step 4. Output the Gemini prompt

步骤4:输出Gemini提示词

Once approved, output the image generation prompt in a code block:
Using the attached reference photo of me, generate a YouTube thumbnail at 1280 x 720 pixels (16:9).

Composition:
- Place me [left / right / centre] filling [30-50]% of the frame
- My expression: [tone details — e.g., shocked with wide eyes and open mouth]
- My gaze: [direction — e.g., looking directly at camera / looking off-frame toward the text]

Text:
- Display "[hook phrase]" in large bold sans-serif typography
- Text colour: [hex]
- Text outline: [colour, thickness for readability]
- Text placement: [specific area]

Colour palette:
- Primary: [hex]
- Accent: [hex]
- Background: [hex] — [describe treatment: flat, gradient, blurred scene, etc.]

Supporting element: [specific description of the supporting visual]

Constraints:
- Face must be clear and sharp
- Text must be readable at 320px wide (YouTube mobile size)
- No watermarks, no YouTube UI elements, no bottom-right corner text
- High contrast between face, text, and background
Tell the user:
Paste this into a new Gemini chat, attach your reference photo, enable Create Image, and select Nano Banana. Generate at 1280x720.
获得批准后,在代码块中输出图像生成提示词:
Using the attached reference photo of me, generate a YouTube thumbnail at 1280 x 720 pixels (16:9).

Composition:
- Place me [left / right / centre] filling [30-50]% of the frame
- My expression: [tone details — e.g., shocked with wide eyes and open mouth]
- My gaze: [direction — e.g., looking directly at camera / looking off-frame toward the text]

Text:
- Display "[hook phrase]" in large bold sans-serif typography
- Text colour: [hex]
- Text outline: [colour, thickness for readability]
- Text placement: [specific area]

Colour palette:
- Primary: [hex]
- Accent: [hex]
- Background: [hex] — [describe treatment: flat, gradient, blurred scene, etc.]

Supporting element: [specific description of the supporting visual]

Constraints:
- Face must be clear and sharp
- Text must be readable at 320px wide (YouTube mobile size)
- No watermarks, no YouTube UI elements, no bottom-right corner text
- High contrast between face, text, and background
告知用户:
将此内容粘贴到新的Gemini对话中,附上你的参考照片,启用“创建图像”功能,选择Nano Banana。生成尺寸为1280x720。

Step 5. Offer the next move

步骤5:提供后续操作建议

Want me to outline the video next? Hook, mid, CTA from the thumbnail. Or call the create skill if you have one.
需要我接下来帮你梳理视频大纲吗?基于缩略图生成钩子、主体内容和行动号召。或者如果你有创建技能,可以调用该技能。

Rules

规则

  • 1280x720 pixels (16:9). YouTube's native thumbnail size.
  • Never include the reference photo path in the prompt itself — the user attaches the photo separately.
  • Never allow more than 6 words of text, 5 is ideal, 3 is best.
  • Face must always be a visible focal point. No face-hidden compositions.
  • Never use em dashes.
  • British English unless voice.md specifies otherwise.
  • If brand-kit.md is in the project, read it and use exact brand colours.
  • Recommend the user keep a consistent thumbnail style across videos for channel recognition.
  • 尺寸为1280x720像素(16:9):YouTube原生缩略图尺寸。
  • 切勿在提示词中包含参考照片路径——用户需单独附加照片。
  • 文字数量不得超过6个,理想为5个,最佳为3个。
  • 面部必须始终是清晰的焦点元素,不得使用面部被遮挡的构图。
  • 切勿使用破折号。
  • 默认使用英式英语,除非
    voice.md
    中有其他指定。
  • 如果项目中存在
    brand-kit.md
    ,需读取并使用其中的精确品牌色彩。
  • 建议用户在所有视频中保持一致的缩略图风格,以提升频道辨识度。