baoyu-comic
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseKnowledge Comic Creator
知识漫画生成器
Adapted from baoyu-comic for Hermes Agent's tool ecosystem.
Create original knowledge comics with flexible art style × tone combinations.
改编自 baoyu-comic,适配Hermes Agent的工具生态。
创建原创知识漫画,支持灵活的艺术风格与基调组合。
When to Use
使用场景
Trigger this skill when the user asks to create a knowledge/educational comic, biography comic, tutorial comic, or uses terms like "知识漫画", "教育漫画", or "Logicomix-style". The user provides content (text, file path, URL, or topic) and optionally specifies art style, tone, layout, aspect ratio, or language.
当用户要求创建知识/教育漫画、传记漫画、教程漫画,或使用“知识漫画”“教育漫画”“Logicomix风格”等术语时,触发此技能。用户需提供内容(文本、文件路径、URL或主题),还可选择性指定艺术风格、基调、布局、宽高比或语言。
Reference Images
参考图像
Hermes' tool is prompt-only — it accepts a text prompt and an aspect ratio, and returns an image URL. It does NOT accept reference images. When the user supplies a reference image, use it to extract traits in text that get embedded in every page prompt:
image_generateIntake: Accept file paths when the user provides them (or pastes images in conversation).
- File path(s) → copy to alongside the comic output for provenance
refs/NN-ref-{slug}.{ext} - Pasted image with no path → ask the user for the path via , or extract style traits verbally as a text fallback
clarify - No reference → skip this section
Usage modes (per reference):
| Usage | Effect |
|---|---|
| Extract style traits (line treatment, texture, mood) and append to every page's prompt body |
| Extract hex colors and append to every page's prompt body |
| Extract scene composition or subject notes and append to the relevant page(s) |
Record in each page's prompt frontmatter when refs exist:
yaml
references:
- ref_id: 01
filename: 01-ref-scene.png
usage: style
traits: "muted earth tones, soft-edged ink wash, low-contrast backgrounds"Character consistency is driven by text descriptions in (written in Step 3) that get embedded inline in every page prompt (Step 5). The optional PNG character sheet generated in Step 7.1 is a human-facing review artifact, not an input to .
characters/characters.mdimage_generateHermes的工具仅支持文本提示——它接收文本提示和宽高比,返回图像URL。它不接受参考图像。当用户提供参考图像时,需从中提取文本特征,并嵌入到每一页的提示词中:
image_generate接收方式:当用户提供文件路径(或在对话中粘贴图像)时接受:
- 文件路径 → 复制到漫画输出目录旁的,用于溯源
refs/NN-ref-{slug}.{ext} - 粘贴的无路径图像 → 通过工具询问用户路径,或作为文本回退方案,口头提取风格特征
clarify - 无参考图像 → 跳过此部分
参考图像使用模式(每幅参考图像对应一种模式):
| 使用模式 | 效果 |
|---|---|
| 提取风格特征(线条处理、纹理、氛围),追加到每一页的提示词主体中 |
| 提取十六进制颜色,追加到每一页的提示词主体中 |
| 提取场景构图或主体说明,追加到相关页面的提示词中 |
当存在参考图像时,在每页提示词的前置元数据中记录:
yaml
references:
- ref_id: 01
filename: 01-ref-scene.png
usage: style
traits: "muted earth tones, soft-edged ink wash, low-contrast backgrounds"角色一致性由(步骤3中编写)中的文本描述驱动,这些描述会嵌入到每一页的提示词中(步骤5)。步骤7.1中生成的可选PNG角色表是面向人工审核的产物,并非的输入。
characters/characters.mdimage_generateOptions
选项
Visual Dimensions
视觉尺寸
| Option | Values | Description |
|---|---|---|
| Art | ligne-claire (default), manga, realistic, ink-brush, chalk, minimalist | Art style / rendering technique |
| Tone | neutral (default), warm, dramatic, romantic, energetic, vintage, action | Mood / atmosphere |
| Layout | standard (default), cinematic, dense, splash, mixed, webtoon, four-panel | Panel arrangement |
| Aspect | 3:4 (default, portrait), 4:3 (landscape), 16:9 (widescreen) | Page aspect ratio |
| Language | auto (default), zh, en, ja, etc. | Output language |
| Refs | File paths | Reference images used for style / palette trait extraction (not passed to the image model). See Reference Images above. |
| 选项 | 可选值 | 描述 |
|---|---|---|
| Art | ligne-claire(默认)、manga、realistic、ink-brush、chalk、minimalist | 艺术风格/渲染技术 |
| Tone | neutral(默认)、warm、dramatic、romantic、energetic、vintage、action | 情绪/氛围 |
| Layout | standard(默认)、cinematic、dense、splash、mixed、webtoon、four-panel | 分镜布局 |
| Aspect | 3:4(默认,竖版)、4:3(横版)、16:9(宽屏) | 页面宽高比 |
| Language | auto(默认)、zh、en、ja等 | 输出语言 |
| Refs | 文件路径 | 用于提取风格/调色板特征的参考图像(不传入图像模型)。详见上方参考图像部分 |
Partial Workflow Options
部分工作流选项
| Option | Description |
|---|---|
| Storyboard only | Generate storyboard only, skip prompts and images |
| Prompts only | Generate storyboard + prompts, skip images |
| Images only | Generate images from existing prompts directory |
| Regenerate N | Regenerate specific page(s) only (e.g., |
Details: references/partial-workflows.md
| 选项 | 描述 |
|---|---|
| Storyboard only | 仅生成分镜脚本,跳过提示词和图像生成 |
| Prompts only | 生成分镜脚本+提示词,跳过图像生成 |
| Images only | 从现有提示词目录生成图像 |
| Regenerate N | 仅重新生成指定页面(例如 |
详情:references/partial-workflows.md
Art, Tone & Preset Catalogue
艺术风格、基调与预设目录
-
Art styles (6):,
ligne-claire,manga,realistic,ink-brush,chalk. Full definitions atminimalist.references/art-styles/<style>.md -
Tones (7):,
neutral,warm,dramatic,romantic,energetic,vintage. Full definitions ataction.references/tones/<tone>.md -
Presets (5) with special rules beyond plain art+tone:
Preset Equivalent Hook ohmshamanga + neutral Visual metaphors, no talking heads, gadget reveals wuxiaink-brush + action Qi effects, combat visuals, atmospheric shoujomanga + romantic Decorative elements, eye details, romantic beats concept-storymanga + warm Visual symbol system, growth arc, dialogue+action balance four-panelminimalist + neutral + four-panel layout 起承转合 structure, B&W + spot color, stick-figure characters Full rules at— load the file when a preset is picked.references/presets/<preset>.md -
Compatibility matrix and content-signal → preset table live in references/auto-selection.md. Read it before recommending combinations in Step 2.
-
艺术风格(6种):、
ligne-claire、manga、realistic、ink-brush、chalk。完整定义见minimalist。references/art-styles/<style>.md -
基调(7种):、
neutral、warm、dramatic、romantic、energetic、vintage。完整定义见action。references/tones/<tone>.md -
预设(5种),包含超出单纯艺术风格+基调的特殊规则:
预设 等效组合 特色 ohmshamanga + neutral 视觉隐喻,无对话头像,道具展示 wuxiaink-brush + action 气效表现,战斗画面,氛围感 shoujomanga + romantic 装饰元素,眼部细节,浪漫情节 concept-storymanga + warm 视觉符号系统,成长弧线,对话与动作平衡 four-panelminimalist + neutral + four-panel layout 起承转合结构,黑白+局部色彩,简笔角色 完整规则见——当选择预设时加载该文件。references/presets/<preset>.md -
兼容性矩阵和内容信号→预设对照表位于references/auto-selection.md。在步骤2中推荐组合前,请先阅读该文件。
File Structure
文件结构
Output directory:
comic/{topic-slug}/- Slug: 2-4 words kebab-case from topic (e.g., )
alan-turing-bio - Conflict: append timestamp (e.g., )
turing-story-20260118-143052
Contents:
| File | Description |
|---|---|
| Saved source content (kebab-case slug matches the output directory) |
| Content analysis |
| Storyboard with panel breakdown |
| Character definitions |
| Character reference sheet (downloaded from |
| Generation prompts |
| Generated images (downloaded from |
| User-supplied reference images (optional, for provenance) |
输出目录:
comic/{topic-slug}/- Slug:从主题提取2-4个单词,采用短横线分隔格式(例如)
alan-turing-bio - 冲突处理:追加时间戳(例如)
turing-story-20260118-143052
目录内容:
| 文件 | 描述 |
|---|---|
| 保存的源内容(短横线分隔的slug与输出目录匹配) |
| 内容分析报告 |
| 带分镜拆解的脚本 |
| 角色定义 |
| 角色参考表(从 |
| 生成提示词 |
| 生成的图像(从 |
| 用户提供的参考图像(可选,用于溯源) |
Language Handling
语言处理
Detection Priority:
- User-specified language (explicit option)
- User's conversation language
- Source content language
Rule: Use user's input language for ALL interactions:
- Storyboard outlines and scene descriptions
- Image generation prompts
- User selection options and confirmations
- Progress updates, questions, errors, summaries
Technical terms remain in English.
检测优先级:
- 用户指定的语言(显式选项)
- 用户的对话语言
- 源内容语言
规则:所有交互均使用用户输入语言:
- 分镜大纲和场景描述
- 图像生成提示词
- 用户选择选项与确认信息
- 进度更新、问题、错误提示、总结
技术术语保留英文。
Workflow
工作流
Progress Checklist
进度检查表
Comic Progress:
- [ ] Step 1: Setup & Analyze
- [ ] 1.1 Analyze content
- [ ] 1.2 Check existing directory
- [ ] Step 2: Confirmation - Style & options ⚠️ REQUIRED
- [ ] Step 3: Generate storyboard + characters
- [ ] Step 4: Review outline (conditional)
- [ ] Step 5: Generate prompts
- [ ] Step 6: Review prompts (conditional)
- [ ] Step 7: Generate images
- [ ] 7.1 Generate character sheet (if needed) → characters/characters.png
- [ ] 7.2 Generate pages (with character descriptions embedded in prompt)
- [ ] Step 8: Completion reportComic Progress:
- [ ] Step 1: Setup & Analyze
- [ ] 1.1 Analyze content
- [ ] 1.2 Check existing directory
- [ ] Step 2: Confirmation - Style & options ⚠️ REQUIRED
- [ ] Step 3: Generate storyboard + characters
- [ ] Step 4: Review outline (conditional)
- [ ] Step 5: Generate prompts
- [ ] Step 6: Review prompts (conditional)
- [ ] Step 7: Generate images
- [ ] 7.1 Generate character sheet (if needed) → characters/characters.png
- [ ] 7.2 Generate pages (with character descriptions embedded in prompt)
- [ ] Step 8: Completion reportFlow
流程
Input → Analyze → [Check Existing?] → [Confirm: Style + Reviews] → Storyboard → [Review?] → Prompts → [Review?] → Images → CompleteInput → Analyze → [Check Existing?] → [Confirm: Style + Reviews] → Storyboard → [Review?] → Prompts → [Review?] → Images → CompleteStep Summary
步骤摘要
| Step | Action | Key Output |
|---|---|---|
| 1.1 | Analyze content | |
| 1.2 | Check existing directory | Handle conflicts |
| 2 | Confirm style, focus, audience, reviews | User preferences |
| 3 | Generate storyboard + characters | |
| 4 | Review outline (if requested) | User approval |
| 5 | Generate prompts | |
| 6 | Review prompts (if requested) | User approval |
| 7.1 | Generate character sheet (if needed) | |
| 7.2 | Generate pages | |
| 8 | Completion report | Summary |
| 步骤 | 操作 | 关键输出 |
|---|---|---|
| 1.1 | 分析内容 | |
| 1.2 | 检查现有目录 | 处理冲突 |
| 2 | 确认风格、重点、受众、审核选项 | 用户偏好 |
| 3 | 生成分镜脚本+角色定义 | |
| 4 | 审核大纲(若用户要求) | 用户批准 |
| 5 | 生成提示词 | |
| 6 | 审核提示词(若用户要求) | 用户批准 |
| 7.1 | 生成角色表(若需要) | |
| 7.2 | 生成页面图像 | |
| 8 | 生成完成报告 | 总结 |
User Questions
用户疑问
Use the tool to confirm options. Since handles one question at a time, ask the most important question first and proceed sequentially. See references/workflow.md for the full Step 2 question set.
clarifyclarifyTimeout handling (CRITICAL): can return — this is NOT user consent to default everything.
clarify"The user did not provide a response within the time limit. Use your best judgement to make the choice and proceed."- Treat it as a default for that one question only. Continue asking the remaining Step 2 questions in sequence; each question is an independent consent point.
- Surface the default to the user visibly in your next message so they have a chance to correct it: e.g. — an unreported default is indistinguishable from never having asked.
"Style: defaulted to ohmsha preset (clarify timed out). Say the word to switch." - Do NOT collapse Step 2 into a single "use all defaults" pass after one timeout. If the user is genuinely absent, they will be equally absent for all five questions — but they can correct visible defaults when they return, and cannot correct invisible ones.
使用工具确认选项。由于一次仅处理一个问题,请先询问最重要的问题,再按顺序进行。完整的步骤2问题集见references/workflow.md。
clarifyclarify超时处理(关键):可能返回——这并不代表用户同意所有默认选项。
clarify"The user did not provide a response within the time limit. Use your best judgement to make the choice and proceed."- 仅将其视为该单个问题的默认选择。继续按顺序询问步骤2的剩余问题;每个问题都是独立的确认点。
- 在后续消息中明确告知用户默认选择,以便他们有机会更正:例如——未告知的默认选择等同于从未询问过。
"风格:默认使用ohmsha预设(clarify超时)。如需切换请告知。" - 不要在一次超时后将步骤2简化为“全部使用默认选项”。如果用户确实未回复,他们对所有五个问题都会未回复,但当他们返回时可以更正可见的默认选择,而无法更正不可见的。
Step 7: Image Generation
步骤7:图像生成
Use Hermes' built-in tool for all image rendering. Its schema accepts only and ( | | ); it returns a URL, not a local file. Every generated page or character sheet must therefore be downloaded to the output directory.
image_generatepromptaspect_ratiolandscapeportraitsquarePrompt file requirement (hard): write each image's full, final prompt to a standalone file under (naming: ) BEFORE calling . The prompt file is the reproducibility record.
prompts/NN-{type}-[slug].mdimage_generateAspect ratio mapping — the storyboard's field maps to 's format as follows:
aspect_ratioimage_generate| Storyboard ratio | |
|---|---|
| |
| |
| |
Download step — after every call:
image_generate- Read the URL from the tool result
- Fetch the image bytes using an absolute output path, e.g.
curl -fsSL "<url>" -o /abs/path/to/comic/<slug>/NN-page-<slug>.png - Verify the file exists and is non-empty at that exact path before proceeding to the next page
Never rely on shell CWD persistence for paths. The terminal tool's persistent-shell CWD can change between batches (session expiry, , a failed that leaves you in the wrong directory). is a silent footgun: if CWD has drifted, the file lands somewhere else with no error. Always pass a fully-qualified absolute path to , or pass to the terminal tool. Incident Apr 2026: pages 06-09 of a 10-page comic landed at the repo root instead of because batch 3 inherited a stale CWD from batch 2 and wrote to the wrong directory. The agent then spent several turns claiming the files existed where they didn't.
-oTERMINAL_LIFETIME_SECONDScdcurl -o relative/path.png-oworkdir=<abs path>comic/<slug>/curl -o 06-page-skills.png7.1 Character sheet — generate it (to , aspect ) when the comic is multi-page with recurring characters. Skip for simple presets (e.g., four-panel minimalist) or single-page comics. The prompt file at must exist before invoking . The rendered PNG is a human-facing review artifact (so the user can visually verify character design) and a reference for later regenerations or manual prompt edits — it does not drive Step 7.2. Page prompts are already written in Step 5 from the text descriptions in ; cannot accept images as visual input.
characters/characters.pnglandscapecharacters/characters.mdimage_generatecharacters/characters.mdimage_generate7.2 Pages — each page's prompt MUST already be at before invoking . Because is prompt-only, character consistency is enforced by embedding character descriptions (sourced from ) inline in every page prompt during Step 5. The embedding is done uniformly whether or not a PNG sheet is produced in 7.1; the PNG is only a review/regeneration aid.
prompts/NN-{cover|page}-[slug].mdimage_generateimage_generatecharacters/characters.mdBackup rule: existing and files → rename with suffix before regenerating.
prompts/…md…png-backup-YYYYMMDD-HHMMSSFull step-by-step workflow (analysis, storyboard, review gates, regeneration variants): references/workflow.md.
所有图像渲染均使用Hermes内置的工具。其模式仅接受和( | | );它返回URL,而非本地文件。因此,每个生成的页面或角色表都必须下载到输出目录。
image_generatepromptaspect_ratiolandscapeportraitsquare提示词文件要求(硬性规定):在调用之前,将每个图像的完整最终提示词写入下的独立文件(命名格式:)。提示词文件是可复现性记录。
image_generateprompts/NN-{type}-[slug].md宽高比映射——分镜脚本中的字段与的格式映射如下:
aspect_ratioimage_generate| 分镜脚本宽高比 | |
|---|---|
| |
| |
| |
下载步骤——每次调用后:
image_generate- 从工具结果中读取URL
- 使用绝对输出路径获取图像字节,例如:
curl -fsSL "<url>" -o /abs/path/to/comic/<slug>/NN-page-<slug>.png - 在进行下一页之前,验证该文件是否存在且非空
永远不要依赖shell的当前工作目录(CWD)持久性来指定路径。终端工具的持久shell CWD可能在批次之间变化(会话过期、、失败的命令导致处于错误目录)。是一个隐蔽的陷阱:如果CWD发生变化,文件会被写入其他位置且无错误提示。始终向传递完全限定的绝对路径,或向终端工具传递。2026年4月事件:某10页漫画的第6-9页被写入仓库根目录而非,因为批次3继承了批次2的过期CWD,写入了错误目录。随后Agent多次声称文件存在于预期路径,但实际并非如此。
-oTERMINAL_LIFETIME_SECONDScdcurl -o relative/path.png-oworkdir=<abs path>comic/<slug>/curl -o 06-page-skills.png7.1 角色表——当漫画为多页且有重复角色时,生成角色表(保存到,宽高比为)。对于简单预设(例如四格极简风格)或单页漫画,可跳过此步骤。调用之前,必须存在提示词文件。渲染后的PNG是面向人工审核的产物(以便用户直观验证角色设计),也是后续重新生成或手动编辑提示词的参考——它不驱动步骤7.2。页面提示词已在步骤5中根据中的文本描述编写完成;不接受图像作为视觉输入。
characters/characters.pnglandscapeimage_generatecharacters/characters.mdcharacters/characters.mdimage_generate7.2 页面生成——在调用之前,每页的提示词必须已存在于。由于仅支持文本提示,角色一致性通过在步骤5中将角色描述(源自)嵌入每页提示词来保证。无论是否在7.1中生成PNG角色表,嵌入操作都会统一执行;PNG仅作为审核/重新生成的辅助工具。
image_generateprompts/NN-{cover|page}-[slug].mdimage_generatecharacters/characters.md备份规则:现有和文件→在重新生成前重命名为后缀。
prompts/…md…png-backup-YYYYMMDD-HHMMSS完整的分步工作流(分析、分镜脚本、审核关卡、重新生成变体):references/workflow.md。
References
参考资料
Core Templates:
- analysis-framework.md - Deep content analysis
- character-template.md - Character definition format
- storyboard-template.md - Storyboard structure
- ohmsha-guide.md - Ohmsha manga specifics
Style Definitions:
- - Art styles (ligne-claire, manga, realistic, ink-brush, chalk, minimalist)
references/art-styles/ - - Tones (neutral, warm, dramatic, romantic, energetic, vintage, action)
references/tones/ - - Presets with special rules (ohmsha, wuxia, shoujo, concept-story, four-panel)
references/presets/ - - Layouts (standard, cinematic, dense, splash, mixed, webtoon, four-panel)
references/layouts/
Workflow:
- workflow.md - Full workflow details
- auto-selection.md - Content signal analysis
- partial-workflows.md - Partial workflow options
核心模板:
- analysis-framework.md - 深度内容分析
- character-template.md - 角色定义格式
- storyboard-template.md - 分镜脚本结构
- ohmsha-guide.md - Ohmsha漫画规范
风格定义:
- - 艺术风格(ligne-claire、manga、realistic、ink-brush、chalk、minimalist)
references/art-styles/ - - 基调(neutral、warm、dramatic、romantic、energetic、vintage、action)
references/tones/ - - 带特殊规则的预设(ohmsha、wuxia、shoujo、concept-story、four-panel)
references/presets/ - - 布局(standard、cinematic、dense、splash、mixed、webtoon、four-panel)
references/layouts/
工作流:
- workflow.md - 完整工作流详情
- auto-selection.md - 内容信号分析
- partial-workflows.md - 部分工作流选项
Page Modification
页面修改
| Action | Steps |
|---|---|
| Edit | Update prompt file FIRST → regenerate image → download new PNG |
| Add | Create prompt at position → generate with character descriptions embedded → renumber subsequent → update storyboard |
| Delete | Remove files → renumber subsequent → update storyboard |
IMPORTANT: When updating pages, ALWAYS update the prompt file () FIRST before regenerating. This ensures changes are documented and reproducible.
prompts/NN-{cover|page}-[slug].md| 操作 | 步骤 |
|---|---|
| 编辑 | 首先更新提示词文件 → 重新生成图像 → 下载新的PNG |
| 添加 | 在指定位置创建提示词 → 嵌入角色描述生成图像 → 重新编号后续页面 → 更新分镜脚本 |
| 删除 | 删除文件 → 重新编号后续页面 → 更新分镜脚本 |
重要提示:更新页面时,务必先更新提示词文件(),再重新生成图像。这确保更改被记录且可复现。
prompts/NN-{cover|page}-[slug].mdPitfalls
注意事项
- Image generation: 10-30 seconds per page; auto-retry once on failure
- Always download the URL returned by to a local PNG — downstream tooling (and the user's review) expects files in the output directory, not ephemeral URLs
image_generate - Use absolute paths for — never rely on persistent-shell CWD across batches. Silent footgun: files land in the wrong directory and subsequent
curl -oon the intended path shows nothing. See Step 7 "Download step".ls - Use stylized alternatives for sensitive public figures
- Step 2 confirmation required - do not skip
- Steps 4/6 conditional - only if user requested in Step 2
- Step 7.1 character sheet - recommended for multi-page comics, optional for simple presets. The PNG is a review/regeneration aid; page prompts (written in Step 5) use the text descriptions in , not the PNG.
characters/characters.mddoes not accept images as visual inputimage_generate - Strip secrets — scan source content for API keys, tokens, or credentials before writing any output file
- 图像生成:每页耗时10-30秒;失败时自动重试一次
- 务必下载返回的URL对应的本地PNG——下游工具(以及用户审核)期望文件位于输出目录,而非临时URL
image_generate - 为使用绝对路径——永远不要依赖批次间的持久shell CWD。隐蔽陷阱:文件被写入错误目录,后续对预期路径执行
curl -o会显示无文件。详见步骤7“下载步骤”。ls - 对敏感公众人物使用风格化替代形象
- 步骤2确认是必需的 - 不可跳过
- 步骤4/6为可选 - 仅当用户在步骤2中要求时执行
- 步骤7.1角色表 - 多页漫画推荐生成,简单预设可选。PNG是审核/重新生成的辅助工具;页面提示词(步骤5编写)使用中的文本描述,而非PNG。
characters/characters.md不接受图像作为视觉输入image_generate - 剥离敏感信息 —— 在写入任何输出文件前,扫描源内容中的API密钥、令牌或凭据