printing-press-polish

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

/printing-press-polish

Polish a generated CLI so it passes verification and is ready to publish.

The retro improves the Printing Press. Polish improves the generated CLI. This skill runs in a forked context (

context: fork

) so its diagnostic and fix loop doesn't pollute the caller — the diagnostic spam, fix iterations, and re-diagnose noise stay scoped to the polish session, and the caller receives a clean summary.

bash

/printing-press-polish redfin
/printing-press-polish redfin-pp-cli
/printing-press-polish ~/printing-press/library/redfin

打磨生成的CLI，使其通过验证并具备发布条件。

retro用于优化Printing Press本身，而Polish用于优化生成的CLI。该技能在分叉上下文（

context: fork

）中运行，因此其诊断和修复循环不会污染调用者的环境——诊断日志、修复迭代和重新诊断的冗余信息都被限定在打磨会话范围内，调用者只会收到简洁的总结。

bash

/printing-press-polish redfin
/printing-press-polish redfin-pp-cli
/printing-press-polish ~/printing-press/library/redfin

When to run

运行时机

After any

/printing-press

generation, especially when:

The shipcheck verdict is
```
ship-with-gaps
```
The verify pass rate is below 80%
The scorecard is below 85
You want the CLI publish-ready in one pass

Can also be run standalone on any CLI in

~/printing-press/library/

在执行任何

/printing-press

生成命令后运行，尤其是在以下场景：

shipcheck的判定结果为
```
ship-with-gaps
```
验证通过率低于80%
scorecard得分低于85
希望一次性将CLI打磨至可发布状态

也可独立运行于

~/printing-press/library/

目录下的任意CLI。

Setup

配置

bash

PRESS_HOME="$HOME/printing-press"
PRESS_LIBRARY="$PRESS_HOME/library"

bash

PRESS_HOME="$HOME/printing-press"
PRESS_LIBRARY="$PRESS_HOME/library"

Public-library hint

公共库提示

If the user's request includes phrasing like "polish notion in the public library", "polish from the public library", or "polish the published cal-com" — and the named CLI is not in

$PRESS_LIBRARY/<slug>/

— they're asking to polish a CLI that lives upstream but not locally. Polish runs against the internal library, so the right move is to import first.

Suggest:

/printing-press-import <slug>

to bring it in, then re-run polish. Don't try to polish a CLI that isn't in the internal library.

If the named CLI is already in

$PRESS_LIBRARY/<slug>/

, the "public library" phrasing is informational — just proceed with polish and let the divergence check (below) handle any drift.

如果用户的请求中包含类似“polish notion in the public library”、“polish from the public library”或“polish the published cal-com”的表述，且指定的CLI不在

$PRESS_LIBRARY/<slug>/

目录下，则用户希望打磨的是上游公共库中的CLI而非本地版本。Polish仅针对内部库运行，因此正确的操作是先导入该CLI。

建议：执行

/printing-press-import <slug>

将其导入，然后重新运行打磨命令。不要尝试打磨不在内部库中的CLI。

如果指定的CLI已经存在于

$PRESS_LIBRARY/<slug>/

目录下，那么“public library”的表述仅为信息说明——直接继续打磨即可，后续的差异检查（如下文所述）会处理任何版本漂移问题。

Resolve CLI

解析CLI路径

The argument can be:

A short name:
```
redfin
```
(looks up
```
$PRESS_LIBRARY/redfin
```
)
A full name:
```
redfin-pp-cli
```
(strips suffix, looks up
```
$PRESS_LIBRARY/redfin
```
)
A path:
```
~/printing-press/library/redfin
```
(used directly)

Resolution order:

If the argument is an absolute or
```
~
```
-prefixed path and exists, use it
Try
```
$PRESS_LIBRARY/<arg>
```
(exact match — works for slug like
```
redfin
```
)

If arg has

-pp-cli

suffix, strip it and try

$PRESS_LIBRARY/<slug>

(e.g.,

redfin-pp-cli

→

redfin

)

Fuzzy search:
```
ls $PRESS_LIBRARY/ | grep -i <arg>
```
for close matches

Caller scenarios. Polish has two callers and they pass different argument forms:

Standalone (user-invoked,
/printing-press-polish redfin
). The arg is a slug or binary name; resolution lands on
```
$PRESS_LIBRARY/<slug>/
```
. This is the published copy and the right target.
Mid-pipeline (main printing-press skill Phase 5.5). The arg is
```
$CLI_WORK_DIR
```
— an absolute path to
```
~/printing-press/.runstate/.../runs/.../working/<api>-pp-cli/
```
. Resolution must hit rule 1. Do not paraphrase this to the slug — Phase 5.5 fires before the working CLI is promoted, so
```
$PRESS_LIBRARY/<slug>/
```
either doesn't exist or holds the prior run's stale CLI.

The lock-status check in the next code block is the safety net for the mid-pipeline scenario: if a build lock is held for this CLI (under either name form), polish refuses to run.

printing-press lock

normalizes slug ↔ binary-name internally, so the check works regardless of which form the basename produces.

If no match or multiple matches, present via

AskUserQuestion

. Show at most 4 matches sorted by modification time (most recent first) with human-friendly relative timestamps (e.g., "generated 2 hours ago").

bash

CLI_DIR="<resolved path>"
CLI_NAME="$(basename "$CLI_DIR")"

参数可以是：

短名称：
```
redfin
```
（会查找
```
$PRESS_LIBRARY/redfin
```
）
完整名称：
```
redfin-pp-cli
```
（会去除后缀，查找
```
$PRESS_LIBRARY/redfin
```
）
路径：
```
~/printing-press/library/redfin
```
（直接使用该路径）

解析顺序：

如果参数是绝对路径或带
```
~
```
前缀的路径且存在，则直接使用
尝试
```
$PRESS_LIBRARY/<arg>
```
（精确匹配——适用于
```
redfin
```
这类slug）
如果参数带有
```
-pp-cli
```
后缀，去除后缀后尝试
```
$PRESS_LIBRARY/<slug>
```
（例如：
```
redfin-pp-cli
```
→
```
redfin
```
）
模糊搜索：
```
ls $PRESS_LIBRARY/ | grep -i <arg>
```
查找近似匹配项

调用场景：Polish有两种调用者，它们传递的参数形式不同：

独立调用（用户触发，
/printing-press-polish redfin
）：参数为slug或二进制名称；解析后指向
```
$PRESS_LIBRARY/<slug>/
```
。这是已发布的副本，也是正确的目标。
流水线中调用（主printing-press技能的Phase 5.5阶段）：参数为
```
$CLI_WORK_DIR
```
——指向
```
~/printing-press/.runstate/.../runs/.../working/<api>-pp-cli/
```
的绝对路径。解析必须匹配规则1。不要将其改写为slug——因为Phase 5.5运行时，工作中的CLI尚未被推广到库中，
```
$PRESS_LIBRARY/<slug>/
```
要么不存在，要么保存的是上一次运行的旧CLI。

下一段代码块中的锁状态检查是流水线场景的安全保障：如果该CLI（无论哪种名称形式）存在活跃的构建锁，Polish会拒绝运行。

printing-press lock

会在内部规范化slug与二进制名称的对应关系，因此无论basename生成哪种形式，检查都能正常工作。

如果未找到匹配项或存在多个匹配项，通过

AskUserQuestion

提示用户。最多显示4个按修改时间排序（最新优先）的匹配项，并附带人性化的相对时间戳（例如：“2小时前生成”）。

bash

CLI_DIR="<resolved path>"
CLI_NAME="$(basename "$CLI_DIR")"

Check if there's an active build lock — polish edits would be overwritten

检查是否存在活跃的构建锁——如果存在，打磨的修改会在构建完成推广到库时被覆盖

when the running build promotes to library.

—

_lock_json=$(printing-press lock status --cli "$CLI_NAME" --json 2>/dev/null) if echo "$_lock_json" | grep -q '"held".*true'; then if echo "$_lock_json" | grep -q '"stale".*true'; then echo "Warning: stale lock exists for $CLI_NAME (build may have crashed)." echo "Proceeding with polish. Run 'printing-press lock release --cli $CLI_NAME' to clear." else echo "An active build is in progress for $CLI_NAME." echo "Polish edits would be overwritten when the build promotes." echo "Wait for the build to finish, then run polish." exit 1 fi fi

Verify it's a valid Go CLI

验证是否为有效的Go CLI

if [ ! -f "$CLI_DIR/go.mod" ]; then echo "Not a valid CLI directory: $CLI_DIR" exit 1 fi

echo "Polishing: $CLI_NAME" echo "Location: $CLI_DIR"

undefined

if [ ! -f "$CLI_DIR/go.mod" ]; then echo "Not a valid CLI directory: $CLI_DIR" exit 1 fi

echo "Polishing: $CLI_NAME" echo "Location: $CLI_DIR"

undefined

Find spec and research dir

查找spec和研究目录

bash

API_SLUG="${CLI_NAME%-pp-cli}"
SPEC_PATH=""
for f in "$PRESS_HOME/manuscripts/$API_SLUG"/*/research/*.yaml "$PRESS_HOME/manuscripts/$API_SLUG"/*/research/*.json "$PRESS_HOME/manuscripts/$CLI_NAME"/*/research/*.yaml "$PRESS_HOME/manuscripts/$CLI_NAME"/*/research/*.json; do
  if [ -f "$f" ]; then
    SPEC_PATH="$f"
    break
  fi
done

bash

API_SLUG="${CLI_NAME%-pp-cli}"
SPEC_PATH=""
for f in "$PRESS_HOME/manuscripts/$API_SLUG"/*/research/*.yaml "$PRESS_HOME/manuscripts/$API_SLUG"/*/research/*.json "$PRESS_HOME/manuscripts/$CLI_NAME"/*/research/*.yaml "$PRESS_HOME/manuscripts/$CLI_NAME"/*/research/*.json; do
  if [ -f "$f" ]; then
    SPEC_PATH="$f"
    break
  fi
done

Build the spec flag once. Empty when no spec was found — diagnostic

一次性构建spec参数。如果未找到spec则为空——诊断命令会接受缺失的--spec参数并优雅降级。

commands accept a missing --spec and degrade gracefully.

—

SPEC_FLAG="" if [ -n "$SPEC_PATH" ]; then SPEC_FLAG="--spec $SPEC_PATH" fi

Locate the research dir (parent of the spec's research/ folder, i.e.

定位研究目录（spec所在research/文件夹的父目录，即manuscripts/<api>/<run-id>/）。dogfood的--research-dir参数会触发checkNovelFeatures，将novel_features_built写入research.json，并将已验证的列表同步到.printing-press.json中。

manuscripts/<api>/<run-id>/). dogfood's --research-dir triggers

如果没有这个参数，清单早于novel_features schema的旧版CLI会在publish-validate的transcendence gate中失败。

checkNovelFeatures, which writes novel_features_built back into

—

research.json AND syncs the verified list into .printing-press.json.

—

Without this flag, legacy CLIs whose manifest predates the

—

novel_features schema fail publish-validate's transcendence gate.

—

RESEARCH_DIR="" for d in "$PRESS_HOME/manuscripts/$API_SLUG"//research.json "$PRESS_HOME/manuscripts/$CLI_NAME"//research.json; do if [ -f "$d" ]; then RESEARCH_DIR="$(dirname "$d")" break fi done

RESEARCH_FLAG="" if [ -n "$RESEARCH_DIR" ]; then RESEARCH_FLAG="--research-dir $RESEARCH_DIR" fi

undefined

RESEARCH_DIR="" for d in "$PRESS_HOME/manuscripts/$API_SLUG"//research.json "$PRESS_HOME/manuscripts/$CLI_NAME"//research.json; do if [ -f "$d" ]; then RESEARCH_DIR="$(dirname "$d")" break fi done

RESEARCH_FLAG="" if [ -n "$RESEARCH_DIR" ]; then RESEARCH_FLAG="--research-dir $RESEARCH_DIR" fi

undefined

Divergence check

差异检查

Stop and run this step before Phase 1. Do not skip it. Do not proceed to diagnostics until you have completed the check and resolved any divergence.

The internal copy at

$CLI_DIR

can drift from the public library (

mvanhorn/printing-press-library

) copy if anyone edited the public repo directly after this CLI was last published. Polishing a stale internal copy and re-publishing later silently overwrites those public-only fixes — a real failure mode that shipped CLIs hit.

You must:

Locate the public library clone. Honor
```
$PRINTING_PRESS_LIBRARY_PUBLIC
```
if set; otherwise scan the user's filesystem however fits this platform. Validate every candidate by checking the git remote points at
```
mvanhorn/printing-press-library
```
— other directories may share the name (forks, accidental name collisions). If multiple valid clones exist, prefer the most recently modified; ask the user to disambiguate only if still unclear.

Locate this CLI inside the clone.

find <clone>/library -type d -name "<api>-pp-cli"

or equivalent.

Run
diff -r <public-cli-dir> $CLI_DIR
with these exclusions, all of which are expected to diverge after publish:
- ```
.printing-press-tools-polish.json
```
  (local ledger, not published)
- ```
go.mod
```
  and
```
go.sum
```
  — publish rewrites the module path from
```
<api>-pp-cli
```
  to
```
github.com/mvanhorn/printing-press-library/library/<category>/<api>
```
- All
```
.go
```
  files where the only difference is the rewritten import path (the publish step propagates the new module path through every internal import). When inspecting
```
.go
```
  diffs, scan for substantive changes — anything beyond the module-path prefix swap is real divergence.
Concretely:
```
diff -r --exclude=go.mod --exclude=go.sum --exclude=.printing-press-tools-polish.json <public-cli-dir> $CLI_DIR
```
.
Don't pass
```
--exclude='<api>-pp-cli'
```
or
```
--exclude='<api>-pp-mcp'
```
— those names match both the root-level binary files and the
```
cmd/<api>-pp-cli/
```
and
```
cmd/<api>-pp-mcp/
```
source directories. Excluding by binary name silently skips the entire
```
cmd/
```
subtree, hiding real divergence in
```
main.go
```
. The "Only in $CLI_DIR: <api>-pp-cli" line for the built binary is one row of expected output, not noise worth filtering at the cost of completeness.
Surface the result before continuing.

Outcomes:

No clone found → user doesn't have public locally. State this explicitly ("public library not found locally; proceeding on internal as canonical") and continue.
Clone found but doesn't contain this CLI → never published or under a different name. State this and continue.
Found and diff is empty → in sync. State this and continue.
Found and divergent → stop. Do not run Phase 1 diagnostics yet. List the divergent files for the user. Ask via AskUserQuestion: sync public→internal, or proceed without syncing. If the user picks sync, copy public's version of the divergent files into internal, then continue polish on the synced internal copy.

Before showing the sync prompt, check whether internal has files modified after its

.printing-press.json

timestamp (the user has been polishing locally without publishing). If yes, hedge the prompt explicitly: syncing will overwrite their pending local work. Let them decide whether to keep their local edits or pull public's.

After sync (or explicit skip), the rest of polish operates on

$CLI_DIR

as canonical. The eventual

/printing-press-publish

step pushes internal back to public; no second divergence check is needed there.

The check has run only when one of the four outcomes above is explicitly stated in your response. Silent omission counts as not having run it.

在进入Phase 1之前必须运行此步骤，不可跳过。在完成检查并解决所有差异之前，不要进行诊断。

$CLI_DIR

下的内部副本可能与公共库（

mvanhorn/printing-press-library

）中的副本存在差异，尤其是在CLI上次发布后有人直接编辑了公共仓库的情况下。打磨过时的内部副本并重新发布会静默覆盖公共库中仅有的修复——这是已发布CLI遇到过的实际问题。

你必须：

定位公共库的本地克隆：如果设置了
```
$PRINTING_PRESS_LIBRARY_PUBLIC
```
则优先使用；否则扫描用户文件系统查找符合条件的目录。验证每个候选目录的git远程仓库是否指向
```
mvanhorn/printing-press-library
```
——其他目录可能重名（例如分叉仓库、意外命名冲突）。如果存在多个有效克隆，优先选择最近修改的；如果仍不明确，再询问用户进行区分。
在克隆中定位该CLI：使用
```
find <clone>/library -type d -name "<api>-pp-cli"
```
或类似命令。
运行
diff -r <public-cli-dir> $CLI_DIR
，排除以下所有预期会在发布后产生差异的内容：
- ```
.printing-press-tools-polish.json
```
  （本地记录文件，不会发布）
- ```
go.mod
```
  和
```
go.sum
```
  ——发布步骤会将模块路径从
```
<api>-pp-cli
```
  重写为
```
github.com/mvanhorn/printing-press-library/library/<category>/<api>
```
- 所有仅存在模块路径重写差异的
```
.go
```
  文件（发布步骤会将新模块路径传播到所有内部导入中）。检查
```
.go
```
  文件的差异时，要扫描实质性变化——除模块路径前缀替换之外的任何变化都是真实差异。
具体命令：
```
diff -r --exclude=go.mod --exclude=go.sum --exclude=.printing-press-tools-polish.json <public-cli-dir> $CLI_DIR
```
。
不要使用
```
--exclude='<api>-pp-cli'
```
或
```
--exclude='<api>-pp-mcp'
```
——这些名称既匹配根目录下的二进制文件，也匹配
```
cmd/<api>-pp-cli/
```
和
```
cmd/<api>-pp-mcp/
```
源码目录。排除二进制名称会静默跳过整个
```
cmd/
```
子目录，隐藏
```
main.go
```
中的真实差异。
```
Only in $CLI_DIR: <api>-pp-cli
```
这一行是预期输出，为了完整性不应过滤。
在继续操作前展示检查结果

结果处理：

未找到克隆 → 用户本地没有公共库。明确告知用户（“未找到本地公共库；将以内部副本为基准继续”）并继续。
找到克隆但未包含该CLI → 从未发布过或使用了不同名称。告知用户并继续。
找到克隆且差异为空 → 版本同步。告知用户并继续。
找到克隆且存在差异 → 停止操作。暂不运行Phase 1诊断。向用户列出差异文件。通过
```
AskUserQuestion
```
询问用户：将公共库同步到内部副本，还是不进行同步直接继续。如果用户选择同步，将公共库中差异文件的版本复制到内部副本，然后在同步后的内部副本上继续打磨。

在显示同步提示之前，检查内部副本是否存在比

.printing-press.json

时间戳更新的文件（用户已在本地打磨但未发布）。如果存在，在提示中明确说明：同步会覆盖用户未提交的本地修改。让用户决定保留本地修改还是拉取公共库的版本。

同步（或明确跳过同步）后，后续的打磨操作都以

$CLI_DIR

为基准。最终的

/printing-press-publish

步骤会将内部副本推送到公共库，无需再次进行差异检查。

只有当你的回复中明确说明上述四种结果之一时，才表示已完成差异检查。静默跳过检查等同于未执行此步骤。

Phase 1: Baseline diagnostics

Phase 1: 基准诊断

bash

cd "$CLI_DIR"

bash

cd "$CLI_DIR"

Build

构建

go build -o "$CLI_NAME" ./cmd/"$CLI_NAME" 2>&1

Diagnostics. SPEC_FLAG and RESEARCH_FLAG are set in the "Find spec

诊断。SPEC_FLAG和RESEARCH_FLAG在“查找spec和研究目录”步骤中设置。RESEARCH_FLAG使dogfood能够验证新特性并将其同步到.printing-press.json中（这是publish-validate的transcendence gate所必需的）。

and research dir" step above. RESEARCH_FLAG enables dogfood to

—

verify novel features and sync them into .printing-press.json

—

(required for publish-validate's transcendence gate).

—

printing-press dogfood --dir "$CLI_DIR" $SPEC_FLAG $RESEARCH_FLAG 2>&1 printing-press verify --dir "$CLI_DIR" $SPEC_FLAG --json 2>&1 printing-press workflow-verify --dir "$CLI_DIR" --json > /tmp/polish-workflow-verify.json 2>&1 || true printing-press verify-skill --dir "$CLI_DIR" --json > /tmp/polish-verify-skill.json 2>&1 || true

--live-check samples novel-feature outputs and populates

--live-check会抽样新特性的输出并填充live_check.features[].warnings（Wave B实体检测）——这是下方“输出实体警告”行获取数据所必需的。

live_check.features[].warnings (Wave B entity detection) — required for

—

the "Output entity warnings" row below to have data to read.

—

printing-press scorecard --dir "$CLI_DIR" $SPEC_FLAG --live-check --json > /tmp/polish-scorecard.json 2>&1 || true printing-press scorecard --dir "$CLI_DIR" $SPEC_FLAG 2>&1 printing-press tools-audit "$CLI_DIR" --json > /tmp/polish-tools-audit-before.json 2>&1 || true go vet ./... 2>&1


verify-skill and workflow-verify run alongside dogfood/verify/scorecard so polish catches the same class of failures the public-library CI catches. Polish hard-gates `ship` on `verify-skill` exit 0 (see ship logic at the end).

**If Phase 1 baseline reveals the underlying CLI needs re-discovery** — broken HTML/SSR extraction, sparse capture (fewer than 5 unique endpoints in the source manuscript), wrong endpoint shapes, missing GraphQL operation hashes, or any signal that the CLI was generated from incomplete capture — polish does not normally do browser capture itself, but the shared playbook at `skills/printing-press/references/browser-sniff-capture.md` covers all available capture backends including the Claude chrome-MCP (`mcp__claude-in-chrome__*`) and computer-use (`mcp__computer-use__*`) when the runtime exposes them. Read Step 1 (tool detection), Step 2c.5 (failure-recovery menu), and Step 2e (chrome-MCP capture playbook) of that reference before improvising. Re-discovery from polish is rare but real; when it happens, use the shared backends — do not invent a new capture flow.

Parse findings into categories:

| Category | Source | What to look for |
|----------|--------|------------------|
| Verify failures | verify --json | Commands with score < 3 |
| SKILL static-check failures | verify-skill --json | Any `findings[]` with `severity=error` (flag-names, flag-commands, positional-args, unknown-command, canonical-sections). Hard ship-gate: ship cannot fire while these exist. |
| Workflow gaps | workflow-verify --json | Verdict `workflow-fail`. Soft gate: surface in `remaining_issues` and downgrade to `hold` when the workflow is the CLI's primary value. |
| Dead code | dogfood | Dead functions, dead flags |
| Stale files | dogfood | Unregistered commands |
| Description issues | dogfood | Boilerplate root Short |
| README gaps | scorecard | README score < 8 |
| Example gaps | dogfood | Commands missing examples |
| Go vet issues | go vet | Any output |
| Output entity warnings | scorecard JSON | `live_check.features[].warnings` — raw HTML entities in human output |
| Output plausibility | Phase 4.85 | Findings from the agentic output review |
| MCP tool quality | tools-audit | Empty Short, thin Short, missing read-only annotations, thin MCP descriptions |

**Environmental failures vs. CLI defects.** Some Phase 1 outputs surface failures that aren't real CLI bugs and should not block ship:

- `scorecard --live-check` reporting `SQLITE_BUSY`, network timeouts, `401` from a mock or expired token, or HTTP errors that depend on the test workspace's permissions/state — these are test-environment issues, not CLI defects.
- `verify` mock-harness flakes on commands with binary output (e.g., `qr` returning a PNG that the substring matcher can't validate) or commands with optional positional args where dry-run output legitimately doesn't contain the verify probe string.

Classify these as environmental in `skipped_findings` with the specific reason; do not spend Phase 2 cycles trying to "fix" them. The polish skill's ship logic already excludes live-check failures from gating, but the agent should still annotate them so reviewers can see they were considered and dismissed deliberately.


verify-skill和workflow-verify与dogfood/verify/scorecard一起运行，这样Polish就能捕获公共库CI会检测到的同类问题。Polish将`ship`的硬性条件设置为`verify-skill`退出码为0（见末尾的发布逻辑）。

**如果Phase 1基准诊断显示底层CLI需要重新发现**——例如HTML/SSR提取损坏、捕获内容稀疏（源手稿中少于5个唯一端点）、端点形状错误、缺少GraphQL操作哈希，或任何表明CLI是基于不完整捕获生成的信号——Polish通常不会自行进行浏览器捕获，但`skills/printing-press/references/browser-sniff-capture.md`中的共享手册涵盖了所有可用的捕获后端，包括Claude chrome-MCP（`mcp__claude-in-chrome__*`）和computer-use（`mcp__computer-use__*`）（如果运行时支持）。在临时处理之前，请阅读该参考手册的Step 1（工具检测）、Step 2c.5（故障恢复菜单）和Step 2e（chrome-MCP捕获手册）。从Polish触发重新发现的情况很少见，但确实存在；发生这种情况时，请使用共享后端——不要发明新的捕获流程。

将诊断结果分类：

| 类别 | 来源 | 检查要点 |
|----------|--------|------------------|
| 验证失败 | verify --json | 得分<3的命令 |
| SKILL静态检查失败 | verify-skill --json | 任何`severity=error`的`findings[]`（flag-names、flag-commands、positional-args、unknown-command、canonical-sections）。发布硬性条件：只要存在这些问题，就不能发布。 |
| 工作流缺口 | workflow-verify --json | 判定结果为`workflow-fail`。软性条件：在`remaining_issues`中列出，如果工作流是CLI的核心价值，则降级为`hold`。 |
| 死代码 | dogfood | 未使用的函数、未使用的标志 |
| 过期文件 | dogfood | 未注册的命令 |
| 描述问题 | dogfood | 根命令Short字段为模板化内容 |
| README缺口 | scorecard | README得分<8 |
| 示例缺口 | dogfood | 缺少示例的命令 |
| Go vet问题 | go vet | 任何输出内容 |
| 输出实体警告 | scorecard JSON | `live_check.features[].warnings`——人类可读输出中包含原始HTML实体 |
| 输出合理性 | Phase 4.85 | 智能输出审查的结果 |
| MCP工具质量 | tools-audit | Short字段为空、Short字段内容单薄、缺少只读注解、MCP描述内容单薄 |

**环境故障与CLI缺陷**：Phase 1的某些输出显示的故障并非真正的CLI bug，不应阻止发布：

- `scorecard --live-check`报告`SQLITE_BUSY`、网络超时、模拟环境或过期令牌导致的`401`，或依赖测试工作区权限/状态的HTTP错误——这些是测试环境问题，而非CLI缺陷。
- `verify`模拟测试框架在处理二进制输出命令（例如返回PNG的`qr`命令，子字符串匹配器无法验证）或带有可选位置参数的命令（dry-run输出合法地不包含验证探测字符串）时出现的偶发失败。

将这些情况归类为`skipped_findings`并说明具体原因；不要在Phase 2中尝试“修复”它们。Polish技能的发布逻辑已将live-check失败排除在发布条件之外，但智能体仍需标注这些情况，以便审核者看到它们已被考虑并被故意排除。

Phase 4.85 — Agentic output review (Wave B)

Phase 4.85 — 智能输出审查（Wave B）

After the mechanical diagnostics above complete, invoke the

printing-press-output-review

sub-skill via the Skill tool. The sub-skill carries

context: fork

and owns the dispatch prompt, gate logic, and known blind spots — single source of truth shared with the main printing-press skill.

Skill(
  skill: "cli-printing-press:printing-press-output-review",
  args: "$CLI_DIR"
)

Parse the returned

---OUTPUT-REVIEW-RESULT---

block.

status: WARN

findings flow into the diagnostic categories above so Phase 2 fixes address both rule-based and plausibility issues.

status: SKIP

is informational — record but don't block.

Wave B gating applies: all findings are warnings, never blockers. Fix if obvious and cheap; document with a short comment if deferred.

Record baseline scores: scorecard total, verify pass rate, dogfood verdict, go vet issue count, output-review finding count.

完成上述机械诊断后，通过Skill工具调用

printing-press-output-review

子技能。该子技能带有

context: fork

，并负责调度提示、门限逻辑和已知盲点——这是与主printing-press技能共享的单一事实来源。

Skill(
  skill: "cli-printing-press:printing-press-output-review",
  args: "$CLI_DIR"
)

解析返回的

---OUTPUT-REVIEW-RESULT---

块。

status: WARN

的结果会归入上述诊断类别，以便Phase 2的修复同时解决基于规则和基于合理性的问题。

status: SKIP

仅为信息性内容——记录但不阻止后续操作。

Wave B门限规则：所有结果均为警告，而非阻塞项。如果修复明显且简单则进行修复；如果延期修复则添加简短注释。

记录基准分数：scorecard总分、验证通过率、dogfood判定结果、go vet问题数量、输出审查发现的问题数量。

Phase 2: Fix

Phase 2: 修复

Fix in priority order. After each priority level, update the lock heartbeat:

bash

printing-press lock update --cli "$CLI_NAME" --phase polish 2>/dev/null

按优先级顺序修复。完成每个优先级级别后，更新锁的心跳：

bash

printing-press lock update --cli "$CLI_NAME" --phase polish 2>/dev/null

Runtime variant default checklist

运行时变体默认值检查清单

If a polish fix adds or changes a runtime mode, data-source option, auth tier, transport, or other user-visible default, document this short checklist before selecting the default:

User-visible default: which behavior users get without extra flags or config.
Compatibility risk: whether existing commands, scripts, MCP tools, or stored config change behavior.
Verification command: the exact command that proves the default and the non-default escape hatch both work.

Keep the checklist in the polish notes or result block. Skip it for ordinary bug fixes that do not change runtime variants or defaults.

如果打磨修复添加或更改了运行时模式、数据源选项、认证层级、传输方式或其他用户可见的默认值，请在选择默认值前记录以下简短清单：

用户可见默认值：用户无需额外标志或配置即可获得的行为。
兼容性风险：现有命令、脚本、MCP工具或存储的配置是否会改变行为。
验证命令：能证明默认值和非默认值逃逸舱均有效的确切命令。

将清单保存在打磨笔记或结果块中。对于不改变运行时变体或默认值的普通bug修复，可跳过此清单。

Priority 0: MCP surface migration (legacy CLIs)

优先级0：MCP表面迁移（旧版CLI）

If Phase 1's

dogfood

reported

MCP Surface: FAIL

with a parity mismatch, the CLI was generated before the runtime cobratree walker existed and is still on the static

internal/mcp/tools.go

surface. The fix is mechanical:

bash

printing-press mcp-sync "$CLI_DIR"

That migrates the MCP surface to the runtime walker, regenerates

tools-manifest.json

and

internal/mcp/tools.go

, and applies any

mcp-descriptions.json

overrides. Re-run

dogfood

after; the parity gate flips to PASS. This is a known migration path for every CLI generated before the cobratree landed; running it on a CLI already on the runtime walker is a no-op refresh.

Skip this priority on CLIs where dogfood's MCP gate is already passing.

如果Phase 1的

dogfood

报告

MCP Surface: FAIL

且存在奇偶性不匹配，说明该CLI是在运行时cobratree walker存在之前生成的，仍使用静态的

internal/mcp/tools.go

表面。修复方法是机械性的：

bash

printing-press mcp-sync "$CLI_DIR"

该命令会将MCP表面迁移到运行时walker，重新生成

tools-manifest.json

和

internal/mcp/tools.go

，并应用任何

mcp-descriptions.json

覆盖配置。修复后重新运行

dogfood

；奇偶性门限会变为PASS。这是所有在cobratree推出之前生成的CLI的已知迁移路径；对已使用运行时walker的CLI运行该命令只会进行无操作刷新。

如果dogfood的MCP门限已通过，则跳过此优先级。

Priority 1: Verify failures

优先级1：验证失败

For each command that fails verify dry-run or exec:

Read the command file
Find
```
Args: cobra.ExactArgs(N)
```
or similar constraint
Remove the
```
Args:
```
field

Add at the top of

RunE

if len(args) == 0 {
    return cmd.Help()
}

For commands needing 2+ args, use
```
if len(args) < 2
```
Check for dry-run nil-data crashes and add guards:
go
```
if flags.dryRun {
    return nil
}
```

针对每个在dry-run或exec模式下验证失败的命令：

读取命令文件
找到
```
Args: cobra.ExactArgs(N)
```
或类似约束
删除
```
Args:
```
字段

在

RunE

顶部添加：

if len(args) == 0 {
    return cmd.Help()
}

对于需要2个及以上参数的命令，使用
```
if len(args) < 2
```
检查dry-run时是否存在空数据崩溃，并添加防护：
go
```
if flags.dryRun {
    return nil
}
```

Priority 2: Dead code

优先级2：死代码

For each dead function flagged by dogfood, grep all
```
.go
```
files to verify it's truly unused (not just its definition matching itself)
If truly unused: remove the function
If used by another helper: leave it (false positive)
After removal, remove unused imports
Delete stale files (promoted commands not registered in root.go)

对于dogfood标记的每个未使用函数，在所有
```
.go
```
文件中进行grep搜索以确认其确实未被使用（不仅仅是定义本身匹配）
如果确实未被使用：删除该函数
如果被其他辅助函数使用：保留（误报）
删除函数后，移除未使用的导入
删除过期文件（已升级但未在root.go中注册的命令）

Priority 3: CLI description and metadata

优先级3：CLI描述和元数据

Read root command
```
Short
```
in
```
internal/cli/root.go
```
If it contains boilerplate ("Reverse-engineered...", raw API title), rewrite: Pattern:
```
"<Product> CLI with <capability-1>, <capability-2>, and <capability-3>"
```
Check commands for missing
```
Example
```
fields. Add realistic examples with domain-specific values.

读取
```
internal/cli/root.go
```
中根命令的
```
Short
```
字段
如果包含模板化内容（例如"Reverse-engineered..."、原始API标题），重写为：格式：
```
"<产品名称> CLI，具备<功能1>、<功能2>和<功能3>"
```
检查命令是否缺少
```
Example
```
字段。添加具有领域特定值的真实示例。

Priority 4: README

优先级4：README

Cardinal rule: run
<cli> <cmd> --help
for EVERY command you put in the README. Never guess flag names, argument formats, or valid values. If you write

--start-time

but the flag is

--start

, the README is wrong and users will get errors on their first try.

基本原则：对于README中提到的每个命令，都要运行
<cli> <cmd> --help
。不要猜测标志名称、参数格式或有效值。如果你写了

--start-time

但实际标志是

--start

，那么README就是错误的，用户首次尝试时会遇到错误。

Source-of-truth files for rendered sections

渲染部分的事实来源文件

Before editing README.md, SKILL.md, or

.printing-press.json

, identify whether the section is rendered from a source file. Dogfood and regeneration overwrite these rendered sections, so direct edits there are temporary and should be used only to inspect the current output.

Rendered section or field	Source-of-truth file::field	Polish workflow
README `## Unique Features`	`research.json::novel_features_built[]`	Edit the underlying `research.json` feature description/example, then re-run dogfood with `--research-dir` .
SKILL `## Unique Capabilities`	`research.json::novel_features_built[]`	Edit the underlying `research.json` feature description/example, then re-run dogfood with `--research-dir` .
README Quick Start	`research.json::narrative.quickstart[]`	Edit the command/comment in `research.json` , then regenerate or re-run the dogfood/rendering step.
SKILL Recipes	`research.json::narrative.recipes[]`	Edit the recipe title, command, or explanation in `research.json` , then regenerate or re-run the dogfood/rendering step.
README/SKILL Troubleshooting	`research.json::narrative.troubleshoots[]`	Edit the symptom/fix pair in `research.json` , then regenerate or re-run the dogfood/rendering step.
`.printing-press.json` `display_name` , `description` , `mcp_*`	`WriteManifestForGenerate` ; for description/display-name overrides, edit the spec ( `info.title` , `info.x-display-name` , `info.description` )	Edit the spec or rerun the manifest writer. Do not hand-edit generated manifest metadata unless you are doing temporary diagnosis.

Recommended loop for these rendered sections: edit the source field, re-run dogfood with

--research-dir "$RESEARCH_DIR"

or regenerate the CLI as appropriate, then run a second pass to confirm the rendered README/SKILL text stays fixed. If you edit README.md or SKILL.md directly in one of these sections, expect the next dogfood resync or regeneration to clobber the change.

To find the manuscript source:

bash

PRESS_HOME="$HOME/printing-press"
API_SLUG="${CLI_NAME%-pp-cli}"
RESEARCH_JSON=""
for f in "$PRESS_HOME/manuscripts/$CLI_NAME"/*/research.json \
         "$PRESS_HOME/manuscripts/$API_SLUG"/*/research.json; do
  if [ -f "$f" ]; then RESEARCH_JSON="$f"; break; fi
done

RESEARCH_JSON

exists and a rendered section has bad prose, examples, or flag references, fix the corresponding field in that file first. For novel features, dogfood verifies

research.json::novel_features[]

, writes the surviving set to

research.json::novel_features_built[]

, and syncs README

## Unique Features

, SKILL

## Unique Capabilities

.printing-press.json

novel_features

, and root help Highlights from that verified set.

在编辑README.md、SKILL.md或

.printing-press.json

之前，确定该部分是否是从源文件渲染而来。Dogfood和重新生成操作会覆盖这些渲染部分，因此直接编辑这些文件只是临时的，仅应用于检查当前输出。

渲染部分或字段	事实来源文件::字段	打磨流程
README `## Unique Features`	`research.json::novel_features_built[]`	编辑底层 `research.json` 中的特性描述/示例，然后使用 `--research-dir` 重新运行dogfood。
SKILL `## Unique Capabilities`	`research.json::novel_features_built[]`	编辑底层 `research.json` 中的特性描述/示例，然后使用 `--research-dir` 重新运行dogfood。
README Quick Start	`research.json::narrative.quickstart[]`	编辑 `research.json` 中的命令/注释，然后重新生成或重新运行dogfood/渲染步骤。
SKILL Recipes	`research.json::narrative.recipes[]`	编辑 `research.json` 中的配方标题、命令或说明，然后重新生成或重新运行dogfood/渲染步骤。
README/SKILL Troubleshooting	`research.json::narrative.troubleshoots[]`	编辑 `research.json` 中的症状/修复对，然后重新生成或重新运行dogfood/渲染步骤。
`.printing-press.json` `display_name` , `description` , `mcp_*`	`WriteManifestForGenerate` ；对于描述/显示名称覆盖，编辑spec（ `info.title` , `info.x-display-name` , `info.description` ）	编辑spec或重新运行清单写入器。除非进行临时诊断，否则不要手动编辑生成的清单元数据。

针对这些渲染部分的推荐流程：编辑源字段，使用

--research-dir "$RESEARCH_DIR"

重新运行dogfood或根据情况重新生成CLI，然后再次运行以确认渲染后的README/SKILL文本保持不变。如果你直接编辑README.md或SKILL.md中的这些部分，下次dogfood同步或重新生成时会覆盖你的更改。

查找手稿来源：

bash

PRESS_HOME="$HOME/printing-press"
API_SLUG="${CLI_NAME%-pp-cli}"
RESEARCH_JSON=""
for f in "$PRESS_HOME/manuscripts/$CLI_NAME"/*/research.json \
         "$PRESS_HOME/manuscripts/$API_SLUG"/*/research.json; do
  if [ -f "$f" ]; then RESEARCH_JSON="$f"; break; fi
done

如果

RESEARCH_JSON

存在且渲染部分的文案、示例或标志引用存在问题，请先修复该文件中的对应字段。对于新特性，dogfood会验证

research.json::novel_features[]

，将通过验证的集合写入

research.json::novel_features_built[]

，并将README的

## Unique Features

、SKILL的

## Unique Capabilities

、

.printing-press.json

的

novel_features

和根帮助的Highlights与该验证集合同步。

Required sections (must be present and correct)

必填部分（必须存在且正确）

Title: "# <Product Name> CLI" — use the product's real name with correct casing/punctuation (e.g., "Cal.com" not "Cal Com")
Subtitle: one sentence describing what the CLI does for the user, matching the root
```
Short
```
field. NOT a description of the API.
Install: correct install command. Use the printing-press-library repo URL, not a per-CLI repo that doesn't exist.
Authentication: how to set
```
<API>_API_KEY
```
env var, where to get a key (link to the provider's settings page), self-hosted URL override if supported. Read
```
config.go
```
to find all env vars.
Quick Start: 3-5 commands someone will actually run first. Pick commands that are both most useful (what you'd run daily) and show the CLI's value (why install this over curl). Usually:
```
doctor
```
→
```
sync
```
→ transcendence command (
```
today
```
,
```
health
```
) →
```
search
```
. Avoid raw list commands — they dump data without demonstrating why the CLI exists.
Commands: categorized table. Group by domain function (Scheduling, Analytics, Account, Utilities), not by implementation structure.
Output Formats: show
```
--json
```
,
```
--select
```
,
```
--csv
```
,
```
--compact
```
,
```
--dry-run
```
,
```
--agent
```
. Use a real command, not a placeholder.
Agent Usage: agent-native properties and exit codes.
Cookbook: 8-15 recipes using verified flag names from
```
--help
```
. Show the CLI's unique capabilities: transcendence commands, filters, SQL queries, pipes. Include at least one mutation example.
Health Check: show actual
```
doctor
```
output, not a placeholder.
Configuration: list ALL env vars from config.go with descriptions. Include config file path.
Troubleshooting: common errors mapped to exit codes with fixes.

标题："# <产品名称> CLI"——使用产品的真实名称，大小写和标点正确（例如："Cal.com"而非"Cal Com"）
副标题：一句话描述CLI为用户提供的功能，与根命令的
```
Short
```
字段匹配。不要描述API。
安装：正确的安装命令。使用printing-press-library仓库的URL，而非不存在的每个CLI单独的仓库。
认证：如何设置
```
<API>_API_KEY
```
环境变量，在哪里获取密钥（链接到提供商的设置页面），如果支持自托管则提供URL覆盖选项。读取
```
config.go
```
以找到所有环境变量。
快速开始：3-5个用户实际会首先运行的命令。选择既最有用（你日常会运行的）又能展示CLI价值（为什么安装这个而不是curl）的命令。通常顺序：
```
doctor
```
→
```
sync
```
→ 超越性命令（
```
today
```
、
```
health
```
） →
```
search
```
。避免原始列表命令——它们只会转储数据，无法展示CLI存在的意义。
命令：分类表格。按领域功能（调度、分析、账户、实用工具）分组，而非按实现结构分组。
输出格式：展示
```
--json
```
、
```
--select
```
、
```
--csv
```
、
```
--compact
```
、
```
--dry-run
```
、
```
--agent
```
。使用真实命令，而非占位符。
智能体使用：智能体原生属性和退出码。
食谱：8-15个使用从
```
--help
```
中验证过的标志名称的配方。展示CLI的独特功能：超越性命令、过滤器、SQL查询、管道。至少包含一个变更示例。
健康检查：展示实际的
```
doctor
```
输出，而非占位符。
配置：列出
```
config.go
```
中的所有环境变量及其描述。包含配置文件路径。
故障排除：常见错误与退出码的映射及修复方法。

Optional sections (add at your discretion)

可选部分（可根据需要添加）

Rate Limits: if the API has documented limits
Self-Hosting: if the CLI supports
```
--api-url
```
or
```
BASE_URL
```
override
Pagination: if the API has notable pagination behavior
Sources & Inspiration: credits to community projects (generated by the machine, preserve if present)

速率限制：如果API有文档记录的限制
自托管：如果CLI支持
```
--api-url
```
或
```
BASE_URL
```
覆盖
分页：如果API有显著的分页行为
来源与灵感：社区项目的致谢（由机器生成，如果存在则保留）

Priority 4.5: SKILL static-check failures (verify-skill)

优先级4.5：SKILL静态检查失败（verify-skill）

Read

/tmp/polish-verify-skill.json

for the full finding list. Each finding has a

check

(

flag-names

flag-commands

positional-args

unknown-command

, or

canonical-sections

), a

command

(the path the SKILL claimed), and a

detail

describing the mismatch. Common shapes and fixes:

flag-names
— SKILL references
```
--foo
```
on a
```
<cli> ...
```
invocation but no command in
```
internal/cli/*.go
```
declares it. Either the example is wrong (fix the SKILL or remove the recipe) or the flag was deleted (decide if it should come back). Out of scope: flags on lines that invoke other tools (e.g.
```
npx -y @mvanhorn/printing-press install <api> --cli-only
```
,
```
gh pr create --base ...
```
,
```
go install ...
```
). The recipe-scoped flag-names check ignores those by design — never strip an external-tool flag to make verify-skill exit 0, and never replace the install instructions with a fabricated slash command. If the finding is firing on an external-tool flag anyway, that is a verify-skill bug, not a SKILL bug; report it instead of editing the SKILL.
flag-commands
—
```
--foo is declared elsewhere but not on <cmd>
```
. The flag exists somewhere but not on the command the SKILL invoked it on. Two fixes:
1. If the flag is added via a shared helper like
```
addXxxFlags(cmd, ...)
```
  , inline the
```
cmd.Flags().StringVar(...)
```
  declaration directly in the affected command's source file. The verify-skill grep cannot follow function-call indirection.
2. If the SKILL example is genuinely wrong, fix the example to use a flag the command does declare.
positional-args
—
```
got N positional args; Use: "<cmd> <arg>" expects M-M
```
. The SKILL recipe passed N positional args but the command's
```
Use:
```
declares M required. Two fixes:
1. If the command also accepts the value via a
```
--flag
```
  , change
```
Use: "cmd <arg>"
```
  to
```
Use: "cmd [arg]"
```
  (square brackets = optional). Verify-skill correctly accepts
```
--flag
```
  -only invocations against an optional positional.
2. If the SKILL example is missing a required positional, fix the example.
canonical-sections
—
```
install section drift: hand-edit detected in a generator-owned section
```
. The
```
## Prerequisites: Install the CLI
```
block has been edited away from what the generator would emit for this CLI today. Do not hand-edit the install section. It's templated from
```
internal/generator/templates/skill.md.tmpl
```
parameterized on
```
(api_name, category, uses_browser_http_transport)
```
; any drift means an automation step or person modified text the machine owns. Resolve by regenerating the printed CLI (run
```
printing-press regen
```
against this directory, or for a published CLI, regenerate from the spec and re-publish). If the canonical text itself is wrong (e.g., a real change to the install instructions is needed), fix the template, not the printed CLI.

When editing other parts of SKILL.md, Read the affected section first and Read it again after the Edit.

Edit

replaces a literal string; if the surrounding context has drifted, a single Edit can graft a second copy of a block onto the first instead of replacing it.

After fixing, re-run

printing-press verify-skill --dir "$CLI_DIR"

and confirm exit 0 before moving on.

读取

/tmp/polish-verify-skill.json

获取完整的问题列表。每个问题包含

check

（

flag-names

、

flag-commands

、

positional-args

、

unknown-command

或

canonical-sections

）、

command

（SKILL声明的路径）和

detail

（描述不匹配的内容）。常见情况及修复方法：

flag-names
— SKILL在
```
<cli> ...
```
调用中引用了
```
--foo
```
，但
```
internal/cli/*.go
```
中没有命令声明该标志。要么示例错误（修复SKILL或删除配方），要么标志已被删除（决定是否恢复）。超出范围：调用其他工具的行中的标志（例如
```
npx -y @mvanhorn/printing-press install <api> --cli-only
```
、
```
gh pr create --base ...
```
、
```
go install ...
```
）。配方范围的flag-names检查会故意忽略这些标志——永远不要为了让verify-skill退出码为0而删除外部工具的标志，也不要用虚构的斜杠命令替换安装说明。如果问题仍在外部工具的标志上触发，那是verify-skill的bug，而非SKILL的bug；应报告而非编辑SKILL。
flag-commands
—
```
--foo在其他地方声明但未在<cmd>上声明
```
。标志存在但未在SKILL调用的命令上声明。两种修复方法：
1. 如果标志是通过共享辅助函数（如
```
addXxxFlags(cmd, ...)
```
  ）添加的，将
```
cmd.Flags().StringVar(...)
```
  声明直接内联到受影响命令的源文件中。verify-skill的grep无法跟踪函数调用的间接引用。
2. 如果SKILL示例确实错误，修复示例使其使用命令声明的标志。
positional-args
—
```
got N positional args; Use: "<cmd> <arg>" expects M-M
```
。SKILL配方传递了N个位置参数，但命令的
```
Use:
```
声明需要M个必填参数。两种修复方法：
1. 如果命令也接受通过
```
--flag
```
  传递的值，将
```
Use: "cmd <arg>"
```
  改为
```
Use: "cmd [arg]"
```
  （方括号表示可选）。verify-skill会正确接受针对可选位置参数的仅
```
--flag
```
  调用。
2. 如果SKILL示例缺少必填位置参数，修复示例。
canonical-sections
—
```
install section drift: hand-edit detected in a generator-owned section
```
。
```
## Prerequisites: Install the CLI
```
块已被编辑，偏离了机器当前为该CLI生成的内容。不要手动编辑安装部分。它是从
```
internal/generator/templates/skill.md.tmpl
```
模板生成的，参数为
```
(api_name, category, uses_browser_http_transport)
```
；任何偏离都意味着自动化步骤或人员修改了机器拥有的文本。解决方法是重新生成打印的CLI（对该目录运行
```
printing-press regen
```
，对于已发布的CLI，从spec重新生成并重新发布）。如果规范文本本身错误（例如需要对安装说明进行真实更改），请修复模板，而非打印的CLI。

编辑SKILL.md的其他部分时，请先阅读受影响的部分，编辑后再次阅读。

Edit

会替换字面字符串；如果周围上下文已发生变化，单次编辑可能会将块的第二个副本嫁接到第一个副本上，而非替换它。

修复后，重新运行

printing-press verify-skill --dir "$CLI_DIR"

并确认退出码为0后再继续。

Priority 5: Remaining dogfood issues

优先级5：剩余的dogfood问题

Path validity mismatches
Auth protocol mismatches
Example drift (examples referencing wrong commands)
Data pipeline integrity issues

路径有效性不匹配
认证协议不匹配
示例漂移（示例引用了错误的命令）
数据管道完整性问题

Priority 6: MCP tool quality

优先级6：MCP工具质量

Your goal now is to ensure every MCP tool exposed by this CLI carries agent-grade descriptions and correct read/write classifications. Tool descriptions and classifications are how agents discover and decide whether to call a tool — thin descriptions and missing annotations directly degrade agent UX, and Phase 1's mechanical gates (verify, dogfood) do NOT catch this class of issue.

Stop and:

Run
```
printing-press tools-audit "$CLI_DIR" --json
```
to surface mechanical findings (empty Short, thin Short, missing
```
mcp:read-only
```
on read-shaped command names).
You must read
```
references/tools-polish.md
```
and follow its instructions to address the findings AND run a judgment pass over every command — regardless of whether the audit flagged it. The audit catches mechanical issues; description quality and borderline classification (read-only vs. local-write) always require agent reasoning. You must not skip this.
Accepting MCP-description findings carries a stricter contract.
```
thin-mcp-description
```
and
```
empty-mcp-description
```
accepts require three pre-decision fields (
```
spec_source_material
```
,
```
target_description
```
,
```
gap_analysis
```
) populated per finding. The binary rejects bulk accepts (>5 findings sharing one rationale) and runs that "complete" without lifting MCPDescriptionQuality. Fix via override or generator improvement is the expected path; accept is rare. See
```
references/tools-polish.md
```
"Marking a finding accepted" for the full contract.

Proceed to "After all fixes" only when the audit's summary line reads

no pending findings

with no

incomplete:

block — every gate (pre-decision fields, duplicate rationale, scorecard delta) passes.

你的目标是确保CLI暴露的每个MCP工具都带有智能体级别的描述和正确的读写分类。 工具描述和分类是智能体发现并决定是否调用工具的依据——描述单薄和缺少注解会直接降低智能体的用户体验，而Phase 1的机械门限（verify、dogfood）无法捕获这类问题。

停止操作并执行以下步骤：

运行
```
printing-press tools-audit "$CLI_DIR" --json
```
以发现机械问题（Short字段为空、Short字段内容单薄、读取型命令名称缺少
```
mcp:read-only
```
注解）。
你必须阅读
```
references/tools-polish.md
```
并按照其说明解决问题，同时对每个命令进行判断检查——无论审计是否标记它。审计只能发现机械问题；描述质量和边界分类（只读 vs 本地写入）始终需要智能体的推理。你不能跳过此步骤。
接受MCP描述问题有更严格的要求。
```
thin-mcp-description
```
和
```
empty-mcp-description
```
的接受需要每个问题都填充三个预决策字段（
```
spec_source_material
```
、
```
target_description
```
、
```
gap_analysis
```
）。批量接受（>5个问题共享一个理由）会被拒绝，并且会导致“完成”但未提升MCPDescriptionQuality。预期的解决方法是通过覆盖或生成器改进；接受是罕见情况。请参阅
```
references/tools-polish.md
```
中的“Marking a finding accepted”了解完整要求。

只有当审计的摘要行显示

no pending findings

且没有

incomplete:

块时，才能进入“所有修复完成后”步骤——所有门限（预决策字段、重复理由、scorecard差异）都已通过。

After all fixes

所有修复完成后

bash

go build -o "$CLI_NAME" ./cmd/"$CLI_NAME"
gofmt -w .

bash

go build -o "$CLI_NAME" ./cmd/"$CLI_NAME"
gofmt -w .

Phase 3: Re-diagnose

Phase 3: 重新诊断

Re-run the diagnostic sweep on the fixed CLI:

bash

printing-press dogfood --dir "$CLI_DIR" $SPEC_FLAG 2>&1
printing-press verify --dir "$CLI_DIR" $SPEC_FLAG --json 2>&1
printing-press workflow-verify --dir "$CLI_DIR" --json 2>&1
printing-press verify-skill --dir "$CLI_DIR" --json 2>&1
printing-press scorecard --dir "$CLI_DIR" $SPEC_FLAG 2>&1
printing-press tools-audit "$CLI_DIR" 2>&1
go vet ./... 2>&1

Record the after scores. If verify-skill still has any

severity=error

findings or workflow-verify still reports

workflow-fail

, ship cannot fire (see ship logic below).

在修复后的CLI上重新运行诊断扫描：

bash

printing-press dogfood --dir "$CLI_DIR" $SPEC_FLAG 2>&1
printing-press verify --dir "$CLI_DIR" $SPEC_FLAG --json 2>&1
printing-press workflow-verify --dir "$CLI_DIR" --json 2>&1
printing-press verify-skill --dir "$CLI_DIR" --json 2>&1
printing-press scorecard --dir "$CLI_DIR" $SPEC_FLAG 2>&1
printing-press tools-audit "$CLI_DIR" 2>&1
go vet ./... 2>&1

记录修复后的分数。如果verify-skill仍有任何

severity=error

的问题，或workflow-verify仍报告

workflow-fail

，则不能发布（见下方发布逻辑）。

Ship logic

发布逻辑

Compute the ship recommendation:

ship
: verify >= 80%, scorecard >= 75, no critical failures, AND verify-skill exits 0 (no SKILL/CLI mismatches), AND workflow-verify is not
```
workflow-fail
```
, AND tools-audit shows zero pending findings (every finding fixed or explicitly accepted with rationale). The SKILL/workflow gates are hard requirements: a CLI that ships with a SKILL that lies about it (verify-skill findings) gives agents broken instructions; a CLI whose primary workflow fails verification has not actually shipped.
ship-with-gaps
: verify >= 65%, scorecard >= 65, non-critical gaps remain, AND the SKILL/workflow gates above hold, AND the README has a
```
## Known Gaps
```
block that lists the user-facing gaps. Reserved for the rare case where a refactor or external-dependency blocker prevents a clean fix.
README Known Gaps is mandatory for ship-with-gaps. The published library copy is what downstream users see; if the verdict claims gaps exist but the README hides them, downstream users meet a CLI that misbehaves with no disclosure. Before emitting
```
ship_recommendation: ship-with-gaps
```
:
1. Read the CLI's
```
README.md
```
  . If a
```
## Known Gaps
```
  section already exists (e.g., the main SKILL Phase 4 wrote it before polish ran), confirm it covers the user-facing items in
```
remaining_issues
```
  . Add bullets for any newly surfaced user-facing gap polish discovered.
2. If
```
## Known Gaps
```
  is missing, write it — placed after
```
## Quick Start
```
  (or before
```
## Usage
```
  ) to mirror the
```
## Unique Features
```
  placement convention. One bullet per user-facing item from
```
remaining_issues
```
  . Phrase from the user's perspective: what command misbehaves, what the workaround is. Example:
  markdown
```
## Known Gaps

- **`analytics export --csv`** returns truncated rows on workspaces with >10k events. Use `--json` and pipe to `jq` as a workaround until the underlying export endpoint is paginated.
```
3. Filter
```
remaining_issues
```
  for user-facing entries when populating the section. Internal items (verify drift on a deprecated flag, MCP description tuning, polish-internal notes) do not belong in the public Known Gaps. If the agent cannot identify any user-facing gap from
```
remaining_issues
```
  , the verdict is
```
ship
```
  , not
```
ship-with-gaps
```
  .
4. List each Known Gaps write/update in
```
fixes_applied
```
  so the caller can surface that this happened.
If polish cannot responsibly populate Known Gaps from the available evidence (e.g.,
```
remaining_issues
```
is all internal jargon with no user-facing reading), downgrade the verdict to
```
hold
```
rather than ship without disclosure.
hold
: verify < 65% or scorecard < 65 or critical failures, OR verify-skill has unresolved findings, OR workflow-verify reports
```
workflow-fail
```
and the workflow is the CLI's primary value.

计算发布建议：

ship
：验证通过率≥80%，scorecard得分≥75，无严重故障，且verify-skill退出码为0（无SKILL/CLI不匹配），且workflow-verify未报告
```
workflow-fail
```
，且tools-audit显示无待处理问题（所有问题已修复或通过合理理由明确接受）。SKILL/工作流门限是硬性要求：如果SKILL对CLI的描述有误（verify-skill发现问题），发布后会给智能体提供错误的指令；如果CLI的核心工作流未通过验证，则实际上并未完成发布。
ship-with-gaps
：验证通过率≥65%，scorecard得分≥65，存在非严重缺口，且满足上述SKILL/工作流门限，且README包含
```
## Known Gaps
```
块，列出了用户可见的缺口。仅适用于重构或外部依赖阻塞导致无法完全修复的罕见情况。
README Known Gaps是ship-with-gaps的必填项。发布的库副本是下游用户看到的内容；如果判定结果声称存在缺口但README未披露，下游用户会遇到行为异常且无说明的CLI。在输出
```
ship_recommendation: ship-with-gaps
```
之前：
1. 阅读CLI的
```
README.md
```
  。如果
```
## Known Gaps
```
  部分已存在（例如主SKILL的Phase 4在打磨前已写入），确认它涵盖了
```
remaining_issues
```
  中的用户可见项。为打磨发现的新用户可见缺口添加项目符号。
2. 如果
```
## Known Gaps
```
  缺失，写入该部分——放在
```
## Quick Start
```
  之后（或
```
## Usage
```
  之前），以遵循
```
## Unique Features
```
  的放置惯例。每个项目符号对应
```
remaining_issues
```
  中的一个用户可见项。从用户的角度表述：哪个命令行为异常，解决方法是什么。示例：
  markdown
```
## Known Gaps

- **`analytics export --csv`** 在事件数>10k的工作区中返回截断的行。在底层导出端点实现分页之前，可使用`--json`并通过管道传递给`jq`作为解决方法。
```
3. 填充该部分时，过滤
```
remaining_issues
```
  中的用户可见项。内部项（已弃用标志的验证漂移、MCP描述调整、打磨内部笔记）不属于公共Known Gaps。如果智能体无法从
```
remaining_issues
```
  中识别出任何用户可见缺口，则判定结果应为
```
ship
```
  ，而非
```
ship-with-gaps
```
  。
4. 在
```
fixes_applied
```
  中列出每个Known Gaps的写入/更新操作，以便调用者可以展示此操作已执行。
如果打磨无法根据现有证据合理填充Known Gaps（例如
```
remaining_issues
```
全是内部术语，无用户可见解读），则将判定结果降级为
```
hold
```
，而非未披露就发布。
hold
：验证通过率<65%或scorecard得分<65或存在严重故障，或verify-skill存在未解决的问题，或workflow-verify报告
```
workflow-fail
```
且工作流是CLI的核心价值。

Push higher without gaming

不通过投机手段提升分数

The ship gates are a floor, not a ceiling. After they pass, look at scorecard dimensions still below max and ask whether each gap is real or structural:

Find the underlying deficit, not the score. The scorecard is a proxy for quality, not the goal itself. A README scoring 8/10 might be missing a Cookbook section or have outdated commands — that's a real, fixable gap. A
```
mcp_surface_strategy
```
scoring 2/10 on a 200-endpoint API might be flagging that the surface is mostly endpoint mirrors — also potentially fixable.
If there's a real, agent-grade improvement available, make it. Better description, missing flag doc, weak README section, an example that doesn't reflect actual usage. The CLI gets better and the score follows.
If the deficit is structural, document and accept. Some dimensions assume capabilities the CLI's domain doesn't have (a read-only API scored against write-workflow dimensions, a CLI with no auth scored on auth dimensions, a small API penalized on
```
surface_strategy
```
thresholds calibrated for large APIs). Note the reason in
```
skipped_findings
```
and move on.
Never add scaffolding to satisfy the scorer. Fake commands, fake tests, fake flags, or boilerplate prose written purely to nudge a number — those degrade the CLI to satisfy the proxy. The scorer is imperfect by design (the "scoring may be imperfect" caveat in AGENTS.md applies). Trust the underlying judgment, not the number.

发布门限是最低要求，而非最高标准。通过门限后，查看仍未达到满分的scorecard维度，询问每个缺口是真实问题还是结构性问题：

找到根本缺陷，而非只看分数。scorecard是质量的代理指标，而非目标本身。README得分为8/10可能是缺少Cookbook部分或命令过时——这是真实的、可修复的缺口。200个端点的API的
```
mcp_surface_strategy
```
得分为2/10可能是因为表面大多是端点镜像——也可能是可修复的。
如果存在真实的、智能体级别的改进空间，就进行改进。更好的描述、缺失的标志文档、薄弱的README部分、不符合实际使用的示例。CLI会变得更好，分数也会随之提升。
如果缺陷是结构性的，记录并接受。某些维度假设CLI领域具备特定能力（只读API按写入工作流维度评分、无认证的CLI按认证维度评分、小API按为大API校准的
```
surface_strategy
```
阈值评分）。在
```
skipped_findings
```
中说明原因并继续。
永远不要为了满足评分而添加脚手架。虚假命令、虚假测试、虚假标志或纯粹为了提高分数而写的模板化文案——这些会降低CLI质量以满足代理指标。评分器设计上存在缺陷（AGENTS.md中的“scoring may be imperfect”警告适用）。相信底层判断，而非数字。

MCP scorecard dims map to spec fields, not generator code

MCP scorecard维度映射到spec字段，而非生成器代码

When

mcp_token_efficiency

mcp_tool_design

mcp_remote_transport

, or

mcp_surface_strategy

are below max, the fix is almost always a spec edit + regenerate (or

regen-merge

from a freshly-generated tree), not a generator-template change. Polish CAN address these — do not classify them as "feature add to a generator-owned file, retro candidate."

Weak dim	Spec field that fixes it	What to add to `spec.yaml` 's `mcp:` block
`mcp_remote_transport`	`mcp.transport`	`transport: [stdio, http]` (default is stdio-only; HTTP costs nothing and lets the same binary serve cloud-hosted agents)
`mcp_token_efficiency` , `mcp_surface_strategy`	`mcp.endpoint_tools` , `mcp.orchestration`	`endpoint_tools: hidden` + `orchestration: code` (Cloudflare pattern: ~70 raw endpoint tools collapse to `<api>_search` + `<api>_execute` ; all endpoints still reachable via execute)
`mcp_tool_design`	`mcp.intents`	Define multi-step intent compositions for the workflows the API supports
`mcp_description_quality`	`mcp-descriptions.json` (override file at the CLI root)	Per-tool description overrides; thin spec-derived descriptions get richer text without spec edits

Recommended threshold: at >50 typed endpoints, default to recommending all four (

transport

endpoint_tools=hidden

orchestration=code

intents

for the headline workflows). Below 30,

transport=[stdio, http]

is the only zero-cost win. The full reference is

docs/SPEC-EXTENSIONS.md

After editing the spec, regenerate (or

regen-merge

the changes into the published library) so the new

mcp:

block reaches templates. Cobratree-walked novel commands continue to surface as MCP tools either way; they don't need spec changes.

Rule of thumb: if your fix would still be valuable if the scorecard didn't exist, do it. If the only motivation is "to push the score," don't.

当

mcp_token_efficiency

、

mcp_tool_design

、

mcp_remote_transport

或

mcp_surface_strategy

低于满分时，修复方法几乎总是编辑spec并重新生成（或从新生成的树进行

regen-merge

），而非修改生成器模板。Polish可以解决这些问题——不要将其归类为“对生成器拥有的文件添加功能，retro候选”。

薄弱维度	修复该维度的spec字段	在 `spec.yaml` 的 `mcp:` 块中添加的内容
`mcp_remote_transport`	`mcp.transport`	`transport: [stdio, http]` （默认仅stdio；HTTP无成本，可让同一二进制文件为云托管智能体提供服务）
`mcp_token_efficiency` , `mcp_surface_strategy`	`mcp.endpoint_tools` , `mcp.orchestration`	`endpoint_tools: hidden` + `orchestration: code` （Cloudflare模式：约70个原始端点工具合并为 `<api>_search` + `<api>_execute` ；所有端点仍可通过execute访问）
`mcp_tool_design`	`mcp.intents`	为API支持的工作流定义多步骤意图组合
`mcp_description_quality`	`mcp-descriptions.json` （CLI根目录下的覆盖文件）	每个工具的描述覆盖；无需编辑spec即可让spec生成的单薄描述变得更丰富

推荐阈值：当端点数量>50时，默认建议全部四个配置（

transport

、

endpoint_tools=hidden

、

orchestration=code

、针对核心工作流的

intents

）。当端点数量<30时，

transport=[stdio, http]

是唯一零成本的改进。完整参考见

docs/SPEC-EXTENSIONS.md

。

编辑spec后，重新生成（或

regen-merge

更改到已发布的库），使新的

mcp:

块应用到模板中。无论是否更改spec，通过cobratree遍历的新命令都会继续作为MCP工具暴露。

经验法则：如果你的修复即使没有scorecard也有价值，就去做。如果唯一的动机是“提高分数”，就不要做。

Display delta and emit result block

展示差异并输出结果块

Display the delta to the user, then emit the structured

---POLISH-RESULT---

block. The block lets calling skills (e.g., main printing-press SKILL.md Phase 5.5) parse the recommendation and scores reliably; the human-readable table above is for the user.

Polish Results for <CLI_NAME>:

                    Before    After     Delta
  Scorecard:        XX/100    XX/100    +N
  Verify:           XX%       XX%       +N%
  Tools-audit:      XX        XX        -N pending findings

Fixes applied:
  - <one-line description of each fix>

Skipped findings:
  - <finding>: <why you chose not to fix it>

Remaining issues:
  - <one-line description of each issue you tried to fix but couldn't>

---POLISH-RESULT---
scorecard_before: <N>
scorecard_after: <N>
verify_before: <N>
verify_after: <N>
dogfood_before: <PASS|FAIL>
dogfood_after: <PASS|FAIL>
govet_before: <N>
govet_after: <N>
tools_audit_before: <N pending>
tools_audit_after: <N pending>
fixes_applied:
- <one-line description of each fix>
skipped_findings:
- <finding>: <why you chose not to fix it>
remaining_issues:
- <one-line description of each issue you tried to fix but couldn't>
ship_recommendation: <ship|ship-with-gaps|hold>
further_polish_recommended: <yes|no>
further_polish_reasoning: <one sentence explaining the call>
---END-POLISH-RESULT---

The three lists serve different purposes:

fixes_applied: what changed — the caller displays these
skipped_findings: issues you found but deliberately did not fix, with reasoning (e.g., "verify classifies
```
stale
```
as read — scorer bug, not a CLI problem", "thin-short on
```
version
```
accepted as-is — accurate and brief"). The caller surfaces these so the user can decide whether to address them manually.
remaining_issues: issues you tried to fix but couldn't resolve.

向用户展示差异，然后输出结构化的

---POLISH-RESULT---

块。该块让调用技能（例如主printing-press SKILL.md的Phase 5.5）能够可靠地解析建议和分数；上方的人类可读表格是给用户看的。

Polish Results for <CLI_NAME>:

                    Before    After     Delta
  Scorecard:        XX/100    XX/100    +N
  Verify:           XX%       XX%       +N%
  Tools-audit:      XX        XX        -N pending findings

Fixes applied:
  - <每个修复的单行描述>

Skipped findings:
  - <问题>: <你选择不修复的原因>

Remaining issues:
  - <你尝试修复但未解决的每个问题的单行描述>

---POLISH-RESULT---
scorecard_before: <N>
scorecard_after: <N>
verify_before: <N>
verify_after: <N>
dogfood_before: <PASS|FAIL>
dogfood_after: <PASS|FAIL>
govet_before: <N>
govet_after: <N>
tools_audit_before: <N pending>
tools_audit_after: <N pending>
fixes_applied:
- <每个修复的单行描述>
skipped_findings:
- <问题>: <你选择不修复的原因>
remaining_issues:
- <你尝试修复但未解决的每个问题的单行描述>
ship_recommendation: <ship|ship-with-gaps|hold>
further_polish_recommended: <yes|no>
further_polish_reasoning: <解释该决定的一句话>
---END-POLISH-RESULT---

三个列表的用途不同：

fixes_applied：更改的内容——调用者会展示这些内容
skipped_findings：你发现但故意不修复的问题及原因（例如：“verify将
```
stale
```
归类为read——评分器bug，而非CLI问题”、“
```
version
```
的thin-short按原样接受——准确且简洁”）。调用者会展示这些内容，以便用户决定是否手动处理。
remaining_issues：你尝试修复但未解决的问题。

Picking

further_polish_recommended

选择

further_polish_recommended

的值

Your judgment, not a count of

remaining_issues

. Set

yes

when another polish invocation has a real chance of closing what's left:

```
remaining_issues
```
includes verify or dogfood failures you ran out of time on and a fresh pass with more attention per failure could plausibly resolve.
The fixes you did land may have unblocked dependent issues you couldn't reach this pass.
A SKILL/CLI mismatch needs a second look after this pass changed the source tree.

Set

no

when another invocation would re-tread the same ground:

```
remaining_issues
```
are decisions only the user can make (rename a flagship command, choose a default behavior, accept a structural trade-off).
You already attempted the fix in two different ways this pass and both failed for the same reason.
The blocker is external (API changed shape, rate-limited, missing credential) and not something a fresh polish run sees differently.
```
remaining_issues
```
is empty AND
```
skipped_findings
```
are all environmental or structural — there is nothing left for polish to do.

further_polish_reasoning

is one sentence the caller surfaces verbatim. Make it specific ("verify failures on

analytics export

and

report show

looked closable but I gave up too early") rather than generic ("more polish might help"). Callers use this signal to decide whether to offer "Polish again" in their next prompt; a vague reason makes their prompt vague.

这取决于你的判断，而非

remaining_issues

的数量。当再次打磨有可能解决剩余问题时，设置为

yes

：

```
remaining_issues
```
包含你因时间不足而放弃的verify或dogfood失败，且重新集中精力处理每个失败有可能解决。
你已完成的修复可能解除了之前无法处理的依赖问题。
SKILL/CLI不匹配需要在本次修改源码树后再次检查。

当再次打磨只会重复相同工作时，设置为

no

：

```
remaining_issues
```
是只有用户才能决定的事项（重命名旗舰命令、选择默认行为、接受结构性权衡）。
你在本次打磨中已尝试两种不同的修复方法，但均因相同原因失败。
阻塞因素是外部的（API形状改变、速率限制、缺少凭证），再次打磨不会有不同结果。
```
remaining_issues
```
为空且
```
skipped_findings
```
均为环境或结构性问题——没有什么可打磨的了。

further_polish_reasoning

是调用者会直接展示的一句话。要具体（例如：“

analytics export

和

report show

的verify失败看起来可以解决，但我过早放弃了”），而非泛泛而谈（例如：“再次打磨可能会有帮助”）。调用者会使用这个信号决定是否在下次提示中提供“再次打磨”选项；模糊的理由会导致提示也模糊。

Publish Offer

发布提议

Skip this entire section in mid-pipeline mode. Detect from

$CLI_DIR

: if the path is under

.runstate/

(i.e.,

$PRESS_RUNSTATE/<scope>/runs/.../working/<api>-pp-cli/

), polish is being called from main SKILL Phase 5.5 or hold-path "Polish to retry," and the working CLI has not been promoted to library yet.

/printing-press-publish <slug>

resolves to

$PRESS_LIBRARY/<slug>/

, which is either empty or holds a stale prior run — invoking publish here would either fail to resolve or ship the wrong copy. The parent skill owns the publish flow on that path; just emit the result block and return.

A simple check:

bash

case "$CLI_DIR" in
  *.runstate/*) echo "mid-pipeline; skipping Publish Offer"; return ;;
esac

For standalone invocations (

$CLI_DIR

under

$PRESS_LIBRARY/<slug>/

), continue with the offer below.

ship

ship-with-gaps

Construct the prompt from the result block. The shape is data-driven so the user is never asked to weigh "Polish again" against "Publish" when polish itself just decided another pass would not help.

在流水线模式下跳过整个本节。通过

$CLI_DIR

判断：如果路径在

.runstate/

下（即

$PRESS_RUNSTATE/<scope>/runs/.../working/<api>-pp-cli/

），则Polish是从主SKILL的Phase 5.5或hold路径“Polish to retry”调用的，工作中的CLI尚未被推广到库中。

/printing-press-publish <slug>

会解析到

$PRESS_LIBRARY/<slug>/

，该目录要么为空，要么保存的是上一次运行的旧CLI——在此处调用发布要么无法解析，要么发布错误的副本。父技能负责该路径下的发布流程；只需输出结果块并返回即可。

简单检查：

bash

case "$CLI_DIR" in
  *.runstate/*) echo "mid-pipeline; skipping Publish Offer"; return ;;
esac

对于独立调用（

$CLI_DIR

在

$PRESS_LIBRARY/<slug>/

下），继续以下提议。

如果判定结果为

ship

或

ship-with-gaps

：

根据结果块构建提示。提示是数据驱动的，因此当Polish本身判定再次打磨无帮助时，不会让用户在“再次打磨”和“发布”之间做选择。

Recommendation

菜单

Suppress the "Polish again" option entirely when

further_polish_recommended: no

. Keep "Publish" and "Done" always available.

Surface

further_polish_reasoning

as context when polish opted out of recommending another pass — the user should see why polish is done.

Present via

AskUserQuestion

. Two example shapes:

Polish converged clean (

remaining_issues

empty,

further_polish_recommended: no

"<CLI_NAME> polished: scorecard XX/100, verify XX%. Polish ran cleanly — nothing more to fix.

Recommendation: Publish.

Publish now (recommended) — validate, package, and open a PR

Done for now — CLI is at ~/printing-press/library/<cli-name>"

Polish thinks another pass would help (

remaining_issues

non-empty,

further_polish_recommended: yes

"<CLI_NAME> polished: scorecard XX/100, verify XX%. <N> issues remain.

Polish notes: '<further_polish_reasoning>'

Recommendation: Polish again before publishing.

Polish again (recommended) — close the remaining <N> issues

Publish now — ship as-is

Done for now — CLI is at ~/printing-press/library/<cli-name>"

The recommended option leads, carries the

(recommended)

label, and the leading

Recommendation:

line states the agent's call explicitly. Three reinforcing channels so the user does not have to infer from ordering.

当

further_polish_recommended: no

时，完全隐藏“再次打磨”选项。始终保留“发布”和“完成”选项。

当Polish不推荐再次打磨时，展示

further_polish_reasoning

作为上下文——用户应该知道Polish为什么认为已完成。

通过

AskUserQuestion

展示。两个示例格式：

打磨已完美收敛（

remaining_issues

为空，

further_polish_recommended: no

）：

"<CLI_NAME>打磨完成：scorecard XX/100，验证通过率XX%。打磨运行无问题——无需进一步修复。

推荐：发布。

立即发布（推荐）——验证、打包并打开PR

暂时完成——CLI位于~/printing-press/library/<cli-name>"

Polish认为再次打磨会有帮助（

remaining_issues

非空，

further_polish_recommended: yes

）：

"<CLI_NAME>打磨完成：scorecard XX/100，验证通过率XX%。仍有<N>个问题。

Polish说明：'<further_polish_reasoning>'

推荐：发布前再次打磨。

再次打磨（推荐）——解决剩余的<N>个问题

立即发布——按现状发布

暂时完成——CLI位于~/printing-press/library/<cli-name>"

推荐选项排在首位，带有

(recommended)

标签，且开头的

Recommendation:

行明确说明智能体的建议。通过三种方式强化，用户无需从排序中推断。

If "Publish now"

如果选择“立即发布”

Check for existing PR:

bash

gh pr list --repo mvanhorn/printing-press-library --head "feat/$CLI_NAME" --state open --author @me --json number,url --jq '.[0]' 2>/dev/null

Then invoke

/printing-press-publish <cli-name>

After publish returns success, offer retro as a soft tail. This mirrors the main

/printing-press

skill's Phase 6 behavior so users who reach publish through polish (mid-pipeline → polish-again → publish, or standalone polish → publish) get the same retro opportunity as users who reach publish directly through Phase 6.

Present via

AskUserQuestion

"PR opened: <PR_URL>. Run a retro? It surfaces systemic gaps from this session (generator misses, scorer bugs, skill-doc drift) as a GitHub issue for the Printing Press maintainers. Every retro filed raises the floor for the next CLI — and your session context is freshest right now."

No — I'm done (default)

Yes — run retro now

If the user picks yes, invoke

/printing-press-retro

(In mid-pipeline mode this whole section is unreachable — the Publish Offer guard at the top of this section returns early — so no extra check is needed here.)

检查是否存在现有PR：

bash

gh pr list --repo mvanhorn/printing-press-library --head "feat/$CLI_NAME" --state open --author @me --json number,url --jq '.[0]' 2>/dev/null

然后调用

/printing-press-publish <cli-name>

。

发布成功返回后，提供retro作为可选后续步骤。这与主

/printing-press

技能的Phase 6行为一致，因此通过打磨（流水线中→再次打磨→发布，或独立打磨→发布）达到发布状态的用户，与直接通过Phase 6达到发布状态的用户拥有相同的retro机会。

通过

AskUserQuestion

展示：

"PR已创建：<PR_URL>。运行retro吗？它会将本次会话中的系统性缺口（生成器遗漏、评分器bug、技能文档漂移）作为GitHub issue提交给Printing Press维护者。每个提交的retro都会提升下一个CLI的质量基准——而且你现在对会话上下文的记忆最清晰。"

不——我已完成（默认）

是——立即运行retro

如果用户选择是，调用

/printing-press-retro

。

（在流水线模式下，本节完全不可达——本节顶部的发布提议检查会提前返回——因此此处无需额外检查。）

If "Polish again"

如果选择“再次打磨”

Re-run Phase 1 → Phase 2 → Phase 3 with the same CLI. Maximum 2 additional polish passes (3 total including the first).

对同一CLI重新运行Phase 1 → Phase 2 → Phase 3。最多额外进行2次打磨（总共3次，包括第一次）。

If "Done for now"

如果选择“暂时完成”

End normally.

正常结束。

Rules

规则

Fix everything. Do not ask for approval before fixing — polish is autonomous.
Report results honestly. Show what improved and what didn't.
Do not add new features. Polish fixes quality issues, not feature gaps.
Do not re-run research or generation. Polish works with the CLI as-is.
Do not modify the printing-press generator. That's
```
/printing-press-retro
```
.
Do not modify any files outside
```
$CLI_DIR
```
.
If polish adds or renames a Cobra command, the MCP surface updates automatically through the generated
```
internal/mcp/cobratree
```
runtime mirror. Update
```
novel_features
```
only when README/SKILL highlights or registry display should change; use
```
cmd.Annotations["mcp:hidden"] = "true"
```
for debug-only commands.
Maximum 1 fix-and-rediagnose pass per polish invocation. The "Polish again" path runs additional invocations (max 3 total).
Prefer mechanical fixes over creative decisions. When a creative decision is needed (like the CLI description), use the research brief from manuscripts if available.

修复所有问题。修复前无需请求批准——Polish是自主运行的。
如实报告结果。展示哪些方面得到了改进，哪些没有。
不要添加新功能。Polish修复质量问题，而非功能缺口。
不要重新运行研究或生成。Polish基于现有CLI进行操作。
不要修改printing-press生成器。那是
```
/printing-press-retro
```
的职责。
不要修改
```
$CLI_DIR
```
之外的任何文件。
如果Polish添加或重命名Cobra命令，MCP表面会通过生成的
```
internal/mcp/cobratree
```
运行时镜像自动更新。仅当README/SKILL亮点或注册表显示需要更改时，才更新
```
novel_features
```
；对于仅用于调试的命令，使用
```
cmd.Annotations["mcp:hidden"] = "true"
```
。
每次Polish调用最多进行一次修复+重新诊断。“再次打磨”路径会运行额外的调用（最多3次总调用）。
优先选择机械修复而非创意决策。当需要创意决策时（例如CLI描述），如果可用，使用手稿中的研究摘要。