quality-validation
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseVerification Before Completion
完成前验证
Overview
概述
Claiming work is complete without verification is dishonesty, not efficiency.
Core principle: Evidence before claims, always.
Violating the letter of this rule is violating the spirit of this rule.
未经验证就宣称工作完成是不诚信的行为,而非高效。
核心原则: 始终先有证据,再做宣称。
违反规则的字面要求,即违反规则的精神内核。
The Iron Law
铁律
NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCEIf you haven't run the verification command in this message, you cannot claim it passes.
NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE如果未运行此消息中的验证命令,你就不能宣称任务通过。
The Gate Function
把关流程
BEFORE claiming any status or expressing satisfaction:
1. IDENTIFY: What command proves this claim?
2. RUN: Execute the FULL command (fresh, complete)
3. READ: Full output, check exit code, count failures
4. VERIFY: Does output confirm the claim?
- If NO: State actual status with evidence
- If YES: State claim WITH evidence
5. ONLY THEN: Make the claim
Skip any step = lying, not verifying在宣称任何状态或表达满意之前:
1. 确定:什么命令能证明该宣称?
2. 执行:运行完整的命令(全新、完整的执行)
3. 读取:查看完整输出,检查退出码,统计失败次数
4. 验证:输出是否能证实宣称?
- 若不能:附上证据说明实际状态
- 若能:附上证据做出宣称
5. 只有此时:才能做出宣称
跳过任何步骤 = 撒谎,而非验证Common Failures
常见失误
| Claim | Requires | Not Sufficient |
|---|---|---|
| Tests pass | Test command output: 0 failures | Previous run, "should pass" |
| Linter clean | Linter output: 0 errors | Partial check, extrapolation |
| Build succeeds | Build command: exit 0 | Linter passing, logs look good |
| Bug fixed | Test original symptom: passes | Code changed, assumed fixed |
| Regression test works | Red-green cycle verified | Test passes once |
| Agent completed | VCS diff shows changes | Agent reports "success" |
| Requirements met | Line-by-line checklist | Tests passing |
| 宣称内容 | 所需验证 | 不充分的做法 |
|---|---|---|
| 测试通过 | 测试命令输出:0个失败用例 | 之前的运行结果、“应该能通过” |
| 代码检查器无问题 | 代码检查器输出:0个错误 | 部分检查、推断 |
| 构建成功 | 构建命令:退出码0 | 代码检查器通过、日志看起来正常 |
| 漏洞已修复 | 针对原症状的测试通过 | 代码已修改、假设已修复 |
| 回归测试有效 | 已验证红-绿循环 | 测试仅通过一次 |
| Agent已完成任务 | 版本控制系统差异显示有变更 | Agent报告“成功” |
| 满足需求 | 逐条核对检查清单 | 测试通过 |
Red Flags - STOP
危险信号 - 立即停止
- Using "should", "probably", "seems to"
- Expressing satisfaction before verification ("Great!", "Perfect!", "Done!", etc.)
- About to commit/push/PR without verification
- Trusting agent success reports
- Relying on partial verification
- Thinking "just this once"
- Tired and wanting work over
- ANY wording implying success without having run verification
- 使用“应该”“可能”“似乎”这类表述
- 未验证就表达满意(如“太棒了!”“完美!”“完成了!”等)
- 未验证就准备提交/推送/创建拉取请求
- 信任Agent的成功报告
- 依赖部分验证
- 想着“就这一次”
- 疲惫不堪想尽快结束工作
- 任何暗示未运行验证就已成功的表述
Rationalization Prevention
合理化借口防范
| Excuse | Reality |
|---|---|
| "Should work now" | RUN the verification |
| "I'm confident" | Confidence ≠ evidence |
| "Just this once" | No exceptions |
| "Linter passed" | Linter ≠ compiler |
| "Agent said success" | Verify independently |
| "I'm tired" | Exhaustion ≠ excuse |
| "Partial check is enough" | Partial proves nothing |
| "Different words so rule doesn't apply" | Spirit over letter |
| 借口 | 事实 |
|---|---|
| “现在应该能运行了” | 去执行验证 |
| “我很有信心” | 信心≠证据 |
| “就这一次” | 没有例外 |
| “代码检查器通过了” | 代码检查器≠编译器 |
| “Agent说成功了” | 独立验证 |
| “我累了” | 疲惫不是借口 |
| “部分检查就够了” | 部分检查无法证明任何事 |
| “换个说法规则就不适用了” | 精神内核优先于字面表述 |
Key Patterns
关键范式
Tests:
✅ [Run test command] [See: 34/34 pass] "All tests pass"
❌ "Should pass now" / "Looks correct"Regression tests (TDD Red-Green):
✅ Write → Run (pass) → Revert fix → Run (MUST FAIL) → Restore → Run (pass)
❌ "I've written a regression test" (without red-green verification)Build:
✅ [Run build] [See: exit 0] "Build passes"
❌ "Linter passed" (linter doesn't check compilation)Requirements:
✅ Re-read plan → Create checklist → Verify each → Report gaps or completion
❌ "Tests pass, phase complete"Agent delegation:
✅ Agent reports success → Check VCS diff → Verify changes → Report actual state
❌ Trust agent report测试:
✅ [运行测试命令] [显示:34/34通过] “所有测试通过”
❌ “现在应该能通过了” / “看起来没问题”回归测试(测试驱动开发红-绿循环):
✅ 编写 → 运行(通过)→ 还原修复 → 运行(必须失败)→ 恢复修复 → 运行(通过)
❌ “我已经写了回归测试”(未验证红-绿循环)构建:
✅ [运行构建命令] [显示:退出码0] “构建通过”
❌ “代码检查器通过了”(代码检查器不检查编译情况)需求:
✅ 重新阅读计划 → 创建检查清单 → 逐项验证 → 报告差距或完成情况
❌ “测试通过,阶段完成”Agent委托任务:
✅ Agent报告成功 → 检查版本控制系统差异 → 验证变更 → 报告实际状态
❌ 信任Agent的报告Why This Matters
为什么这很重要
From 24 failure memories:
- your human partner said "I don't believe you" - trust broken
- Undefined functions shipped - would crash
- Missing requirements shipped - incomplete features
- Time wasted on false completion → redirect → rework
- Violates: "Honesty is a core value. If you lie, you'll be replaced."
基于24次失败案例的总结:
- 你的人类搭档说“我不信你”——信任破裂
- 发布了未定义的函数——会导致崩溃
- 发布了未满足需求的功能——功能不完整
- 因虚假的完成声明浪费时间→重新定位→返工
- 违反了:“诚信是核心价值观。若你撒谎,将被替换。”
When To Apply
适用场景
ALWAYS before:
- ANY variation of success/completion claims
- ANY expression of satisfaction
- ANY positive statement about work state
- Committing, PR creation, task completion
- Moving to next task
- Delegating to agents
Rule applies to:
- Exact phrases
- Paraphrases and synonyms
- Implications of success
- ANY communication suggesting completion/correctness
始终在以下操作前应用:
- 任何形式的成功/完成宣称
- 任何表达满意的表述
- 任何关于工作状态的正面陈述
- 提交、创建拉取请求、标记任务完成
- 进入下一个任务
- 委托任务给Agent
规则适用于:
- 精确表述
- 改写和同义词
- 暗示成功的表述
- 任何表明任务已完成/正确的沟通内容
The Bottom Line
底线
No shortcuts for verification.
Run the command. Read the output. THEN claim the result.
This is non-negotiable.
验证没有捷径。
运行命令。读取输出。然后再宣称结果。
这是不容协商的。