workbench-goal-mode-v2
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseWorkbench Goal Mode v2
工作台目标模式v2
Use this skill when an issue contains , , ,
or asks the workbench to complete an objective autonomously across multiple
agents, turns, and evidence gates until a real blocker appears.
/goalGOAL_MODE: yesGOAL_MODE_V2Goal Mode v2 is a persistent conductor, not a single-agent persistence wrapper.
It splits the workflow into two cooperating layers so the human does not need to
babysit windows, monitor every agent, or manually switch contexts.
当问题包含、、,或者要求工作台跨多个Agent、多轮任务和验证节点自主完成目标,直到出现真实阻塞时,使用此技能。
/goalGOAL_MODE: yesGOAL_MODE_V2目标模式v2是一个持久化指挥系统,而非单Agent持久化包装器。它将工作流拆分为两个协作层,因此人类无需监控窗口、跟踪每个Agent或手动切换上下文。
Session Communication Bootstrap
会话通信引导
At conductor startup, load and apply:
docs/agent-communication-profile.mdtext
Apply communication profile docs/agent-communication-profile.md.
Tone: human, direct, bilingual, pushback-ok.This controls communication style only. Runtime identity, model, tools, and MCP
availability still require live proof.
指挥系统启动时,加载并应用:
docs/agent-communication-profile.mdtext
Apply communication profile docs/agent-communication-profile.md.
Tone: human, direct, bilingual, pushback-ok.此配置仅控制沟通风格。运行时身份、模型、工具和MCP可用性仍需实时验证。
Architecture
架构
mermaid
flowchart TB
H["Operator shorthand / issue"] --> GC["GOAL_CAPTURED"]
GC --> DSN["DESIGNING"]
DSN --> DP["DECISION_PACKET"]
DP --> DSP["DISPATCHING"]
DSP --> OBS["OBSERVING"]
OBS -->|agent done| REV["REVIEWING"]
OBS -->|new evidence| DP
REV -->|PASS| LRN["LEARNING / ARCHIVING"]
REV -->|FLAG| DSP
REV -->|BLOCK| BC["BLOCKER_CLASSIFIED"]
BC -->|operator| OP["OPERATOR_NEEDED"]
BC -->|external| CD["COOLDOWN → retry"]
BC -->|permanent| ARC["ARCHIVED"]
LRN --> NG["NEXT_GOAL or DONE"]
OP -->|human responds| DSN
CD -->|cooldown elapsed| DSNmermaid
flowchart TB
H["Operator shorthand / issue"] --> GC["GOAL_CAPTURED"]
GC --> DSN["DESIGNING"]
DSN --> DP["DECISION_PACKET"]
DP --> DSP["DISPATCHING"]
DSP --> OBS["OBSERVING"]
OBS -->|agent done| REV["REVIEWING"]
OBS -->|new evidence| DP
REV -->|PASS| LRN["LEARNING / ARCHIVING"]
REV -->|FLAG| DSP
REV -->|BLOCK| BC["BLOCKER_CLASSIFIED"]
BC -->|operator| OP["OPERATOR_NEEDED"]
BC -->|external| CD["COOLDOWN → retry"]
BC -->|permanent| ARC["ARCHIVED"]
LRN --> NG["NEXT_GOAL or DONE"]
OP -->|human responds| DSN
CD -->|cooldown elapsed| DSNLayer 1: Design / Decision
第一层:设计/决策
Owned by the conductor (Admin or delegated design agent). Produces decision
packets, not raw execution spam.
Responsibilities:
- Refine operator intent into a scoped objective.
- Pull constraints, non-goals, blockers, and evidence expectations from durable memory (SYNTHESIS.md, DECISIONS.md, prior issue evidence).
- Choose the Friction Tier and applicable gates (SDD, L2 Pressure, Capy check).
- Maintain taste, scope, blocker semantics, and evidence standards.
- Ask the operator only when a real design trade-off needs human judgment.
由指挥器(管理员或委派的设计Agent)负责。生成决策包,而非原始执行指令。
职责:
- 将操作者意图细化为限定范围的目标。
- 从持久化内存(SYNTHESIS.md、DECISIONS.md、过往问题证据)中提取约束条件、非目标、阻塞因素和证据预期。
- 选择摩擦层级(Friction Tier)和适用的验证节点(SDD、L2 Pressure、Capy检查)。
- 维持风格、范围、阻塞语义和证据标准。
- 仅当需要人类判断真实的设计权衡时,才向操作者询问。
Layer 2: Dispatch / Operations
第二层:调度/操作
Owned by the dispatcher (Admin or Supervisor). Converts decision packets into
bounded Multica issues, monitors them, and routes follow-ups.
Responsibilities:
- Convert one decision packet into one or more bounded issues.
- Assign one owner per issue. Mention specialists only for narrow review.
- Monitor active / in_review / blocked / done states via .
multica issue list - Re-route only when new evidence changes state — never re-sweep unchanged work.
- Stop when only operator-call or external-platform blockers remain.
由调度器(管理员或主管)负责。将决策包转换为限定范围的Multica问题,监控这些问题并路由后续任务。
职责:
- 将一个决策包转换为一个或多个限定范围的问题。
- 为每个问题分配一位负责人。仅在需要专项评审时提及专家。
- 通过监控活跃/评审中/阻塞/完成状态。
multica issue list - 仅当新证据改变状态时重新路由——绝不重复处理未变更的工作。
- 当仅剩余操作者呼叫或外部平台阻塞时停止。
State Machine
状态机
| State | Meaning | Next |
|---|---|---|
| Shorthand or issue received, not yet designed | |
| Conductor is pulling context, refining intent, checking memory | |
| Scoped route produced, ready to dispatch | |
| One or more bounded issues created/assigned | |
| Waiting for agent evidence; periodic status scan | |
| Evidence evaluated against closeout gates | |
| Blocker typed: operator / external / permanent | |
| Evidence harvested, noise archived, memory updated | |
| Waiting for human input | |
| Objective achieved, all gates passed | terminal |
| 状态 | 含义 | 下一状态 |
|---|---|---|
| 收到简写指令或问题,尚未开始设计 | |
| 指挥器正在获取上下文、细化意图、检查内存 | |
| 已生成限定范围的执行路径,准备调度 | |
| 已创建/分配一个或多个限定范围的问题 | |
| 等待Agent提交证据;定期扫描状态 | |
| 根据收尾验证节点评估证据 | |
| 已分类阻塞类型:操作者/外部/永久 | |
| 已收集证据、归档无效信息、更新内存 | |
| 等待人类输入 | |
| 已达成目标,所有验证节点通过 | 终止 |
Decision Packet Format
决策包格式
Every decision packet must be posted as a structured comment before any issues
are created:
text
DECISION_PACKET
goal_id: <goal-shorthand-or-issue-id>
intent: <one-sentence objective>
route: <chosen execution path — which agents, skills, lanes>
owner: <primary executing agent>
reviewer: <who verifies evidence>
constraints: <non-negotiables, boundaries, blocked lanes>
evidence_expectations: <what proves success — exact artifacts or checks>
non_goals: <explicit exclusions>
blocker_conditions: <what would block this route>
tier: fast | standard | heavy
dedupe_key: <canonical key to prevent duplicate dispatch>
max_active: <max concurrent in_progress issues for this goal>
cooldown_minutes: <minimum minutes before next sweep or re-route>
verdict: READY_TO_DISPATCH | NEEDS_DESIGN | OPERATOR_NEEDED在创建任何问题之前,必须以结构化评论的形式发布每个决策包:
text
DECISION_PACKET
goal_id: <goal-shorthand-or-issue-id>
intent: <one-sentence objective>
route: <chosen execution path — which agents, skills, lanes>
owner: <primary executing agent>
reviewer: <who verifies evidence>
constraints: <non-negotiables, boundaries, blocked lanes>
evidence_expectations: <what proves success — exact artifacts or checks>
non_goals: <explicit exclusions>
blocker_conditions: <what would block this route>
tier: fast | standard | heavy
dedupe_key: <canonical key to prevent duplicate dispatch>
max_active: <max concurrent in_progress issues for this goal>
cooldown_minutes: <minimum minutes before next sweep or re-route>
verdict: READY_TO_DISPATCH | NEEDS_DESIGN | OPERATOR_NEEDEDDedupe, Cooldown, and Noise Controls
去重、冷却与无效信息控制
These prevent the controller from accumulating endless sweeper issues (per
DAS-741 / DAS-743 findings):
这些机制可防止控制器累积无尽的扫描任务(基于DAS-741 / DAS-743的研究结果):
Dedupe
去重
- Before creating any issue, check if an equivalent issue already exists using
the . Match on title prefix + active status (not done/cancelled).
dedupe_key - If an equivalent issue exists and is or
in_progress, do not create a duplicate. Post a status note on the existing issue instead.in_review - If an equivalent issue exists and is , check if the blocker changed before re-dispatching.
blocked
- 在创建任何问题之前,使用检查是否已存在等效问题。匹配标题前缀+活跃状态(非完成/已取消)。
dedupe_key - 如果等效问题存在且处于或
in_progress状态,则不创建重复问题,而是在现有问题上发布状态说明。in_review - 如果等效问题存在且处于状态,在重新调度前检查阻塞因素是否已变更。
blocked
Cooldown
冷却
- After dispatching a decision packet, set . Do not scan or re-route for this goal until the cooldown elapses.
cooldown_minutes - Default cooldown: 15 minutes for standard work, 30 minutes for heavy work.
- Cooldown resets when a new decision packet is dispatched.
- 调度决策包后,设置。在冷却时间结束前,不扫描或重新路由此目标的任务。
cooldown_minutes - 默认冷却时间:标准任务15分钟,重型任务30分钟。
- 当新决策包被调度时,重置冷却时间。
Max Active
最大活跃任务数
- No more than (default: 5) issues may be
max_activefor a single goal at any time.in_progress - If the limit is reached, new dispatch waits until an active issue moves to
or
in_review.done
- 单个目标同时处于状态的任务不得超过
in_progress(默认:5)个。max_active - 如果达到限制,新的调度需等待活跃任务转为或
in_review状态。done
Archive and Cancel Semantics
归档与取消语义
- Completed () issues: harvest evidence, then archive — do not re-read or re-route.
done - Stale issues (no activity for 48h): post one nudge, then flag for Supervisor.
in_review - Noise issues (created in error, duplicate, superseded): set to with a one-line reason. Do not leave them in
cancelled.blocked - The conductor itself must have a self-cancel condition: if the conductor issue
is the only remaining active issue and all dispatched work is or
done, post the final verdict and stop.cancelled
- 已完成()的任务:收集证据后归档——不再重新读取或路由。
done - 停滞的任务(48小时无活动):发送一次提醒,然后标记给主管。
in_review - 无效任务(错误创建、重复、已被取代):设置为并附上一行原因说明。不要将其留在
cancelled状态。blocked - 指挥器自身必须具备自取消条件:如果指挥器问题是唯一剩余的活跃问题,且所有已调度任务均为或
done,则发布最终结论并停止运行。cancelled
Autonomy Guards
自主操作防护机制
The conductor may act autonomously when:
- The decision packet is .
READY_TO_DISPATCH - No operator-call conditions are triggered.
- Dedupe confirms no duplicate work exists.
- The route does not touch blocked lanes (DAS-696, DAS-693, or operator-declared frozen lanes).
- Workbench Max is not in the route.
The conductor MUST call the operator when:
- A design trade-off needs human taste judgment (not just more context).
- A permission, secret, payment, or runtime mutation is required.
- A blocked lane is the only viable route.
- Two prior attempts at the same sub-task failed with different approaches.
- The dedupe key matches an existing active issue with a different decision packet (route conflict).
当满足以下条件时,指挥器可自主操作:
- 决策包状态为。
READY_TO_DISPATCH - 未触发操作者呼叫条件。
- 去重检查确认无重复工作。
- 执行路径未涉及阻塞通道(DAS-696、DAS-693或操作者声明的冻结通道)。
- 执行路径中不包含Workbench Max。
当出现以下情况时,指挥器必须呼叫操作者:
- 设计权衡需要人类的主观判断(而非仅需更多上下文)。
- 需要权限、密钥、支付或运行时变更。
- 唯一可行的执行路径涉及阻塞通道。
- 同一子任务的两次尝试采用不同方法均失败。
- 去重键匹配现有活跃问题,但决策包不同(路径冲突)。
Composition With Existing Layers
与现有层的组合
Goal Mode v2 composes with, but does not replace:
| Layer | How Goal Mode v2 uses it |
|---|---|
| Friction Tier Router | Selected during DESIGNING; encoded in decision packet tier field |
| SDD | Decision packet may route heavy work through SDD stages; design layer handles Raw Requirement + Product Design |
| L2 Pressure Gate | Applied during DESIGNING when objective touches remote/HarnessMax/leaderboard work |
| Goal Mode v1 | v2 replaces the persistence wrapper; v1 skill remains for simple single-agent /goal tasks |
| Auto Review Sweeper | Dispatch layer uses sweeper results during OBSERVING → REVIEWING transition |
| Self-Awareness | Run once at GOAL_CAPTURED for heavy-path goals before DESIGNING begins |
| Completion Cooling | Applied to dispatched child issues at 75/85/90/100 thresholds |
目标模式v2与现有层组合使用,但不替代它们:
| 层 | 目标模式v2的使用方式 |
|---|---|
| Friction Tier Router | 在DESIGNING阶段选择;编码到决策包的tier字段中 |
| SDD | 决策包可将重型任务路由至SDD阶段;设计层负责原始需求+产品设计 |
| L2 Pressure Gate | 当目标涉及远程/HarnessMax/排行榜任务时,在DESIGNING阶段应用 |
| Goal Mode v1 | v2取代了持久化包装器;v1技能仍适用于简单的单Agent /goal任务 |
| Auto Review Sweeper | 调度层在OBSERVING → REVIEWING转换期间使用扫描器结果 |
| Self-Awareness | 对于重型路径目标,在DESIGNING开始前于GOAL_CAPTURED阶段运行一次 |
| Completion Cooling | 在已调度子任务的75/85/90/100进度阈值处应用 |
Activation
激活方式
When activated by or , the conductor must:
/goalGOAL_MODE: yes- Apply before the first user-facing conductor note.
docs/agent-communication-profile.md - Post with objective, owner, non_goals, closeout gates, and operator-call conditions.
GOAL_LOCK - If Heavy Path: post before designing.
SELF_AWARENESS_BOOTSTRAP - If L2_PRESSURE: post before routing.
RV_PRESSURE_CHECK - Enter → produce
DESIGNING→ enterDECISION_PACKET.DISPATCHING - Continue the observe → review → archive → next-goal loop until DONE or OPERATOR_NEEDED.
当通过或激活时,指挥器必须:
/goalGOAL_MODE: yes- 在发布第一条面向用户的指挥器说明之前,应用。
docs/agent-communication-profile.md - 发布,包含目标、负责人、非目标、收尾验证节点和操作者呼叫条件。
GOAL_LOCK - 如果是重型路径:在设计前发布。
SELF_AWARENESS_BOOTSTRAP - 如果是L2_PRESSURE:在路由前发布。
RV_PRESSURE_CHECK - 进入状态→生成
DESIGNING→进入DECISION_PACKET状态。DISPATCHING - 继续观察→评审→归档→下一目标的循环,直到DONE或OPERATOR_NEEDED状态。
Closeout Contract
收尾协议
text
GOAL_MODE_V2_CLOSEOUT
goal_id:
objective:
state_machine_path: <trace of states visited, e.g. GOAL_CAPTURED→DESIGNING→...→DONE>
decision_packets_produced: <count>
issues_dispatched: <count and IDs>
evidence_harvested: <summary of completed evidence>
noise_cancelled: <count and IDs of cancelled duplicates>
operator_calls: <count and reasons, if any>
residual_risk:
archive_actions_taken:
verdict: PASS | FLAG | BLOCKtext
GOAL_MODE_V2_CLOSEOUT
goal_id:
objective:
state_machine_path: <trace of states visited, e.g. GOAL_CAPTURED→DESIGNING→...→DONE>
decision_packets_produced: <count>
issues_dispatched: <count and IDs>
evidence_harvested: <summary of completed evidence>
noise_cancelled: <count and IDs of cancelled duplicates>
operator_calls: <count and reasons, if any>
residual_risk:
archive_actions_taken:
verdict: PASS | FLAG | BLOCKValidation
验证
Before claiming DONE, verify:
- Every dispatched issue has a verdict (PASS/FLAG/BLOCK) or is properly cancelled.
- No duplicate active issues share the same dedupe_key.
- The conductor's self-cancel condition is satisfied.
- Evidence is posted on the goal issue, not scattered across child comments.
- was raised for every case where the conductor lacked authority.
OPERATOR_NEEDED
在宣告DONE之前,需验证:
- 每个已调度的问题均有结论(PASS/FLAG/BLOCK)或已被正确取消。
- 没有重复的活跃问题共享相同的dedupe_key。
- 指挥器的自取消条件已满足。
- 证据已发布在目标问题上,而非分散在子评论中。
- 所有指挥器缺乏权限的情况均已触发。
OPERATOR_NEEDED