tao-run-automl-deft-pipeline
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseAutoML + DEFT Pipeline
AutoML + DEFT 工作流
A workflow-bridge skill that runs three phases in sequence by delegating to two existing skills — for HPO and a DEFT application skill (default for AOI; other skills for non-AOI cases) for the iterative data-improvement loop.
tao-run-automltao-run-deft-aoiskills/applications/deft-*This skill does not re-implement AutoML or DEFT. It owns only the connective tissue: HPO spec inputs, the spec-handoff between AutoML and DEFT, and the post-DEFT AutoML re-run on the augmented dataset.
这是一个工作流桥接技能,通过调用两个现有技能按顺序执行三个阶段——用于HPO(超参数优化),DEFT应用技能(AOI场景默认使用;非AOI场景使用其他技能)用于迭代式数据改进循环。
tao-run-automltao-run-deft-aoiskills/applications/deft-*本技能不会重新实现AutoML或DEFT,仅负责衔接工作:HPO规格输入、AutoML与DEFT之间的规格传递,以及在增强数据集上重新运行DEFT后的AutoML优化。
When this skill applies
适用场景
- User asks to "run the AOI workflow" or "improve my AOI ChangeNet model" — default to this skill, not directly. The bare DEFT loop is the inner stage of this pipeline.
tao-run-deft-aoi - User wants AutoML and DEFT chained on the same model/dataset
- User says "AutoML at both ends", "tune HPs then DEFT", "warm-start DEFT", "AutoML before and after DEFT"
- User has an AutoML-tuned spec and asks how to feed it into DEFT
- 用户要求“运行AOI工作流”或“改进我的AOI ChangeNet模型”——默认使用本技能,而非直接调用。纯DEFT循环是本流程的内部阶段。
tao-run-deft-aoi - 用户希望在同一模型/数据集上串联AutoML和DEFT
- 用户提到“两端使用AutoML”“先调优超参数再执行DEFT”“热启动DEFT”“DEFT前后都用AutoML”
- 用户已有AutoML调优后的规格,询问如何将其传入DEFT
When this skill does NOT apply
不适用场景
- User explicitly asks for the DEFT loop only ("run JUST the DEFT loop", "skip AutoML") → use directly
tao-run-deft-aoi - User wants only AutoML with no follow-on DEFT → use directly
tao-run-automl - User is doing zero-shot eval, RAG, or non-training workflows
- 用户明确要求仅运行DEFT循环(“仅运行DEFT循环”“跳过AutoML”)→ 直接使用
tao-run-deft-aoi - 用户仅需AutoML,无需后续DEFT → 直接使用
tao-run-automl - 用户正在进行零样本评估、RAG或非训练类工作流
The mental model
核心模型
Phase 1 (AutoML baseline) Phase 2 (DEFT loop, plain train) Phase 3 (AutoML refinement)
───────────────────────── ──────────────────────────────── ───────────────────────────
specs/baseline_spec.yaml (Phase 1 winner pre-seeds baseline ${RESULTS_DIR}/iter${N}/dataset/
train/base/training_set.csv — DEFT skips its baseline train) train_combined_iter${N}.csv
│ │ │
▼ ▼ ▼
[ AutoML HPO sweep ] [ DEFT: baseline-inference → RCA [ AutoML HPO sweep ]
N recommendations → iter 1..N (plain retrain) ] re-tunes HPs against the
pick best by val_loss / FAR RCA / route / SDG / mining DEFT-augmented dataset
│ │ │
▼ ▼ ▼
best HPs spec + ckpt ─────► DEFT-augmented CSV ───────────► final best checkpoint
+ iter winner checkpoint (the deliverable; no
(Phase 3 warm-starts from it) further retrain)The handoffs are:
- Phase 1 → Phase 2: a spec file AND the winning checkpoint. Retraining the same HPs in DEFT's baseline step is wasted compute, so the bridge deep-merges Phase 1's winning HPs onto , copies the winning checkpoint into
baseline_spec.yamlunder the filename DEFT expects, and pre-populates${RESULTS_DIR}/baseline/train/+deft_state.jsonso DEFT resumes at baseline inference → evaluate → RCA → iter 1. DEFT itself stays plain-train (loop_log.jsonlpreserved). Verbatim 4-step procedure inautoml_policy: off.references/handoff.md - Phase 2 → Phase 3: a training CSV AND the iter winner's checkpoint. The CSV () is AutoML's training data; the checkpoint (
train_combined_iter${N_final}.csvfromiterations.<best>.best_ckpt_path) is wired into each rec'sdeft_state.jsonso Phase 3 fine-tunes from Phase 2's winner rather than from scratch. Without this warm-start Phase 3 routinely regresses vs the iter winner. Phase 3's winning checkpoint is the deliverable — no separate retrain after Phase 3. Seetrain.pretrained_model_path.references/handoff.md
Phase 1 (AutoML baseline) Phase 2 (DEFT loop, plain train) Phase 3 (AutoML refinement)
───────────────────────── ──────────────────────────────── ───────────────────────────
specs/baseline_spec.yaml (Phase 1 winner pre-seeds baseline ${RESULTS_DIR}/iter${N}/dataset/
train/base/training_set.csv — DEFT skips its baseline train) train_combined_iter${N}.csv
│ │ │
▼ ▼ ▼
[ AutoML HPO sweep ] [ DEFT: baseline-inference → RCA [ AutoML HPO sweep ]
N recommendations → iter 1..N (plain retrain) ] re-tunes HPs against the
pick best by val_loss / FAR RCA / route / SDG / mining DEFT-augmented dataset
│ │ │
▼ ▼ ▼
best HPs spec + ckpt ─────► DEFT-augmented CSV ───────────► final best checkpoint
+ iter winner checkpoint (the deliverable; no
(Phase 3 warm-starts from it) further retrain)衔接逻辑如下:
- 阶段1 → 阶段2:传递规格文件和最优检查点。在DEFT的基线步骤中重新训练相同超参数会浪费计算资源,因此桥接技能会将阶段1的最优超参数深度合并到中,将最优检查点复制到
baseline_spec.yaml目录下并命名为DEFT期望的文件名,同时预填充${RESULTS_DIR}/baseline/train/和deft_state.json,使DEFT从基线推理→评估→RCA→迭代1开始执行。DEFT自身保持plain-train模式(保留loop_log.jsonl)。详细的四步流程见automl_policy: off。references/handoff.md - 阶段2 → 阶段3:传递训练CSV文件和迭代最优检查点。CSV文件()作为AutoML的训练数据;检查点(来自
train_combined_iter${N_final}.csv的deft_state.json)会被配置到每个推荐项的iterations.<best>.best_ckpt_path中,使阶段3从阶段2的最优模型开始微调,而非从头训练。如果没有热启动,阶段3的性能通常会比迭代最优模型退化。阶段3的最优检查点是最终交付物——阶段3结束后无需单独重训。详见train.pretrained_model_path。references/handoff.md
Why three phases instead of two
为何采用三阶段而非两阶段
- Phase 1 alone finds good HPs on the original training distribution, but the model still has the distributional gaps DEFT is designed to fill.
- Phase 2 alone (just DEFT) fills the gaps but uses whatever HPs was hand-authored with — usually not optimal.
specs/baseline_spec.yaml - Phase 3 alone would run AutoML against the augmented dataset, but without a tuned baseline the DEFT loop's iteration cost is higher (slower convergence, more iterations to hit the KPI).
Running all three: AutoML cheap-tunes once on the original data, DEFT does the heavy data work with reasonable HPs, then AutoML tunes again on the now-richer dataset. Phase 3 is the most important of the three for the final deployed FAR/recall.
- 仅阶段1:能在原始训练分布上找到优质超参数,但模型仍存在DEFT旨在填补的分布缺口。
- 仅阶段2(仅DEFT):能填补分布缺口,但使用的是中手动编写的超参数——通常并非最优。
specs/baseline_spec.yaml - 仅阶段3:会在增强数据集上运行AutoML,但如果没有调优后的基线,DEFT循环的迭代成本会更高(收敛速度慢,达到KPI所需迭代次数更多)。
运行全部三个阶段:AutoML在原始数据上快速调优一次,DEFT使用合理的超参数完成繁重的数据处理工作,然后AutoML在更丰富的数据集上再次调优。阶段3对最终部署的FAR/召回率最为重要。
Cost up-front
前期成本
The pipeline is sequential. Total wall-clock ≈ Phase 1 (N_automl × per-rec train) + Phase 2 (M iterations × per-iter cost) + Phase 3 (N_automl × per-rec train).
Note that Phase 2 has no separate baseline train — Phase 1's winning checkpoint is reused as DEFT's baseline, so the baseline cost lands inside Phase 1's N_automl trainings rather than as an extra retrain. Surface this to the user before kickoff. Typically Phase 2's iterations still dominate (each includes SDG + retrain), but Phase 1 and Phase 3 each add several hours on a single-GPU box. Use the per-job estimate from the user's setup (if they have one) rather than guessing minutes. See for the per-phase cost breakdown.
references/pitfalls.md流程为串行执行,总耗时≈阶段1(N_automl × 单推荐项训练时间) + 阶段2(M次迭代 × 单次迭代成本) + 阶段3(N_automl × 单推荐项训练时间)。
注意阶段2没有单独的基线训练——阶段1的最优检查点会被复用为DEFT的基线,因此基线成本包含在阶段1的N_automl次训练中,而非额外的重训。在启动前需向用户说明这一点。通常阶段2的迭代仍占主导(每次迭代包含SDG + 重训),但阶段1和阶段3在单GPU设备上各需数小时。请根据用户环境的单任务估算时间(如果有)来计算,而非猜测分钟数。各阶段成本明细见。
references/pitfalls.mdConsolidated Pre-Flight — one gate, all three phases
统一预检查——单入口,全三阶段
The pipeline has exactly one user gate. Before any side-effecting action (docker pull, docker login, any job-launch call delegated to a downstream skill, file mutations under ), the agent must produce a single consolidated Pre-Flight Summary that subsumes every downstream skill's preflight. Once the user approves, the run is autonomous through all three phases — no further interactive pauses.
${RESULTS_DIR}/The user explicitly does not want to be paged between phases. The DEFT loop's own inline gate becomes a zero-question display step (every value pre-supplied), as does 's shared launch preflight in Phases 1 and 3.
## Pre-Flight Summarytao-run-automlBefore printing the gate the agent must read every downstream preflight section in full and run every read-only check those sections prescribe, surfacing each outcome in the summary. Running every step of the DEFT skill's is mandatory — if any step is skipped the consolidated gate is invalid and the pipeline must not advance. The summary must include, in order: (1) workspace/host/platform/network, (2) credentials SET/UNSET status, (3) resolved container image URIs with PRESENT/MISSING, (4) dataset table with leakage check, (5) Phase 1 config, (6) Phase 2 config incl. pre-seeded baseline source, (7) Phase 3 config, (8) compute estimate, (9) the confirmation line. After the gate, pass every collected value through to each downstream skill so it has nothing to ask. The only allowed post-gate pauses are mid-run hard-stop safety gates (e.g. DEFT's KPI regression gate); call them out in the summary.
## Pre-FlightSee for the full build procedure, the exact mandatory contents of each summary section (with the GPU memory rule of thumb, DEFT loop defaults, and required inputs verbatim), the downstream gate-suppression inputs, and the fallback when an older skill-bank version hard-codes its own STOP gate.
references/preflight.md本流程仅有一个用户确认环节。在执行任何会产生副作用的操作(拉取docker镜像、docker登录、调用下游技能启动任务、修改下的文件)之前,Agent必须生成一份统一的预检查汇总,涵盖所有下游技能的预检查内容。用户确认后,流程将自动完成所有三个阶段——无需再进行交互式暂停。
${RESULTS_DIR}/用户明确不希望在阶段之间被打扰。DEFT循环自身的确认环节变为无提示展示步骤(所有值已预先提供),阶段1和阶段3中的共享启动预检查也是如此。
## Pre-Flight Summarytao-run-automl在展示确认环节之前,Agent必须完整阅读所有下游预检查部分,并执行这些部分规定的所有只读检查,在汇总中展示每个检查的结果。必须执行DEFT技能中的每一步——如果跳过任何步骤,统一确认环节将无效,流程不得继续。汇总必须按以下顺序包含:(1) 工作区/主机/平台/网络,(2) 凭据SET/UNSET状态,(3) 已解析的容器镜像URI及PRESENT/MISSING状态,(4) 数据集泄漏检查表,(5) 阶段1配置,(6) 阶段2配置(含预填充的基线源),(7) 阶段3配置,(8) 计算资源估算,(9) 确认语句。确认后,将所有收集到的值传递给每个下游技能,使其无需再询问。仅允许在运行中途遇到硬停止安全检查(例如DEFT的KPI退化检查)时暂停,并需在汇总中提前说明。
## Pre-Flight完整的构建流程、汇总各部分的必填内容(含GPU内存经验法则、DEFT循环默认值和必填输入)、下游检查抑制输入,以及旧版技能库硬编码自身STOP检查时的回退方案,均见。
references/preflight.mdPhase 1 — AutoML baseline
阶段1 —— AutoML基线
Invoke with:
tao-skill-bank:tao-run-automl| Input | AOI default | Notes |
|---|---|---|
| | Same model the DEFT loop expects |
| | Same training set DEFT will start from |
| | Held-out — must NOT be the KPI test set ( |
| FAR @ 100% recall (preferred) or | See |
| | LLM-brain or |
| 5–10 for AOI | More recs = better HPs but linear in compute |
| Pin epochs / batch_size; sweep optimizer-related HPs only | Otherwise AutoML wanders into long-train regimes that blow Phase 2's budget |
After the sweep finishes, AutoML's is the winning hyperparameter dict.
result["best"]["specs"]调用,参数如下:
tao-skill-bank:tao-run-automl| 输入 | AOI默认值 | 说明 |
|---|---|---|
| | 与DEFT循环预期的模型一致 |
| | 与DEFT初始使用的训练集一致 |
| | 预留数据集——不得使用KPI测试集( |
| FAR @ 100% recall(优先)或 | 见 |
| | 如果计算资源紧张,可使用LLM-brain或 |
| AOI场景为5–10 | 推荐项越多,超参数质量越高,但计算量呈线性增长 |
| 固定epochs/batch_size;仅调优优化器相关超参数 | 否则AutoML可能进入长训练周期,超出阶段2的预算 |
超参数搜索完成后,AutoML的即为最优超参数字典。
result["best"]["specs"]Handoff to Phase 2
向阶段2传递内容
Phase 1 hands over two artifacts: the winning spec and the winning checkpoint. Instead of retraining the same HPs in DEFT's baseline step, pre-seed DEFT's baseline state from Phase 1's outputs so DEFT starts at baseline inference → evaluate → RCA → iter 1. The four steps — write the merged , copy the winning checkpoint into , initialise with (and append the matching entry), then invoke DEFT — are given verbatim with the exact code in . inside the loop is preserved.
baseline_spec_automl.yaml${RESULTS_DIR}/baseline/train/deft_state.jsoniterations.baseline.stage_completed == "train"loop_log.jsonlreferences/handoff.mdautoml_policy: off阶段1传递两个工件:最优规格和最优检查点。为避免在DEFT的基线步骤中重新训练相同超参数,需从阶段1的输出预填充DEFT的基线状态,使DEFT从基线推理→评估→RCA→迭代1开始执行。四步流程——写入合并后的、将最优检查点复制到、初始化并设置(同时在中添加匹配条目)、然后调用DEFT——的具体代码见。循环内的保持不变。
baseline_spec_automl.yaml${RESULTS_DIR}/baseline/train/deft_state.jsoniterations.baseline.stage_completed == "train"loop_log.jsonlreferences/handoff.mdautoml_policy: offQuality check before handing off
传递前的质量检查
Run a quick eval of the winning checkpoint against the held-out set: per-class prediction counts (if it collapsed to one class, evaluate the 2nd or 3rd best instead) and a comparison to a zero-shot ChangeNet baseline (if AutoML did not improve over zero-shot, surface that and pause). See .
references/handoff.md针对预留数据集快速评估最优检查点:查看每类预测数量(如果模型崩溃为单一类别,则选择第二或第三优的模型),并与零样本ChangeNet基线对比(如果AutoML未优于零样本模型,需向用户说明并暂停)。详见。
references/handoff.mdPhase 2 — DEFT loop (plain training, baseline pre-seeded from Phase 1)
阶段2 —— DEFT循环(plain训练,基线由阶段1预填充)
Invoke (read its for the full interface). For non-AOI applications, invoke the matching DEFT skill; the handoff shape is the same.
tao-skill-bank:tao-run-deft-aoiSKILL.mdThe DEFT loop's baseline-train sub-step is skipped. Phase 1 already produced a checkpoint trained at the winning HPs, and Phase 1's handoff (see above) pre-populated and so DEFT resumes at baseline inference → evaluate → RCA → iter 1. The rest of the DEFT loop runs unchanged. Do not modify its invariant.
${RESULTS_DIR}/baseline/train/${RESULTS_DIR}/deft_state.jsonautoml_policy: offThe DEFT loop owns:
- The Pre-Flight Summary display step — not a fresh user gate. The Consolidated Pre-Flight (above) is the single gate; the DEFT summary still prints as an audit-trail display of the pre-seeded source but must not re-prompt, since every input was collected in the consolidated gate.
baseline/train/ - Baseline inference → evaluate → RCA on the pre-seeded checkpoint, and the full per-iteration RCA → routing → SDG → mining → assemble → train cycle.
- KPI gating and stop conditions; layout,
${RESULTS_DIR}/,deft_state.json,loop_log.jsonl.DEFT_Loop_Report.html
After the loop exits (KPI met or reached), capture two values from :
max_iterationsdeft_state.json- — the loop's best plain-train checkpoint
iterations.<best>.best_ckpt_path - The final iteration label — used to locate the augmented training CSV
N_final
If the DEFT loop hard-stops on an unrecoverable gate, skip Phase 3. There is no validated augmented CSV to feed AutoML.
调用(完整接口见其)。对于非AOI应用,调用对应的DEFT技能;传递逻辑相同。
tao-skill-bank:tao-run-deft-aoiSKILL.mdDEFT循环的基线训练子步骤将被跳过。阶段1已生成基于最优超参数训练的检查点,且阶段1的传递操作(见上文)已预填充和,使DEFT从基线推理→评估→RCA→迭代1开始执行。DEFT循环的其余部分保持不变。不得修改其的固定设置。
${RESULTS_DIR}/baseline/train/${RESULTS_DIR}/deft_state.jsonautoml_policy: offDEFT循环负责:
- 预检查汇总展示步骤——并非新的用户确认环节。统一预检查(见上文)是唯一的确认环节;DEFT汇总仍会打印,作为预填充的源的审计轨迹展示,但不得重新提示,因为所有输入已在统一预检查中收集完成。
baseline/train/ - 基于预填充检查点的基线推理→评估→RCA,以及完整的每迭代RCA→路由→SDG→挖掘→组装→训练周期。
- KPI检查和停止条件;目录结构、
${RESULTS_DIR}/、deft_state.json、loop_log.jsonl。DEFT_Loop_Report.html
循环退出后(达到KPI或),从中获取两个值:
max_iterationsdeft_state.json- ——循环的最优plain-train检查点
iterations.<best>.best_ckpt_path - 最终迭代标签——用于定位增强后的训练CSV
N_final
如果DEFT循环因不可恢复的检查点硬停止,跳过阶段3。此时没有经过验证的增强CSV可提供给AutoML。
Phase 3 — AutoML refinement on the DEFT-augmented dataset
阶段3 —— 基于DEFT增强数据集的AutoML优化
Re-invoke with the augmented training CSV as the train dataset, the same held-out validation CSV as before, and Phase 2's iter winner checkpoint as the warm-start:
tao-skill-bank:tao-run-automl| Input | AOI value |
|---|---|
| |
| |
| Same as Phase 1 ( |
| Same metric as Phase 1 |
| Same as Phase 1 |
| 5–10 |
| Initial spec | Start from |
| Warm-start checkpoint | |
The warm-start is mandatory: with no warm-start, every rec starts from random init with only 10-20 epochs to reconverge, Phase 3's regresses 0.03-0.05 vs iter1, and the safety net silently rolls back to the iter winner — wasting Phase 3's compute. The concrete code (selecting the lowest- iteration, excluding any prior ), the broad-exploration tradeoff, output to , and wiring Phase 3's checkpoint back into the DEFT report via + re-running (with the regression safety net) are all in .
val_loss_pick_bestspec_overridesfar_pctfinal_automl${RESULTS_DIR}/final_automl/iterations.final_automlprepare_inference_spec.py_pick_bestreferences/handoff.md重新调用,使用增强后的训练CSV作为训练数据集,使用与之前相同的预留验证CSV,并以阶段2的迭代最优检查点作为热启动:
tao-skill-bank:tao-run-automl| 输入 | AOI场景值 |
|---|---|
| |
| |
| 与阶段1相同( |
| 与阶段1相同 |
| 与阶段1相同 |
| 5–10 |
| 初始规格 | 从 |
| 热启动检查点 | 来自 |
热启动是强制要求:如果没有热启动,每个推荐项都会从随机初始化开始,仅用10-20个 epoch重新收敛,阶段3的会比迭代1退化0.03-0.05,且安全机制会自动回退到迭代最优模型——浪费阶段3的计算资源。具体的代码(选择最低的迭代,排除任何先前的)、广泛探索的权衡、输出到,以及通过将阶段3的检查点重新关联到DEFT报告并重新运行(含退化安全机制)的逻辑,均见。
val_loss_pick_bestspec_overridesfar_pctfinal_automl${RESULTS_DIR}/final_automl/iterations.final_automlprepare_inference_spec.py_pick_bestreferences/handoff.mdPitfalls and quality checks
注意事项与质量检查
These apply to both AutoML phases — bake them into agent behavior, don't just paste once. The full detail is in :
references/pitfalls.md- Metric pitfalls (AOI is class-imbalanced). ChangeNet AOI is PASS-dominant; can mode-collapse to a zero-recall PASS-everything model. Prefer FAR @ 100%-recall directly, or gate val_loss with a
val_losssanity check, or decide top-K by FAR @ 100%-recall. For balanced / regression tasks, val_loss is fine.pred_counts - Run-to-run noise. AutoML can show 2–3× metric variance for the same config. If the winner looks suspiciously better than the runner-up, re-run with a fresh seed before committing the spec to Phase 2.
- Cleanliness (data leakage). Both AutoML phases use a validation set distinct from the KPI test set (), which stays untouched until DEFT's evaluate stage. Phase 3 trains on the augmented CSV but keeps the same val set so Phase 1 and Phase 3 numbers stay comparable.
kpi/testing_set.csv - Compute budget. Surface the per-phase structure up front and only give a wall-clock range after the user supplies their per-job time.
这些规则适用于两个AutoML阶段——需融入Agent行为,而非仅粘贴一次。详细内容见:
references/pitfalls.md- 指标陷阱(AOI存在类别不平衡):ChangeNet AOI以PASS类为主;可能导致模型崩溃为零召回率的“全PASS”模型。优先直接使用FAR @ 100%-recall,或用
val_loss合理性检查约束val_loss,或按FAR @ 100%-recall选择前K个模型。对于平衡/回归任务,val_loss是可行的。pred_counts - 运行间噪声:相同配置下,AutoML的指标可能存在2–3倍的差异。如果最优模型的性能明显优于次优模型,在将规格提交给阶段2之前,需使用新种子重新运行。
- 数据清洁度(数据泄漏):两个AutoML阶段使用的验证集均与KPI测试集()分离,该测试集在DEFT的评估阶段之前保持未使用状态。阶段3在增强CSV上训练,但使用相同的验证集,使阶段1和阶段3的结果具有可比性。
kpi/testing_set.csv - 计算预算:提前向用户说明各阶段的结构,仅在用户提供单任务时间后给出耗时范围。
Quick Start (AOI worked example)
快速开始(AOI示例)
When starting fresh from "run the AOI workflow", the agent delivers a three-phase worded message to the user (Phase 1 AutoML baseline → Phase 2 DEFT loop → Phase 3 AutoML refinement, with the cost framing and "OK to proceed?" close), then after confirmation invokes (Phase 1), writes the merged spec, pre-seeds , invokes (Phase 2) with every input pre-supplied, and invokes again (Phase 3) — with no further pauses unless a downstream skill hits an unrecoverable hard-stop gate — then summarizes the trajectory (baseline AutoML best → DEFT iter 1 → ... → DEFT iter N_final → Phase 3 best).
tao-run-automldeft_state.jsontao-run-deft-aoitao-run-automlSee for the verbatim customer-facing message and the exact post-confirmation invoke sequence.
references/quick-start.md当用户从“运行AOI工作流”开始时,Agent需向用户发送包含三阶段的文字说明(阶段1:AutoML基线 → 阶段2:DEFT循环 → 阶段3:AutoML优化,包含成本说明和“是否继续?”的确认),用户确认后调用(阶段1)、写入合并后的规格、预填充、调用(阶段2,所有输入预先提供)、再次调用(阶段3)——除非下游技能遇到不可恢复的硬停止检查点,否则无需再暂停——最后总结整个流程的性能轨迹(基线AutoML最优 → DEFT迭代1 → ... → DEFT迭代N_final → 阶段3最优)。
tao-run-automldeft_state.jsontao-run-deft-aoitao-run-automl面向客户的标准话术和确认后的具体调用序列见。
references/quick-start.mdNon-AOI DEFT applications
非AOI DEFT应用
The same three-phase pattern applies to other DEFT skills — swap , the Phase 2 DEFT skill, the spec/checkpoint path conventions, and the Phase 3 augmented-CSV path. The handoff shape (Phase 1 emits spec + checkpoint that pre-seeds the DEFT baseline, Phase 2 emits an augmented dataset, Phase 3 emits the final checkpoint) is identical, and the baseline-skip mechanism is generic to any DEFT-style loop with a resumable baseline state. See .
network_archreferences/quick-start.md相同的三阶段模式适用于其他DEFT技能——只需替换、阶段2的DEFT技能、规格/检查点路径约定,以及阶段3的增强CSV路径。传递逻辑(阶段1输出规格+检查点预填充DEFT基线,阶段2输出增强数据集,阶段3输出最终检查点)完全相同,且基线跳过机制适用于任何具有可恢复基线状态的DEFT风格循环。详见。
network_archreferences/quick-start.mdSee also
相关链接
- — AutoML interface, algorithms, HP ranges
tao-skill-bank:tao-run-automl - — full DEFT AOI loop (Phase 2 default)
tao-skill-bank:tao-run-deft-aoi - — underlying ChangeNet train/eval/infer skill (used by both AutoML and DEFT)
tao-skill-bank:tao-train-visual-changenet - Other skills — non-AOI Phase 2 targets
skills/applications/deft-* - — building the consolidated pre-flight gate
references/preflight.md - — Phase 1→2 pre-seed, Phase 2 quality check, Phase 3 warm-start + report wiring
references/handoff.md - — metric, noise, leakage, and compute-budget guidance
references/pitfalls.md - — verbatim worked-example message and non-AOI variant
references/quick-start.md
- ——AutoML接口、算法、超参数范围
tao-skill-bank:tao-run-automl - ——完整DEFT AOI循环(阶段2默认)
tao-skill-bank:tao-run-deft-aoi - ——底层ChangeNet训练/评估/推理技能(AutoML和DEFT均使用)
tao-skill-bank:tao-train-visual-changenet - 其他技能——非AOI场景的阶段2目标
skills/applications/deft-* - ——构建统一预检查确认环节
references/preflight.md - ——阶段1→2预填充、阶段2质量检查、阶段3热启动+报告关联
references/handoff.md - ——指标、噪声、泄漏和计算预算指南
references/pitfalls.md - ——标准示例话术和非AOI变体
references/quick-start.md