Loading...
Loading...
Found 5 Skills
Use when planning, running, comparing, or recording computational experiments, benchmarks, ablations, autonomous research loops, overnight runs, training runs, or exploratory variants.
Design experiment plans with progressive stages — initial implementation, baseline tuning, creative research, and ablation studies. Plan baselines, datasets, hyperparameter sweeps, and evaluation metrics. Use when planning experiments for a research paper.
Use when main results pass result-to-claim (claim_supported=yes or partial) and ablation studies are needed for paper submission. Codex designs ablations from a reviewer's perspective, CC reviews feasibility and implements.
Help a CS or AI PhD student design hypothesis-driven experiments with baselines, variables, metrics, controls, logging, and stop conditions. Use this skill whenever the user is about to run experiments, compare models, plan an ablation, debug inconclusive results, prepare an experiment section, or wants to avoid changing too many things at once.
Design hypothesis-driven ML/AI experiments before running them. Use this skill whenever the user wants to plan experiments, ablations, baselines, metrics, controls, seeds, logging, stop conditions, reviewer-proof evidence, or an experiment matrix for a paper claim before using run-experiment or writing results.