Use when designing or auditing computer science experiments, evaluation plans, baselines, metrics, ablations, datasets, statistical tests, benchmarks, validity threats, or reproducibility claims.
npx skill4agent add vincenzoimp/academic-research-skills cs-methodology-evaluation

references/cs-methodology-evaluation-policy.md
references/experiment-policy.md
references/repository-contract.md
docs/methodology/evaluation-plan.md
docs/methodology/threats-to-validity.md
experiments/registry.csv