Loading...
Loading...
Use when the user wants to iterate on a viral-article scoring system itself, calibrate or improve a scoring prompt against labeled samples, or run batch scoring experiments on a fixed article set. Best for prompt-only scoring research where the evaluator scripts stay fixed and only the scoring rubric/prompt is meant to evolve.
npx skill4agent add xiaomoboy/claude-writing-skills score-optimizerarticle-optimizerassets/articles/samples/cp labels.example.json labels.jsonreferences/score_program.mdreferences/score_prompt.mdcomposite_scoremaespearmanclassificationpython3 scripts/run_scoring.py > scoring.log 2>&1
python3 scripts/evaluate.py > eval.log 2>&1
grep "^composite_score:\|^mae:\|^spearman:\|^classification:" eval.logpython3 scripts/generate_score_chart.pyscripts/evaluate.pyscripts/run_scoring.py