Loading...
Loading...
Found 3 Skills
Use when validating subjective quality criteria that cannot be deterministically tested — applies LLM-based evaluation with structured rubrics for tone, aesthetics, UX feel, documentation quality, and code readability. Triggers: documentation quality check, error message tone review, UX copy evaluation, code readability assessment, design aesthetic review.
INTERNAL sub-agent for blind 9-dimensional rubric scoring. **NOT a user-facing skill — do NOT invoke from the main conversation.** It is called via the Task tool by cheat-score / cheat-predict / cheat-bump to generate a context-isolated score for a script. It ONLY accepts script_path + rubric_notes_path; any other input will be refused. It outputs strict JSON: 9 dimensions × {score 0-5, confidence enum, one-line reason}. **It strictly refuses to read** .cheat-state.json, predictions/*, retro sections, or any content that may leak post-publish data. This is Channel B in the 3-channel calibration model (A=main, B=blind sub-agent, C=cross-model).
Score, grade, or evaluate things using AI against a rubric. Use when grading essays, scoring code reviews, rating candidate responses, auditing support quality, evaluating compliance, building a quality rubric, running QA checks against criteria, assessing performance, rating content quality, or any task where you need numeric scores with justifications — not just categories.