Found 2 Skills
Honestly evaluate AI work quality using a two-axis scoring system. Use after completing a task, code review, or work session to get an unbiased assessment. Detects score inflation, forces devil's advocate reasoning, and persists scores across sessions.
Use when the user asks about "confidence", "accuracy", "how sure", "uncertain", "reliable", or "track record", or when Claude should express calibrated uncertainty. Enables domain-aware self-assessment.