Search Results: uat

Found 1,928 Skills

Product & Designmastepanoski/claude-skill...

nielsen-heuristics-audit

Evaluate UX/UI using Jakob Nielsen's 10 usability heuristics. Comprehensive audit of visibility, control, consistency, error prevention, recognition, flexibility, aesthetics, error recovery, and documentation.

🇺🇸|EnglishTranslated

AI & Machine Learninggoogle/adk-docs

adk-eval-guide

MUST READ before running any ADK evaluation. ADK evaluation methodology — eval metrics, evalset schema, LLM-as-judge, tool trajectory scoring, and common failure causes. Use when evaluating agent quality, running adk eval, or debugging eval results. Do NOT use for API code patterns (use adk-cheatsheet), deployment (use adk-deploy-guide), or project scaffolding (use adk-scaffold).

🇺🇸|EnglishTranslated

AI & Machine Learningmindrally/skills

meta-prompt

Meta-prompting framework for critiquing responses, analyzing solution trajectories, and evaluating AI-generated content quality

🇺🇸|EnglishTranslated

Automationalirezarezvani/claude-ski...

run

Run a single experiment iteration. Edit the target file, evaluate, keep or discard.

🇺🇸|EnglishTranslated

AI & Machine Learningthe-perfect-developer/the...

google-adk

This skill should be used when the user asks to "build an agent with Google ADK", "use the Agent Development Kit", "create a Google ADK agent", "set up ADK tools", or needs guidance on Google's Agent Development Kit best practices, multi-agent systems, or agent evaluation.

🇺🇸|EnglishTranslated

Marketing & Growthslavingia/skills

grow-sustainably

Evaluate business decisions through the lens of sustainable, profitable growth. Use when someone is making decisions about spending, hiring, fundraising, or scaling their business.

🇺🇸|EnglishTranslated

AI & Machine Learningnotque/claude-code-toolki...

skill-eval

Evaluate and improve skills through measured testing. Run trigger evaluations to test whether skill descriptions cause correct activation, optimize descriptions via automated train/test loops, benchmark skill output quality with A/B comparisons, and validate skill structure. Use when user says "improve skill", "test skill triggers", "optimize description", "benchmark skill", "eval skill", or "skill quality". Do NOT use for creating new skills (use skill-creator-engineer).

🇺🇸|EnglishTranslated

Marketing & Growthextruct-ai/gtm-skills

email-response-simulation

Deep persona simulation and skeptical buyer review for cold emails. Builds a full prospect "world" from LinkedIn + company data, defines their professional reality (KPIs, pain points, inbox behavior), then runs a skeptical buyer roast — emotional reaction first, business evaluation second. One prospect at a time, Tier 1 only. Triggers on: "review email", "copy feedback", "email feedback", "would they reply", "persona review", "check this email", "review this draft", "roast this email", "skeptical buyer".

🇺🇸|EnglishTranslated

Documentation & Writingnocobase/skills

nocobase-utils

General-purpose NocoBase reference utilities covering cross-cutting topics such as evaluator engines, expression syntax, and more. Use when you need authoritative reference information that applies across multiple NocoBase features.

🇺🇸|EnglishTranslated

Documentation & Writingmathews-tom/armory

research-critique

Critical analysis of research papers, academic manuscripts, preprints, and technical studies — evaluating methodology, claims-evidence alignment, contribution significance, and intellectual honesty. Produces coherent analytical responses (not checklists) that distinguish genuine weaknesses from standard field limitations. Governs intellectual posture: collegial reader, not adversarial reviewer. Triggers on: "critique this paper", "review this research", "what do you think of this paper", "analyze this study", "evaluate the methodology", "is this paper sound", "assess this research", "strengths and weaknesses of this paper", "does the evidence support the claims". Use this skill when the user provides a research paper, preprint, or technical study and asks for critical evaluation of its scientific merit, methodology, or contribution — not formatting, citation hygiene, or submission readiness (use manuscript-review for those).

🇺🇸|EnglishTranslated

Testing & QAmathews-tom/armory

benchmark-runner

Designs structured benchmarks for comparing algorithms, models, or implementations. Selects appropriate metrics (latency, throughput, memory, accuracy), designs representative test cases, captures hardware/software context, produces comparison tables with tradeoff analysis, and includes reproduction instructions. Triggers on: "benchmark", "compare performance", "which is faster", "latency comparison", "memory comparison", "run benchmark", "design benchmark", "compare implementations", "evaluate algorithms", "performance comparison", "throughput test", "speed test". Use this skill when comparing two or more implementations, algorithms, or models.

🇺🇸|EnglishTranslated

AI & Machine Learninggyanesh-m/skills

galileo-typescript-sdk

Complete reference for the Galileo AI platform TypeScript/JS SDK for evaluating, observing, and protecting GenAI applications. Use when building Node.js or TypeScript applications that need LLM evaluation, production observability, tracing, or runtime guardrails with Galileo.

🇺🇸|EnglishTranslated