Search Results: uat

Found 1,928 Skills

AI & Machine Learningalirezarezvani/claude-ski...

eval

Evaluate and rank agent results by metric or LLM judge for an AgentHub session.

icp-website-review

Evaluate a website, landing page, content, or any online asset through the eyes of pre-built synthetic ICP personas. Loads personas from icp-persona-builder output, then runs them against target URLs. Supports three modes: structured scorecard, freeform focus group, and head-to-head competitive comparison. Reusable — run against the same site after changes, or against new content anytime.

🇺🇸|EnglishTranslated

Code Qualitydbvc/skills

linus-tech-review

Technical solution evaluation and code review in the style of Linus Torvalds. Only use this when the user explicitly requests a Linus-style review or explicitly asks for a rigorous evaluation of code changes/technical solutions (e.g., "review changes/code", "evaluate if the solution is appropriate", "check submission standards", "linus-tech-review").

🇨🇳|ChineseTranslated

Data Processingmims-harvard/tooluniverse

tooluniverse-literature-deep-research

Conduct comprehensive literature research with target disambiguation, evidence grading, and structured theme extraction. Creates a detailed report with mandatory completeness checklist, biological model synthesis, and testable hypotheses. For biological targets, resolves official IDs (Ensembl/UniProt), synonyms, naming collisions, and gathers expression/pathway context before literature search. Default deliverable is a report file; for single factoid questions, uses a fast verification mode and may include an inline answer. Use when users need thorough literature reviews, target profiles, or to verify specific claims from the literature.

🇺🇸|EnglishTranslated

Backend Developmentpluginagentmarketplace/cu...

architecture-patterns

Design, evaluate, and document software architecture patterns

🇺🇸|EnglishTranslated

1 scripts/Checked

Product & Designgohypergiant/agent-skills

accelint-persona-review

Evaluate Figma designs from operator persona perspectives through design critique and user experience evaluation. Use when reviewing UX for specific user roles (e.g., air-surveillance-tech, weapons-director), conducting design reviews, or evaluating operator interfaces. Analyzes cognitive load, communication patterns, pain points, and system visibility. Works with Figma MCP (desktop/URL) and Outline docs.

🇺🇸|EnglishTranslated

AI & Machine Learningvinta/hal-9000

magi

Use only when the user explicitly requests brainstorming, evaluating architecture choices, or comparing options where no single concern dominates

🇺🇸|EnglishTranslated

Marketing & Growthdeanpeters/product-manage...

acquisition-channel-advisor

Evaluate acquisition channels using unit economics, customer quality, and scalability. Recommends scale/test/kill decisions.

🇺🇸|EnglishTranslated

AI & Machine Learningjpoehnelt/skills

agent-dx-cli-scale

A scoring scale for evaluating how well a CLI is designed for AI agents, based on the "Rewrite Your CLI for AI Agents" principles.

🇺🇸|EnglishTranslated

AI & Machine Learninggithub/awesome-copilot

eval-driven-dev

Instrument Python LLM apps, build golden datasets, write eval-based tests, run them, and root-cause failures — covering the full eval-driven development cycle. Make sure to use this skill whenever a user is developing, testing, QA-ing, evaluating, or benchmarking a Python project that calls an LLM, even if they don't say "evals" explicitly. Use for making sure an AI app works correctly, catching regressions after prompt changes, debugging why an agent started behaving differently, or validating output quality before shipping.

🇺🇸|EnglishTranslated

AI & Machine Learninglaunchdarkly/agent-skills

aiconfig-online-evals

Attach judges to AI Config variations for automatic LLM-as-a-judge evaluation. Create custom judges, configure sampling rates, and monitor quality scores.

🇺🇸|EnglishTranslated

Version Controlnathan13888/nice-skills

wtf

Quick situational awareness for the current git branch. Summarizes what a feature branch is about by analyzing commits and changes against trunk. On trunk, highlights recent interesting activity. Use when user says "wtf", "what's going on", "what is this branch", "what changed", or "catch me up".

🇺🇸|EnglishTranslated