Search Results: uat

Found 1,906 Skills

Data Processingagiprolabs/claude-trading...

wallet-profiling

Behavioral classification, performance analysis, and trading style detection for Solana wallets

skill-audit

Audit existing skills with Tessl scoring, metadata and trigger-coverage checks, repo conventions, and skill-authoring best practices. Use when creating or revising a skill, triaging weak self-activation, or comparing a skill against source-repo guidance such as `AGENTS.md`, `CLAUDE.md`, or repo rules, plus external skill guidance. Do not use to verify general application code or to rewrite unrelated docs.

🇺🇸|EnglishTranslated

AI & Machine Learninggarrytan/gstack

benchmark-models

Cross-model benchmark for gstack skills. Runs the same prompt through Claude, GPT (via Codex CLI), and Gemini side-by-side — compares latency, tokens, cost, and optionally quality via LLM judge. Answers "which model is actually best for this skill?" with data instead of vibes. Separate from /benchmark, which measures web page performance. Use when: "benchmark models", "compare models", "which model is best for X", "cross-model comparison", "model shootout". (gstack) Voice triggers (speech-to-text aliases): "compare models", "model shootout", "which model is best".

🇺🇸|EnglishTranslated

AI & Machine Learningruvnet/ruflo

gaia-submission

Walk through a complete GAIA benchmark→submit flow — from key resolution through HAL-compatible package generation

🇺🇸|EnglishTranslated

AI & Machine Learningyonatangross/orchestkit

context-compression

Use when conversation context is too long, hitting token limits, or responses are degrading. Compresses history while preserving critical information using anchored summarization and probe-based validation.

🇺🇸|EnglishTranslated

Tools & Utilitiescisco-ai-defense/skill-sc...

safe-calculator

A safe calculator for mathematical expressions

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningrysweet/amplihack

eval-recipes-runner

Run Microsoft's eval-recipes benchmarks to validate amplihack improvements against baseline agents. Auto-activates when testing improvements, running evals, or benchmarking changes.

🇺🇸|EnglishTranslated

AI & Machine Learningtondevrel/scientific-agen...

scikit-learn

The industry standard library for machine learning in Python. Provides simple and efficient tools for predictive data analysis, covering classification, regression, clustering, dimensionality reduction, model selection, and preprocessing.

🇺🇸|EnglishTranslated

Tools & Utilitieshexbee/hello-skills

mungers-lattice

Multidisciplinary analytical engine using Charlie Munger's latticework of mental models. Applies cross-disciplinary thinking (math, physics, biology, psychology, economics) to dissect life and business decisions. Use when user presents a decision problem, investment question, or complex analysis request requiring deep rational analysis.

🇺🇸|EnglishTranslated

AI & Machine Learningg1joshi/agent-skills

mlflow

MLflow ML lifecycle management. Use for ML experiment tracking.

🇺🇸|EnglishTranslated

AI & Machine Learningbitwize-music-studio/clau...

researchers-journalism

Researches investigative articles, interviews, and news coverage. Use when research needs journalistic sources for cross-referencing or additional context.

🇺🇸|EnglishTranslated

Testing & QAnahisaho/codegraphmcpserv...

quality-assurance

Copilot agent that assists with comprehensive QA strategy and test planning to ensure product quality through systematic testing and quality metrics Trigger terms: QA, quality assurance, test strategy, QA plan, quality metrics, test planning, quality gates, acceptance testing, regression testing Use when: User requests involve quality assurance tasks.

🇺🇸|EnglishTranslated