Search Results: testing

Found 3,723 Skills

codex-review

Codex code review closeout: local dirty changes, PR branch vs main, parallel tests.

AI & Machine Learning | alirezarezvani/claude-ski...

dossier

Decision-grade entity research skill — produces a hypothesis-tested dossier on a specific company, person, nonprofit, or government org, not a generic profile. Forcing intake makes the user state their hypothesis upfront (what they already believe and want to verify or disprove) so the dossier tests it rather than confirms it. Output is an editable Word document (.docx) with verdict on the hypothesis, identity facts, 12-month activity timeline, network signals, reputation signals, red flags, 3-5 conversation hooks tied to specific findings, and source-provenance audit log. Uses WebSearch + WebFetch + free APIs (SEC EDGAR, GitHub, ProPublica Nonprofit Explorer) as workhorses; optional BYOK MCPs (LinkedIn, Crunchbase, Apollo, Pitchbook, SimilarWeb) enhance coverage. Triggers: 'research [company]', 'dossier on [person/company]', 'background check on [entity]', 'prep me for a meeting with [person/company]', 'due diligence on [company]', 'what should I know about [entity]', 'research [person] before I [meet/hire/invest]', 'competitor research on [company]', 'investor diligence [company]', 'interview prep for [company]'. Honors sensitivity exclusions for journalism + personal-vetting contexts.

3 scripts/Attention

AI & Machine Learning | ruvnet/ruflo

trader-cloud-backtest

Run a heavy neural-trader job (long walk-forward, big Monte-Carlo, parameter sweep, model training) on the Anthropic Managed Agent cloud runtime instead of locally.

Testing & QA | cursor/plugins

control-ui

Build or adapt a local browser/CDP harness to drive and inspect a web, IDE, or Electron UI. Use for local UI verification, screenshots, accessibility snapshots, perf profiles, visual diffs, or reproducing UI bugs.

AI & Machine Learning | nvidia/skills

onboard-gb200-1node-tests

Onboard 1-node GitHub MR functional tests for GB200 from existing mr-scoped 2-node tests.

Testing & QA | unifapi-agent/skills-inde...

unifapi-smoke

Use this temporary smoke-test skill to verify skills.sh indexing and download snapshot behavior for a fresh UnifAPI agent skills repository.

Code Quality | affaan-m/everything-claud...

benchmark-optimization-loop

Use when the user asks to make something faster, try many variants, run recursive optimization, benchmark latency/throughput/cost, or choose the best implementation by repeated measured tests.

Product & Design | donchitos/claude-code-gam...

prototype

Concept prototype — validate the core idea is worth designing before writing GDDs. Run right after /brainstorm and /setup-engine. Routes to HTML, Engine, or Paper path based on game type. Produces a throwaway build and a PROCEED/PIVOT/KILL verdict.

Testing & QA | donchitos/claude-code-gam...

regression-suite

Map test coverage to GDD critical paths, identify fixed bugs without regression tests, flag coverage drift from new features, and maintain tests/regression-suite.md. Run after implementing a bug fix or before a release gate.

Project Management | donchitos/claude-code-gam...

vertical-slice

Pre-Production validation — build a production-quality end-to-end build to confirm the full game loop is achievable before committing to Production. Run after GDDs, architecture, and UX specs are complete. Produces a PROCEED/PIVOT/KILL verdict that gates the Pre-Production → Production transition.

Testing & QA | cursor/plugins

verify-this

Verify a claim with fresh local evidence: restate it falsifiably, capture baseline and treatment, compare artifacts, and return VERIFIED, NOT VERIFIED, or INCONCLUSIVE.

Security & Compliance | elementalsouls/claude-bug...

hunt-business-logic

Hunting skill for business logic vulnerabilities. Built from 12 public bug bounty reports. Covers coupon-race-stacking (Instacart, Stripe, Reverb), negative-quantity-in-cart price tampering (Upserve, Eternal/Zomato), decimal/fraction price-field overflow (Shipt), client-side checkout amount trust on PayPal redirect (WordPress.org), price-per-unit mass-assignment (Krisp), and archived-price swap / cart-TOCTOU (Stripe). Use when hunting business logic — heavy emphasis on financial-impact-demonstrated cases.
