Search Results: autoresearch

Found 26 Skills

setup

Set up a new autoresearch experiment interactively. Collects domain, target file, eval command, metric, direction, and evaluator.

Code Quality

levnikolaevich/claude-cod...

ln-811-algorithm-optimizer

Optimizes algorithms via autoresearch loop: benchmark, research, hypothesize, keep/discard

Automation

paulrberg/agent-skills

autoresearch

This skill should be used when the user asks to "run autoresearch", "optimize X in a loop", "set up autonomous experiments", "start autoresearch", "optimize X overnight", or "experiment loop". Sets up and runs an autonomous experiment loop for any optimization target.

2 scripts / Attention

AI & Machine Learning

alirezarezvani/claude-ski...

autoresearch-agent

Autonomous experiment loop that optimizes any file by a measurable metric. Inspired by Karpathy's autoresearch. The agent edits a target file, runs a fixed evaluation, keeps improvements (git commit), discards failures (git reset), and loops indefinitely. Use when: user wants to optimize code speed, reduce bundle/image size, improve test pass rate, optimize prompts, improve content quality (headlines, copy, CTR), or run any measurable improvement loop. Requires: a target file, an evaluation command that outputs a metric, and a git repo.

3 scripts / Attention

AI & Machine Learning

alpoxdev/hypercore

autoresearch-skill

[Hyper] Optimize an existing Codex skill through baseline-first experiments, binary evals, optional guards, and one-mutation-at-a-time iteration. Use for skill autoresearch, measured trigger/workflow improvement, self-optimizing a skill, benchmarking skill changes, or resuming skill experiment artifacts.

1 script / Checked

AI & Machine Learning

uditgoenka/autoresearch

autoresearch

Autonomous Goal-directed Iteration. Apply Karpathy's autoresearch principles to ANY task. Loops autonomously — modify, verify, keep/discard, repeat. Supports optional loop count via Claude Code's /loop command.

Automation

davebcn87/pi-autoresearch

autoresearch-create

Set up and run an autonomous experiment loop for any optimization target. Gathers what to optimize, then starts the loop immediately. Use when asked to "run autoresearch", "optimize X in a loop", "set up autoresearch for X", or "start experiments".

Automation

github/awesome-copilot

autoresearch

Autonomous iterative experimentation loop for any programming task. Guides the user through defining goals, measurable metrics, and scope constraints, then runs an autonomous loop of code changes, testing, measuring, and keeping/discarding results. Inspired by Karpathy's autoresearch. USE FOR: autonomous improvement, iterative optimization, experiment loop, auto research, performance tuning, automated experimentation, hill climbing, try things automatically, optimize code, run experiments, autonomous coding loop. DO NOT USE FOR: one-shot tasks, simple bug fixes, code review, or tasks without a measurable metric.

Automation

dabiggm0e/autoresearch-op...

autoresearch

Set up and run an autonomous experiment loop for any optimization target. Use when asked to start autoresearch or run experiments.

AI & Machine Learning

theneoai/awesome-skills

autoresearch

AI autonomous research agent for LLM training optimization using opencode as the agent. The agent autonomously modifies train.py, runs experiments, evaluates val_bpb, and iterates to find the best model. Use when: "run autoresearch", "start experiment", "train model", "autonomous research", "optimize LLM training".

2 scripts / Attention

Automation

factory-ai/factory-plugin...

autoresearch

Autonomous experiment loop for optimization research. Use when the user wants to:
- Optimize a metric through systematic experimentation (ML training loss, test speed, bundle size, build time, etc.)
- Run an automated research loop: try an idea, measure it, keep improvements, revert regressions, repeat
- Set up autoresearch for any codebase with a measurable optimization target

Implements the autoresearch pattern with MAD-based confidence scoring, git branch isolation, and structured experiment logging.

1 script / Checked

AI & Machine Learning

pedronauck/skills

autoresearch

Autonomously optimize any Claude Code skill by running it repeatedly, scoring outputs against binary evals, mutating the prompt, and keeping improvements. Based on Karpathy's autoresearch methodology. Use when: optimize this skill, improve this skill, run autoresearch on, make this skill better, self-improve skill, benchmark skill, eval my skill, run evals on. Outputs: an improved SKILL.md, a results log, and a changelog of every mutation tried.
