by evo-hq · Codex Skill · ★ 1.2k
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
evo A plugin for Claude Code and Codex that optimizes code through experiments. You give it a codebase. It discovers metrics to optimize, sets up the evaluation, and starts running experiments in a loop -- trying things, keeping what improves the score, throwing away what doesn't. Inspired by Karpathy's autoresearch -- where an LLM runs training experiments autonomously to beat its own best score. Autoresearch is a pure hill climb: try something, keep or revert, repeat on a single branch. Evo adds structure on top of that idea: Tree search over greedy hill climb. Multiple directions can fork from any committed node, so exploration doesn't collapse to one path. Parallel semi-autonomous agents. Spawn multiple subagents and run them simultaneously, each in its own git worktree. Each subagent reads traces, formulates hypotheses, and can run multiple iterations within its branch. Shared state. Failure traces, annotations, and discarded hypotheses are accessible to every agent before it decides what to try next. Gating. Regression tests or safety checks can be wired up as a gate. Experiments that don't pass get discarded. Observability. A dashboard to monitor your experiments.
| Stars | 1,201 |
| Forks | 88 |
| Language | Python |
| Category | Codex Skill |
| License | Apache-2.0 |
| Quality Score | 55.468/100 |
| Open Issues | 11 |
| Last Updated | 2026-06-19 |
| Created | 2026-04-05 |
| Platforms | claude-code, codex, python |
| Est. Tokens | ~14k |
Looking for a evo alternative? If you're comparing evo with other codex skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
A codex plugin for running optimization loops inside a codebase. It is useful when you have a measurable targe
CORAL is a robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearc
A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Ka
🦞+🔬 NanoResearch: The Autonomous AI Research Assistant
A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Ka
Curated list of AutoResearch use cases with optimization traces and open source implementations
Explore other popular codex skill tools:
evo is turns your codebase into an autoresearch loop — discovers what to measure, instruments the benchmark, then runs tree search with parallel subagents.. It is categorized as a Codex Skill with 1.2k GitHub stars.
evo is primarily written in Python. It covers topics such as agent-skills, autonomous-agents, autoresearch.
You can find installation instructions and usage details in the evo GitHub repository at github.com/evo-hq/evo. The project has 1.2k stars and 88 forks, indicating an active community.
evo is released under the Apache-2.0 license, making it free to use and modify according to the license terms.
The top alternatives to evo on Agent Skills Hub include codex-autoresearch, CORAL, awesome-autoresearch. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.