Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine

Abstract

Recent studies operationalize self-improvement through coding agents thatedit their own codebases. They grow a tree of self-modifications throughexpansion strategies that favor higher software engineering benchmarkperformance, assuming that this implies more promising subsequentself-modifications. However, we identify a mismatch between the agent'sself-improvement potential (metaproductivity) and its coding benchmarkperformance, namely the Metaproductivity-Performance Mismatch. Inspired byHuxley's concept of clade, we propose a metric ($\mathrm{CMP}$) that aggregatesthe benchmark performances of the descendants of an agent as an indicator ofits potential for self-improvement. We show that, in our self-improving codingagent development setting, access to the true $\mathrm{CMP}$ is sufficient tosimulate how the G\"odel Machine would behave under certain assumptions. Weintroduce the Huxley-G\"odel Machine (HGM), which, by estimating $\mathrm{CMP}$and using it as guidance, searches the tree of self-modifications. On SWE-benchVerified and Polyglot, HGM outperforms prior self-improving coding agentdevelopment methods while using less wall-clock time. Last but not least, HGMdemonstrates strong transfer to other coding datasets and large languagemodels. The agent optimized by HGM on SWE-bench Verified with GPT-5-mini andevaluated on SWE-bench Lite with GPT-5 achieves human-level performance,matching the best officially checked results of human-engineered coding agents.Our code is available at https://github.com/metauto-ai/HGM.

Quick Read (beta)

loading the full paper ...