Abstract
Training large language model agents on tasks at the frontier of theircapabilities is key to unlocking advanced reasoning. We introduce a datasynthesis approach inspired by the educational theory of the Zone of ProximalDevelopment (ZPD), which defines this frontier as tasks an LLM cannot solvealone but can master with guidance. To operationalize this, we present theAgentFrontier Engine, an automated pipeline that synthesizes high-quality,multidisciplinary data situated precisely within the LLM's ZPD. This enginesupports both continued pre-training with knowledge-intensive data and targetedpost-training on complex reasoning tasks. From the same framework, we derivethe ZPD Exam, a dynamic and automated benchmark designed to evaluate agentcapabilities on these frontier tasks. We train AgentFrontier-30B-A3B model onour synthesized data, which achieves state-of-the-art results on demandingbenchmarks like Humanity's Last Exam, even surpassing some leading proprietaryagents. Our work demonstrates that a ZPD-guided approach to data synthesisoffers a scalable and effective path toward building more capable LLM agents.