Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Abstract

Public research results on large-scale supervised finetuning of AI agentsremain relatively rare, since the collection of agent training data presentsunique challenges. In this work, we argue that the bottleneck is not a lack ofunderlying data sources, but that a large variety of data is fragmented acrossheterogeneous formats, tools, and interfaces. To this end, we introduce theagent data protocol (ADP), a light-weight representation language that servesas an "interlingua" between agent datasets in diverse formats and unified agenttraining pipelines downstream. The design of ADP is expressive enough tocapture a large variety of tasks, including API/tool use, browsing, coding,software engineering, and general agentic workflows, while remaining simple toparse and train on without engineering at a per-dataset level. In experiments,we unified a broad collection of 13 existing agent training datasets into ADPformat, and converted the standardized ADP data into training-ready formats formultiple agent frameworks. We performed SFT on these data, and demonstrated anaverage performance gain of ~20% over corresponding base models, and deliversstate-of-the-art or near-SOTA performance on standard coding, browsing, tooluse, and research benchmarks, without domain-specific tuning. All code and dataare released publicly, in the hope that ADP could help lower the barrier tostandardized, scalable, and reproducible agent training.

Quick Read (beta)

loading the full paper ...