Abstract
The field of Multi-Agent Reinforcement Learning (MARL) is currently facing areproducibility crisis. While solutions for standardized reporting have beenproposed to address the issue, we still lack a benchmarking tool that enablesstandardization and reproducibility, while leveraging cutting-edgeReinforcement Learning (RL) implementations. In this paper, we introduceBenchMARL, the first MARL training library created to enable standardizedbenchmarking across different algorithms, models, and environments. BenchMARLuses TorchRL as its backend, granting it high performance and maintainedstate-of-the-art implementations while addressing the broad community of MARLPyTorch users. Its design enables systematic configuration and reporting, thusallowing users to create and run complex benchmarks from simple one-lineinputs. BenchMARL is open-sourced on GitHub:https://github.com/facebookresearch/BenchMARL