Abstract
Developing a reinforcement learning (RL) agent often involves identifyingvalues for numerous parameters, covering the policy, reward function,environment, and agent-internal architecture. Since these parameters areinterrelated in complex ways, optimizing them is a black-box problem thatproves especially challenging for nonexperts. Although existingoptimization-as-a-service platforms (e.g., Vizier and Optuna) can handle suchproblems, they are impractical for RL systems, since the need for manual usermapping of each parameter to distinct components makes the effort cumbersome.It also requires understanding of the optimization process, limiting thesystems' application beyond the machine learning field and restricting accessin areas such as cognitive science, which models human decision-making. Totackle these challenges, the paper presents AgentForge, a flexible low-codeplatform to optimize any parameter set across an RL system. Available athttps://github.com/feferna/AgentForge, it allows an optimization problem to bedefined in a few lines of code and handed to any of the interfaced optimizers.With AgentForge, the user can optimize the parameters either individually orjointly. The paper presents an evaluation of its performance for a challengingvision-based RL problem.