Abstract
Multi-agent reinforcement learning has shown promise on a variety ofcooperative tasks as a consequence of recent developments in differentiableinter-agent communication. However, most architectures are limited to pools ofhomogeneous agents, limiting their applicability. Here we propose a modularframework for learning complex tasks in which a traditional monolithic agent isframed as a collection of cooperating heterogeneous agents. We apply thisapproach to model sensorimotor coordination in the neocortex as a multi-agentreinforcement learning problem. Our results demonstrate proof-of-concept of theproposed architecture and open new avenues for learning complex tasks and forunderstanding functional localization in the brain and future intelligentsystems.