Abstract
A multiagent sequential decision problem has been seen in many criticalapplications including urban transportation, autonomous driving cars, militaryoperations, etc. Its widely known solution, namely multiagent reinforcementlearning, has evolved tremendously in recent years. Among them, the solutionparadigm of modeling other agents attracts our interest, which is differentfrom traditional value decomposition or communication mechanisms. It enablesagents to understand and anticipate others' behaviors and facilitates theircollaboration. Inspired by recent research on the legibility that allows agentsto reveal their intentions through their behavior, we propose a multiagentactive legibility framework to improve their performance. Thelegibility-oriented framework allows agents to conduct legible actions so as tohelp others optimise their behaviors. In addition, we design a series ofproblem domains that emulate a common scenario and best characterize thelegibility in multiagent reinforcement learning. The experimental resultsdemonstrate that the new framework is more efficient and costs less trainingtime compared to several multiagent reinforcement learning algorithms.