Abstract
Multi-agent formation as well as obstacle avoidance is one of the mostactively studied topics in the field of multi-agent systems. Although someclassic controllers like model predictive control (MPC) and fuzzy controlachieve a certain measure of success, most of them require precise globalinformation which is not accessible in harsh environments. On the other hand,some reinforcement learning (RL) based approaches adopt the leader-followerstructure to organize different agents' behaviors, which sacrifices thecollaboration between agents thus suffering from bottlenecks in maneuverabilityand robustness. In this paper, we propose a distributed formation and obstacleavoidance method based on multi-agent reinforcement learning (MARL). Agents inour system only utilize local and relative information to make decisions andcontrol themselves distributively. Agent in the multi-agent system willreorganize themselves into a new topology quickly in case that any of them isdisconnected. Our method achieves better performance regarding formation error,formation convergence rate and on-par success rate of obstacle avoidancecompared with baselines (both classic control methods and another RL-basedmethod). The feasibility of our method is verified by both simulation andhardware implementation with Ackermann-steering vehicles.