Abstract
Traditional robust methods in multi-agent reinforcement learning (MARL) oftenstruggle against coordinated adversarial attacks in cooperative scenarios. Toaddress this limitation, we propose the Wolfpack Adversarial Attack framework,inspired by wolf hunting strategies, which targets an initial agent and itsassisting agents to disrupt cooperation. Additionally, we introduce theWolfpack-Adversarial Learning for MARL (WALL) framework, which trains robustMARL policies to defend against the proposed Wolfpack attack by fosteringsystemwide collaboration. Experimental results underscore the devastatingimpact of the Wolfpack attack and the significant robustness improvementsachieved by WALL. Our code is available athttps://github.com/sunwoolee0504/WALL.