Abstract
Current research on decision-making in safety-critical scenarios often relieson inefficient data-driven scenario generation or specific modeling approaches,which fail to capture corner cases in real-world contexts. To address thisissue, we propose a Red-Team Multi-Agent Reinforcement Learning framework,where background vehicles with interference capabilities are treated asred-team agents. Through active interference and exploration, red-team vehiclescan uncover corner cases outside the data distribution. The framework uses aConstraint Graph Representation Markov Decision Process, ensuring that red-teamvehicles comply with safety rules while continuously disrupting the autonomousvehicles (AVs). A policy threat zone model is constructed to quantify thethreat posed by red-team vehicles to AVs, inducing more extreme actions toincrease the danger level of the scenario. Experimental results show that theproposed framework significantly impacts AVs decision-making safety andgenerates various corner cases. This method also offers a novel direction forresearch in safety-critical scenarios.