Abstract
Ensuring the safety of autonomous vehicles (AVs) requires identifying rare but critical failure cases that on-road testing alone cannot discover. High-fidelity simulations provide a scalable alternative, but automatically generating realistic and diverse traffic scenarios that can effectively stress-test AV motion planners remains a key challenge. This paper introduces CRASH (Challenging Reinforcement-learning based Adversarial scenarios for Safety Hardening), an adversarial deep reinforcement learning framework that addresses this issue. First, CRASH can control adversarial Non-Player Character (NPC) agents in an AV simulator to automatically induce collisions with the Ego vehicle, falsifying its motion planner. We also propose a novel approach, which we term safety hardening, that iteratively refines the motion planner by simulating improvement scenarios against adversarial agents, leveraging the failure cases to strengthen the AV stack. CRASH is evaluated on a simplified two-lane highway scenario, demonstrating its ability to falsify both rule-based and learning-based planners with collision rates exceeding 90%. Additionally, safety hardening reduces the Ego vehicle's collision rate by 26%. While preliminary, these results highlight RL-based safety hardening as a promising approach for scenario-driven simulation testing for autonomous vehicles.