Abstract
Adversarial attacks and robustness in Deep Reinforcement Learning (DRL) havebeen widely studied in various threat models; however, few considerenvironmental state perturbations, which are natural in embodied scenarios. Toimprove the robustness of DRL agents, we formulate the problem of environmentalstate perturbation, introducing a preliminary non-targeted attack method as acalibration adversary, and then propose a defense framework, named BoostedAdversarial Training (BAT), which first tunes the agents via supervisedlearning to avoid catastrophic failure and subsequently adversarially trainsthe agent with reinforcement learning. Extensive experimental resultssubstantiate the vulnerability of mainstream agents under environmental stateperturbations and the effectiveness of our proposed attack. The defense resultsdemonstrate that while existing robust reinforcement learning algorithms maynot be suitable, our BAT framework can significantly enhance the robustness ofagents against environmental state perturbations across various situations.