Abstract
Existing approaches in reinforcement learning train an agent to learn the desired optimal behavior in an environment with rule-based surrounding agents. In safety-critical applications such as autonomous driving, it is crucial that the rule-based agents are modelled properly. Several behavior modelling strategies and IDM models are currently used to model the surrounding agents. We present a learning-based method to derive adversarial behavior for the rule-based agents that causes failure scenarios. We evaluate our adversarial agent against all the rule-based agents and show a decrease in cumulative reward.