Snooping Attacks on Deep Reinforcement Learning

  • 2020-01-15 23:59:17
  • Matthew Inkawhich, Yiran Chen, Hai Li

Abstract

Adversarial attacks have exposed a significant security vulnerability in state-of-the-art machine learning models. These models include deep reinforcement learning agents. The existing methods for attacking reinforcement learning agents assume the adversary has access either to the target agent's learned parameters or to the environment that the agent interacts with. In this work, we propose a new class of threat models, called snooping threat models, that are unique to reinforcement learning. In these snooping threat models, the adversary does not have the ability to interact with the target agent's environment, and can only eavesdrop on the action and reward signals being exchanged between agent and environment. We show that adversaries operating in these highly constrained threat models can still launch devastating attacks against the target agent by training proxy models on related tasks and leveraging the transferability of adversarial examples.

 
