Implicit Quantile Networks for Distributional Reinforcement Learning

  • 2018-06-14 14:28:37
  • Will Dabney, Georg Ostrovski, David Silver, RĂ©mi Munos
  • 92

Abstract

In this work, we build on recent advances in distributional reinforcementlearning to give a generally applicable, flexible, and state-of-the-artdistributional variant of DQN. We achieve this by using quantile regression toapproximate the full quantile function for the state-action returndistribution. By reparameterizing a distribution over the sample space, thisyields an implicitly defined return distribution and gives rise to a largeclass of risk-sensitive policies. We demonstrate improved performance on the 57Atari 2600 games in the ALE, and use our algorithm's implicitly defineddistributions to study the effects of risk-sensitive policies in Atari games.

 

Quick Read (beta)

loading the full paper ...