Randomized Prior Functions for Deep Reinforcement Learning

  • 2018-06-08 19:47:54
  • Ian Osband, John Aslanides, Albin Cassirer
  • 18

Abstract

Dealing with uncertainty is essential for efficient reinforcement learning.There is a growing literature on uncertainty estimation for deep learning fromfixed datasets, but many of the most popular approaches are poorly-suited tosequential decision problems. Other methods, such as bootstrap sampling, haveno mechanism for uncertainty that does not come from the observed data. Wehighlight why this can be a crucial shortcoming and propose a simple remedythrough addition of a randomized untrainable `prior' network to each ensemblemember. We prove that this approach is efficient with linear representations,provide simple illustrations of its efficacy with nonlinear representations andshow that this approach scales to large-scale problems far better than previousattempts.

 

Quick Read (beta)

loading the full paper ...