QUOTA: The Quantile Option Architecture for Reinforcement Learning

  • 2018-11-07 19:52:47
  • Shangtong Zhang, Borislav Mavrin, Linglong Kong, Bo Liu, Hengshuai Yao
  • 0

Abstract

In this paper, we propose the Quantile Option Architecture (QUOTA) forexploration based on recent advances in distributional reinforcement learning(RL). In QUOTA, decision making is based on quantiles of a value distribution,not only the mean. QUOTA provides a new dimension for exploration via makinguse of both optimism and pessimism of a value distribution. We demonstrate theperformance advantage of QUOTA in both challenging video games and physicalrobot simulators.

 

Introduction (beta)

None

 

Conclusion (beta)

None