Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning

Abstract

Reinforcement Learning algorithms can learn complex behavioral patterns for sequential decision making tasks wherein an agent interacts with an environment and acquires feedback in the form of rewards sampled from it. Traditionally, such algorithms make decisions, i.e., select actions to execute, at every single time step of the agent-environment interactions. In this paper, we propose a novel framework, Fine Grained Action Repetition (FiGAR), which enables the agent to decide the action as well as the time scale of repeating it. FiGAR can be used for improving any Deep Reinforcement Learning algorithm which maintains an explicit policy estimate by enabling temporal abstractions in the action space. We empirically demonstrate the efficacy of our framework by showing performance improvements on top of three policy search algorithms in different domains: Asynchronous Advantage Actor Critic in the Atari 2600 domain, Trust Region Policy Optimization in the MuJoCo domain and Deep Deterministic Policy Gradients in the TORCS car racing domain.
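
The core idea of choosing both an action and how long to repeat it can be illustrated with a minimal sketch. This is not the paper's implementation: the toy environment `ToyEnv`, the helper `figar_step`, and the set of allowed repetition counts are all hypothetical, and the sketch assumes the action and the repetition count are drawn from two separate categorical distributions that are given as plain probability lists rather than learned by a network.

```python
import random


class ToyEnv:
    """Hypothetical 1-D environment used only for illustration.

    The state is an integer position; the episode ends once the
    position reaches the goal.
    """

    def __init__(self, goal=10):
        self.goal = goal
        self.pos = 0

    def reset(self):
        self.pos = 0
        return self.pos

    def step(self, action):
        # action is a signed step size, e.g. +1 or -1
        self.pos += action
        done = self.pos >= self.goal
        reward = 1.0 if done else -0.01
        return self.pos, reward, done


def sample_categorical(probs, rng):
    """Draw an index according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def figar_step(env, action_probs, repeat_probs, actions, repeats, rng):
    """One decision step: pick an action and a repetition count, then
    execute the action that many times (stopping early if the episode ends).

    Assumption: the two choices are sampled independently from
    separate distributions for the current state.
    """
    action = actions[sample_categorical(action_probs, rng)]
    count = repeats[sample_categorical(repeat_probs, rng)]

    total_reward = 0.0
    state, done, executed = None, False, 0
    for _ in range(count):
        state, reward, done = env.step(action)
        total_reward += reward
        executed += 1
        if done:
            break
    return state, total_reward, done, action, executed


if __name__ == "__main__":
    rng = random.Random(0)
    env = ToyEnv(goal=10)
    env.reset()
    done = False
    while not done:
        state, ret, done, action, executed = figar_step(
            env,
            action_probs=[0.8, 0.2],      # hypothetical policy over actions
            repeat_probs=[0.2, 0.3, 0.5],  # hypothetical policy over repeats
            actions=[+1, -1],
            repeats=[1, 2, 4],
            rng=rng,
        )
        print(f"action={action:+d} repeated {executed}x -> state={state}")
```

In this sketch the agent makes one decision per call to `figar_step` instead of one per environment step, which is the kind of temporal abstraction in the action space that the abstract describes.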
