Automata Guided Reinforcement Learning With Demonstrations

  • 2018-09-17 16:17:28
  • Xiao Li, Yao Ma, Calin Belta

Abstract

Tasks with complex temporal structures and long horizons pose a challenge for reinforcement learning agents due to the difficulty in specifying the tasks in terms of reward functions as well as large variances in the learning signals. We propose to address these problems by combining temporal logic (TL) with reinforcement learning from demonstrations. Our method automatically generates intrinsic rewards that align with the overall task goal given a TL task specification. The policy resulting from our framework has an interpretable and hierarchical structure. We validate the proposed method experimentally on a set of robotic manipulation tasks.

 
