Multi-task Maximum Entropy Inverse Reinforcement Learning

  • 2018-05-22 21:57:34
  • Adam Gleave, Oliver Habryka
  • 3

Abstract

Multi-task Inverse Reinforcement Learning (IRL) is the problem of inferringmultiple reward functions from expert demonstrations. Prior work, built onBayesian IRL, is unable to scale to complex environments due to computationalconstraints. This paper contributes the first formulation of multi-task IRL inthe more computationally efficient Maximum Causal Entropy (MCE) IRL framework.Experiments show our approach can perform one-shot imitation learning in agridworld environment that single-task IRL algorithms require hundreds ofdemonstrations to solve. Furthermore, we outline how our formulation can beapplied to state-of-the-art MCE IRL algorithms such as Guided Cost Learning.This extension, based on meta-learning, could enable multi-task IRL to beperformed for the first time in high-dimensional, continuous state MDPs withunknown dynamics as commonly arise in robotics.

 

Quick Read (beta)

loading the full paper ...