Regularized Inverse Reinforcement Learning

Abstract

Inverse Reinforcement Learning (IRL) aims to facilitate a learner's abilityto imitate expert behavior by acquiring reward functions that explain theexpert's decisions. Regularized IRL applies convex regularizers to thelearner's policy in order to avoid the expert's behavior being rationalized byarbitrary constant rewards, also known as degenerate solutions. We proposeanalytical solutions, and practical methods to obtain them, for regularizedIRL. Current methods are restricted to the maximum-entropy IRL framework,limiting them to Shannon-entropy regularizers, as well as proposingfunctional-form solutions that are generally intractable. We presenttheoretical backing for our proposed IRL method's applicability to bothdiscrete and continuous controls and empirically validate its performance on avariety of tasks.

Quick Read (beta)

loading the full paper ...