Attaining Interpretability in Reinforcement Learning via Hierarchical Primitive Composition

Abstract

Deep reinforcement learning has proven effective in various applications and offers a promising direction for solving highly complex tasks. Most reinforcement learning algorithms, however, face two major issues: sample inefficiency and the limited interpretability of the learned policy. The former arises when the environment is sparsely rewarded and/or poses a long-term credit assignment problem, while the latter becomes a problem when learned policies are deployed in customer-facing products. In this paper, we propose a novel hierarchical reinforcement learning algorithm that mitigates both issues by decomposing the original task into a hierarchy and by compounding pretrained primitives with intents. We show how the proposed scheme can be employed in practice by solving a pick-and-place task with a 6-DoF manipulator.
