Abstract
We present a novel approach to hierarchical reinforcement learning calledHierarchical Actor-Critic (HAC). HAC aims to make learning tasks with sparsebinary rewards more efficient by enabling agents to learn how to break downtasks from scratch. The technique uses of a set of actor-critic networks thatlearn to decompose tasks into a hierarchy of subgoals. We demonstrate that HACsignificantly improves sample efficiency in a series of tasks that involvesparse binary rewards and require behavior over a long time horizon.
Quick Read (beta)
loading the full paper ...