Diversity-Driven Extensible Hierarchical Reinforcement Learning

Abstract

Hierarchical reinforcement learning (HRL) has recently shown promising advances in speeding up learning, improving exploration, and discovering intertask transferable skills. Most recent works focus on HRL with two levels, i.e., a master policy manipulates subpolicies, which in turn manipulate primitive actions. However, HRL with multiple levels is usually needed in many real-world scenarios, whose ultimate goals are highly abstract while their actions are very primitive. Therefore, in this paper, we propose a diversity-driven extensible HRL (DEHRL), where an extensible and scalable framework is built and learned levelwise to realize HRL with multiple levels. DEHRL follows a popular assumption: diverse subpolicies are useful, i.e., subpolicies are believed to be more useful if they are more diverse. However, existing implementations of this diversity assumption usually have their own drawbacks, which make them inapplicable to HRL with multiple levels. Consequently, we further propose a novel diversity-driven solution to achieve this assumption in DEHRL. Experimental studies evaluate DEHRL against five baselines from four perspectives in two domains; the results show that DEHRL outperforms the state-of-the-art baselines in all four aspects.
