Abstract
Long-term planning poses a major difficulty for many reinforcement learning algorithms. This problem becomes even more pronounced in dynamic visual environments. In this work we propose Hierarchical Planning and Reinforcement Learning (HIP-RL), a method for merging the benefits and capabilities of Symbolic Planning with the learning abilities of Deep Reinforcement Learning. We apply HIP-RL to the complex visual tasks of interactive question answering and visual semantic planning and achieve state-of-the-art results on three challenging datasets, all while taking fewer steps at test time and training in fewer iterations. Sample results can be found at youtu.be/0TtWJ_0mPfI