Gated Path Planning Networks

  • 2018-06-17 16:32:52
  • Lisa Lee, Emilio Parisotto, Devendra Singh Chaplot, Eric Xing, Ruslan Salakhutdinov
  • 36

Abstract

Value Iteration Networks (VINs) are effective differentiable path planningmodules that can be used by agents to perform navigation while stillmaintaining end-to-end differentiability of the entire architecture. Despitetheir effectiveness, they suffer from several disadvantages including traininginstability, random seed sensitivity, and other optimization problems. In thiswork, we reframe VINs as recurrent-convolutional networks which demonstratesthat VINs couple recurrent convolutions with an unconventional max-poolingactivation. From this perspective, we argue that standard gated recurrentupdate equations could potentially alleviate the optimization issues plaguingVIN. The resulting architecture, which we call the Gated Path Planning Network,is shown to empirically outperform VIN on a variety of metrics such as learningspeed, hyperparameter sensitivity, iteration count, and even generalization.Furthermore, we show that this performance gap is consistent across differentmaze transition types, maze sizes and even show success on a challenging 3Denvironment, where the planner is only provided with first-person RGB images.

 

Quick Read (beta)

loading the full paper ...