LaND: Learning to Navigate from Disengagements

Abstract

Consistently testing autonomous mobile robots in real world scenarios is anecessary aspect of developing autonomous navigation systems. Each time thehuman safety monitor disengages the robot's autonomy system due to the robotperforming an undesirable maneuver, the autonomy developers gain insight intohow to improve the autonomy system. However, we believe that thesedisengagements not only show where the system fails, which is useful fortroubleshooting, but also provide a direct learning signal by which the robotcan learn to navigate. We present a reinforcement learning approach forlearning to navigate from disengagements, or LaND. LaND learns a neural networkmodel that predicts which actions lead to disengagements given the currentsensory observation, and then at test time plans and executes actions thatavoid disengagements. Our results demonstrate LaND can successfully learn tonavigate in diverse, real world sidewalk environments, outperforming bothimitation learning and reinforcement learning approaches. Videos, code, andother material are available on our websitehttps://sites.google.com/view/sidewalk-learning

Quick Read (beta)

loading the full paper ...