Abstract
We present VAPOR, a novel method for autonomous legged robot navigation in unstructured, densely vegetated outdoor environments using offline Reinforcement Learning (RL). Our method trains a novel RL policy using an actor-critic network and arbitrary data collected in real outdoor vegetation. Our policy uses height and intensity-based cost maps derived from 3D LiDAR point clouds, a goal cost map, and processed proprioception data as state inputs, and learns the physical and geometric properties of the surrounding obstacles such as height, density, and solidity/stiffness. The fully trained policy's critic network is then used to evaluate the quality of dynamically feasible velocities generated from a novel context-aware planner. Our planner adapts the robot's velocity space based on the presence of entrapment-inducing vegetation and narrow passages in dense environments. We demonstrate our method's capabilities on a Spot robot in complex real-world outdoor scenes, including dense vegetation. We observe that VAPOR's actions improve success rates by up to 40%, decrease the average current consumption by up to 2.9%, and decrease the normalized trajectory length by up to 11.2% compared to existing end-to-end offline RL and other outdoor navigation methods.
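The idea of using a trained critic to rank planner-generated velocities can be illustrated with a minimal sketch. It is not the VAPOR implementation: the function names, the uniform velocity grid, the dictionary state, and the toy critic are all assumptions made for illustration. In the method described above, the critic is a learned network and the velocity limits would be adapted by the context-aware planner.

```python
from typing import Callable, Dict, List, Tuple

Velocity = Tuple[float, float]  # (linear m/s, angular rad/s)


def candidate_velocities(v_max: float, w_max: float,
                         n_v: int = 5, n_w: int = 7) -> List[Velocity]:
    """Sample a uniform grid over a hypothetical feasible velocity space.

    v_max and w_max stand in for the limits a context-aware planner
    might shrink near narrow passages or entrapping vegetation.
    """
    vs = [v_max * i / (n_v - 1) for i in range(n_v)]
    ws = [-w_max + 2 * w_max * j / (n_w - 1) for j in range(n_w)]
    return [(v, w) for v in vs for w in ws]


def select_velocity(state: Dict[str, float],
                    critic: Callable[[Dict[str, float], Velocity], float],
                    candidates: List[Velocity]) -> Velocity:
    """Return the candidate with the highest critic score."""
    return max(candidates, key=lambda action: critic(state, action))


def toy_critic(state: Dict[str, float], action: Velocity) -> float:
    """Placeholder for a learned Q-function (assumption, not the paper's).

    Rewards forward speed and penalizes turning away from the goal heading.
    """
    v, w = action
    return v - abs(w - state["goal_heading"])


candidates = candidate_velocities(v_max=1.0, w_max=1.0)
best = select_velocity({"goal_heading": 0.0}, toy_critic, candidates)
print(best)
```

With the toy critic and a goal straight ahead, the selected command is the fastest straight-line candidate. Swapping `toy_critic` for a trained network's forward pass would leave the selection step unchanged.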