Abstract
Mobile robots navigating in crowds trained using reinforcement learning areknown to suffer performance degradation when faced with out-of-distributionscenarios. We propose that by properly accounting for the uncertainties ofpedestrians, a robot can learn safe navigation policies that are robust todistribution shifts. Our method augments agent observations with predictionuncertainty estimates generated by adaptive conformal inference, and it usesthese estimates to guide the agent's behavior through constrained reinforcementlearning. The system helps regulate the agent's actions and enables it to adaptto distribution shifts. In the in-distribution setting, our approach achieves a96.93% success rate, which is over 8.80% higher than the previousstate-of-the-art baselines with over 3.72 times fewer collisions and 2.43 timesfewer intrusions into ground-truth human future trajectories. In threeout-of-distribution scenarios, our method shows much stronger robustness whenfacing distribution shifts in velocity variations, policy changes, andtransitions from individual to group dynamics. We deploy our method on a realrobot, and experiments show that the robot makes safe and robust decisions wheninteracting with both sparse and dense crowds. Our code and videos areavailable on https://gen-safe-nav.github.io/.