Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning

Abstract

Reinforcement Learning (RL) has demonstrated state-of-the-art results in anumber of autonomous system applications, however many of the underlyingalgorithms rely on black-box predictions. This results in poor explainabilityof the behaviour of these systems, raising concerns as to their use insafety-critical applications. Recent work has demonstrated thatuncertainty-aware models exhibit more cautious behaviours through theincorporation of model uncertainty estimates. In this work, we build onProbabilistic Backpropagation to introduce a fully Bayesian Recurrent NeuralNetwork architecture. We apply this within a Safe RL scenario, and demonstratethat the proposed method significantly outperforms a popular approach forobtaining model uncertainties in collision avoidance tasks. Furthermore, wedemonstrate that the proposed approach requires less training and is far moreefficient than the current leading method, both in terms of compute resourceand memory footprint.

Quick Read (beta)

loading the full paper ...