Complementary Meta-Reinforcement Learning for Fault-Adaptive Control

Abstract

Faults are endemic to all systems. Adaptive fault-tolerant control maintainsdegraded performance when faults occur as opposed to unsafe conditions orcatastrophic events. In systems with abrupt faults and strict time constraints,it is imperative for control to adapt quickly to system changes to maintainsystem operations. We present a meta-reinforcement learning approach thatquickly adapts its control policy to changing conditions. The approach buildsupon model-agnostic meta learning (MAML). The controller maintains a complementof prior policies learned under system faults. This "library" is evaluated on asystem after a new fault to initialize the new policy. This contrasts withMAML, where the controller derives intermediate policies anew, sampled from adistribution of similar systems, to initialize a new policy. Our approachimproves sample efficiency of the reinforcement learning process. We evaluateour approach on an aircraft fuel transfer system under abrupt faults.

Quick Read (beta)

loading the full paper ...