Accelerating the Convergence of Human-in-the-Loop Reinforcement Learning with Counterfactual Explanations

Abstract

The capability to interactively learn from human feedback would enable robotsin new social settings. For example, novice users could train service robots innew tasks naturally and interactively. Human-in-the-loop Reinforcement Learning(HRL) addresses this issue by combining human feedback and reinforcementlearning (RL) techniques. State-of-the-art interactive learning techniquessuffer from slow convergence, thus leading to a frustrating experience for thehuman. This work approaches this problem by extending the existing TAMERFramework with the possibility to enhance human feedback with two differenttypes of counterfactual explanations. We demonstrate our extensions' success inimproving the convergence, especially in the crucial early phases of thetraining.

Quick Read (beta)

loading the full paper ...