Enabling Robots to Communicate their Objectives

Abstract

The overarching goal of this work is to efficiently enable end-users tocorrectly anticipate a robot's behavior in novel situations. Since a robot'sbehavior is often a direct result of its underlying objective function, ourinsight is that end-users need to have an accurate mental model of thisobjective function in order to understand and predict what the robot will do.While people naturally develop such a mental model over time through observingthe robot act, this familiarization process may be lengthy. Our approachreduces this time by having the robot model how people infer objectives fromobserved behavior, and then it selects those behaviors that are maximallyinformative. The problem of computing a posterior over objectives from observedbehavior is known as Inverse Reinforcement Learning (IRL), and has been appliedto robots learning human objectives. We consider the problem where the roles ofhuman and robot are swapped. Our main contribution is to recognize that unlikerobots, humans will not be exact in their IRL inference. We thus introduce twofactors to define candidate approximate-inference models for human learning inthis setting, and analyze them in a user study in the autonomous drivingdomain. We show that certain approximate-inference models lead to the robotgenerating example behaviors that better enable users to anticipate what itwill do in novel situations. Our results also suggest, however, that additionalresearch is needed in modeling how humans extrapolate from examples of robotbehavior.

Quick Read (beta)

loading the full paper ...