AGENT: A Benchmark for Core Psychological Reasoning

Abstract

For machine agents to successfully interact with humans in real-worldsettings, they will need to develop an understanding of human mental life.Intuitive psychology, the ability to reason about hidden mental variables thatdrive observable actions, comes naturally to people: even pre-verbal infantscan tell agents from objects, expecting agents to act efficiently to achievegoals given constraints. Despite recent interest in machine agents that reasonabout other agents, it is not clear if such agents learn or hold the corepsychology principles that drive human reasoning. Inspired by cognitivedevelopment studies on intuitive psychology, we present a benchmark consistingof a large dataset of procedurally generated 3D animations, AGENT (Action,Goal, Efficiency, coNstraint, uTility), structured around four scenarios (goalpreferences, action efficiency, unobserved constraints, and cost-rewardtrade-offs) that probe key concepts of core intuitive psychology. We validateAGENT with human-ratings, propose an evaluation protocol emphasizinggeneralization, and compare two strong baselines built on Bayesian inverseplanning and a Theory of Mind neural network. Our results suggest that to passthe designed tests of core intuitive psychology at human levels, a model mustacquire or have built-in representations of how agents plan, combining utilitycomputations and core knowledge of objects and physics.

Quick Read (beta)

loading the full paper ...