Aligning Superhuman AI with Human Behavior: Chess as a Model System

Abstract

As artificial intelligence becomes increasingly intelligent (in some cases achieving superhuman performance), there is growing potential for humans to learn from and collaborate with algorithms. However, the ways in which AI systems approach problems are often different from the ways people do, and thus may be uninterpretable and hard to learn from. A crucial step in bridging this gap between human and artificial intelligence is modeling the granular actions that constitute human behavior, rather than simply matching aggregate human performance. We pursue this goal in a model system with a long history in artificial intelligence: chess. The aggregate performance of a chess player unfolds as they make decisions over the course of a game. The hundreds of millions of games played online by players at every skill level form a rich source of data in which these decisions, and their exact context, are recorded in minute detail. Applying existing chess engines to this data, including an open-source implementation of AlphaZero, we find that they do not predict human moves well. We develop and introduce Maia, a customized version of AlphaZero trained on human chess games, that predicts human moves at a much higher accuracy than existing engines, and can achieve maximum accuracy when predicting decisions made by players at a specific skill level in a tuneable way. For a dual task of predicting whether a human will make a large mistake on the next move, we develop a deep neural network that significantly outperforms competitive baselines. Taken together, our results suggest that there is substantial promise in designing artificial intelligence systems with human collaboration in mind by first accurately modeling granular human decision-making.
