Multi-objective Reinforcement Learning: A Tool for Pluralistic Alignment

Abstract

Reinforcement learning (RL) is a valuable tool for the creation of AI systems. However, it may be problematic to adequately align RL based on scalar rewards if there are multiple conflicting values or stakeholders to be considered. Over the last decade, multi-objective reinforcement learning (MORL) using vector rewards has emerged as an alternative to standard, scalar RL. This paper provides an overview of the role that MORL can play in creating pluralistically-aligned AI.
