Learning Altruistic Behaviours in Reinforcement Learning without External Rewards

Abstract

Can artificial agents learn to assist others in achieving their goals without knowing what those goals are? Generic reinforcement learning agents could be trained to behave altruistically towards others by rewarding them for altruistic behaviour, i.e., rewarding them for benefiting other agents in a given situation. Such an approach assumes that other agents' goals are known so that the altruistic agent can cooperate in achieving those goals. However, explicit knowledge of other agents' goals is often difficult to acquire. In the case of human agents, their goals and preferences may be difficult to express fully, and may be ambiguous or even contradictory. Thus, it is beneficial to develop agents that do not depend on external supervision and can learn altruistic behaviour in a task-agnostic manner. We propose to act altruistically towards other agents by giving them more choice and thereby allowing them to better achieve their goals. Some concrete examples include opening a door for others or safeguarding them to pursue their objectives without interference. We formalize this concept and propose an altruistic agent that learns to increase the choices another agent has by preferring to maximize the number of states that the other agent can reach in its future. We evaluate our approach on three different multi-agent environments where another agent's success depends on the altruistic agent's behaviour. Finally, we show that our unsupervised agents can perform comparably to agents explicitly trained to work cooperatively, in some cases even outperforming them.
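
To make the idea of "number of states the other agent can reach" concrete, the following is a minimal toy sketch and not the paper's method or estimator. It assumes a small deterministic grid world, a hypothetical helper `reachable_states`, and an arbitrary horizon of 4 moves; all of these names and values are illustrative assumptions. It compares how many cells another agent can reach when a door in the dividing wall is closed versus opened.

```python
from collections import deque


def reachable_states(grid, start, horizon):
    """Count distinct cells reachable from `start` in at most `horizon` moves.

    Hypothetical helper for illustration: `grid` is a list of strings where
    '#' marks a wall and any other character is a free cell.
    """
    rows, cols = len(grid), len(grid[0])
    seen = {start}
    frontier = deque([(start, 0)])
    while frontier:
        (r, c), depth = frontier.popleft()
        if depth == horizon:
            continue  # do not expand beyond the horizon
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] != "#" and (nr, nc) not in seen:
                seen.add((nr, nc))
                frontier.append(((nr, nc), depth + 1))
    return len(seen)


# Assumed toy layouts: a wall splits the grid; the middle cell acts as a door.
DOOR_CLOSED = ["..#..",
               "..#..",
               "..#.."]
DOOR_OPEN = ["..#..",
             ".....",
             "..#.."]

other_agent_start = (1, 0)  # assumed start position of the other agent
horizon = 4                 # assumed look-ahead, in moves

closed_count = reachable_states(DOOR_CLOSED, other_agent_start, horizon)
open_count = reachable_states(DOOR_OPEN, other_agent_start, horizon)
print(closed_count, open_count)
```

In this toy setting, an altruistic agent that prefers the larger count would choose to open the door, without ever being told where the other agent wants to go. In environments too large to enumerate, an exact count like this would have to be replaced by some learned or sampled estimate.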
