Learning Fairness in Multi-Agent Systems

Abstract

Fairness is essential for human society, contributing to stability and productivity. Similarly, fairness is key for many multi-agent systems. Incorporating fairness into multi-agent learning could help multi-agent systems become both efficient and stable. However, learning efficiency and fairness simultaneously is a complex, multi-objective, joint-policy optimization problem. To tackle these difficulties, we propose FEN, a novel hierarchical reinforcement learning model. We first decompose fairness for each agent and propose a fair-efficient reward that each agent learns its own policy to optimize. To avoid multi-objective conflict, we design a hierarchy consisting of a controller and several sub-policies, where the controller maximizes the fair-efficient reward by switching among the sub-policies, which provide diverse behaviors to interact with the environment. FEN can be trained in a fully decentralized way, making it easy to deploy in real-world applications. Empirically, we show that FEN easily learns both fairness and efficiency and significantly outperforms baselines in a variety of multi-agent scenarios.
