Abstract
Stackelberg games and their resulting equilibria have received increasing attention in the multi-agent reinforcement learning literature. Each stage of a traditional Stackelberg game involves one or more leaders acting first, followed by the followers. In situations where the roles of leaders and followers can be interchanged, the designated role can confer considerable advantages, for example, in first-mover advantage settings. The question then arises: who should be the leader, and when? A bias in the leader selection process can lead to unfair outcomes. This problem is aggravated if the agents are self-interested and care only about their own goals and rewards. We formally define this leader selection problem and show its relation to fairness in agents' returns. Furthermore, we propose a multi-agent reinforcement learning framework that maximizes fairness by integrating mediators. Mediators have previously been used in the simultaneous action setting with varying levels of control, such as directly performing agents' actions or just recommending them. Our framework integrates mediators in the Stackelberg setting with minimal control (leader selection). We show that the presence of mediators leads to self-interested agents taking fair actions, resulting in higher overall fairness in agents' returns.