Federated Multi-armed Bandits with Personalization

  • 2021-02-25 18:59:43
  • Chengshuai Shi, Cong Shen, Jing Yang

Abstract

A general framework of personalized federated multi-armed bandits (PF-MAB) is proposed, which is a new bandit paradigm analogous to the federated learning (FL) framework in supervised learning and enjoys the features of FL with personalization. Under the PF-MAB framework, a mixed bandit learning problem that flexibly balances generalization and personalization is studied. A lower bound analysis for the mixed model is presented. We then propose the Personalized Federated Upper Confidence Bound (PF-UCB) algorithm, where the exploration length is chosen carefully to achieve the desired balance of learning the local model and supplying global information for the mixed learning objective. Theoretical analysis proves that PF-UCB achieves an $O(\log(T))$ regret regardless of the degree of personalization, and has a similar instance dependency as the lower bound. Experiments using both synthetic and real-world datasets corroborate the theoretical analysis and demonstrate the effectiveness of the proposed algorithm.
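
To make the idea of a mixed objective concrete, the following is a minimal sketch and not the PF-UCB algorithm itself. It assumes, as an illustration only, that each client's mixed mean reward is a convex combination of its local mean and the average of all clients' means, weighted by a hypothetical personalization parameter `alpha` in [0, 1]. The abstract does not state this form; the function names and the example numbers are likewise invented for the illustration.

```python
def mixed_means(local_means, alpha):
    """Blend each client's local arm means with the cross-client average.

    local_means: list of per-client lists, local_means[m][k] is the mean
                 reward of arm k at client m (illustrative values only).
    alpha:       assumed personalization weight; 1.0 is fully local,
                 0.0 is fully global.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    num_clients = len(local_means)
    num_arms = len(local_means[0])
    # Global mean of each arm: the average of that arm's local means.
    global_means = [
        sum(row[k] for row in local_means) / num_clients
        for k in range(num_arms)
    ]
    return [
        [alpha * row[k] + (1.0 - alpha) * global_means[k] for k in range(num_arms)]
        for row in local_means
    ]


def best_arms(means):
    """Index of the highest-mean arm for each client."""
    return [max(range(len(row)), key=row.__getitem__) for row in means]


# Two clients, two arms, with conflicting local preferences.
local = [[0.9, 0.1], [0.2, 0.6]]
for alpha in (0.0, 0.5, 1.0):
    print(alpha, best_arms(mixed_means(local, alpha)))
```

Under this assumed weighting, `alpha = 0` makes every client target the same globally best arm, while larger values let each client's optimal arm drift toward its own local optimum. This is the kind of balance between generalization and personalization that the abstract describes.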
