Approximated Behavioral Metric-based State Projection for Federated Reinforcement Learning

Abstract

Federated reinforcement learning (FRL) methods usually share encrypted local state or policy information, helping each client learn from others while preserving everyone's privacy. In this work, we propose that sharing an approximated behavior metric-based state projection function is a promising way to enhance the performance of FRL while also providing effective protection of sensitive information. We introduce FedRAG, an FRL framework that learns a computationally practical projection function of states for each client and aggregates the parameters of the projection functions at a central server. The FedRAG approach shares no sensitive task-specific information, yet provides information gain for each client. We conduct extensive experiments on the DeepMind Control Suite to demonstrate insightful results.
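
To make the server-side step concrete, the following is a minimal sketch of how a central server might combine the projection-function parameters uploaded by the clients. The abstract does not state the aggregation rule, so weighted parameter averaging (in the style of federated averaging) is an assumption here, as are the function name `aggregate_projection_params`, the flat-list parameter layout, and the optional per-client weights. It is an illustration, not the FedRAG implementation.

```python
from typing import Dict, List, Optional

# Hypothetical layout: each client uploads its projection-function
# parameters as a mapping from parameter name to a flat list of floats.
Params = Dict[str, List[float]]


def aggregate_projection_params(
    client_params: List[Params],
    weights: Optional[List[float]] = None,
) -> Params:
    """Weighted average of per-client projection parameters.

    Assumption: averaging is used as the aggregation rule; the abstract
    only says that parameters are aggregated at a central server.
    """
    if not client_params:
        raise ValueError("need at least one client")
    if weights is None:
        # Default to equal weighting across clients.
        weights = [1.0] * len(client_params)
    if len(weights) != len(client_params):
        raise ValueError("one weight per client is required")
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    aggregated: Params = {}
    for name, reference in client_params[0].items():
        acc = [0.0] * len(reference)
        for params, weight in zip(client_params, weights):
            values = params[name]
            if len(values) != len(reference):
                raise ValueError(f"shape mismatch for parameter {name!r}")
            for i, value in enumerate(values):
                acc[i] += (weight / total) * value
        aggregated[name] = acc
    return aggregated


# Only projection parameters are passed in; no states, rewards, or
# policies appear in the server-side computation.
clients = [
    {"w": [1.0, 2.0], "b": [0.0]},
    {"w": [3.0, 4.0], "b": [2.0]},
]
global_params = aggregate_projection_params(clients)
```

In this sketch the server would send `global_params` back to every client, which would then continue local training of its projection function from the shared parameters.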
