Control with Distributed Deep Reinforcement Learning: Learn a Better Policy

Abstract

Distributed approach is a very effective method to improve trainingefficiency of reinforcement learning. In this paper, we propose a new heuristicdistributed architecture for deep reinforcement learning (DRL) algorithm, inwhich a PSO based network update mechanism is adopted to speed up learning anoptimal policy besides using multiple agents for parallel training. In thismechanism, the update of neural network of each agent is not only according tothe training result of itself, but also affected by the optimal neural networkof all agents. In order to verify the effectiveness of the proposed method, theproposed architecture is implemented on the Deep Q-Network algorithm (DQN) andthe Deep Deterministic Policy Gradient algorithm (DDPG) to train severaltypical control problems. The training results show that the proposed method iseffective.

Quick Read (beta)

loading the full paper ...