Mastering Complex Control in MOBA Games with Deep Reinforcement Learning

  • 2020-01-03 02:39:43
  • Deheng Ye, Zhao Liu, Mingfei Sun, Bei Shi, Peilin Zhao, Hao Wu, Hongsheng Yu, Shaojie Yang, Xipeng Wu, Qingwei Guo, Qiaobo Chen, Yinyuting Yin, Hao Zhang, Tengfei Shi, Liang Wang, Qiang Fu, Wei Yang, Lanxiao Huang

Abstract

We study the reinforcement learning problem of complex action control in Multi-player Online Battle Arena (MOBA) 1v1 games. This problem involves far more complicated state and action spaces than those of traditional 1v1 games, such as Go and the Atari series, which makes it very difficult to search for policies with human-level performance. In this paper, we present a deep reinforcement learning framework that tackles this problem from the perspectives of both system and algorithm. Our system has low coupling and high scalability, which enables efficient exploration at large scale. Our algorithm includes several novel strategies, namely control dependency decoupling, action mask, target attention, and dual-clip PPO, with which our proposed actor-critic network can be effectively trained in our system. Tested on the MOBA game Honor of Kings, the trained AI agents can defeat top professional human players in full 1v1 games.

 
