ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero

  • 2019-02-12 17:59:38
  • Yuandong Tian, Jerry Ma, Qucheng Gong, Shubho Sengupta, Zhuoyuan Chen, James Pinkerton, C. Lawrence Zitnick

Abstract

The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are a remarkable demonstration of deep reinforcement learning's capabilities, achieving superhuman performance in the complex game of Go with progressively increasing autonomy. However, many obstacles remain in the understanding of and usability of these promising approaches by the research community. Toward elucidating unresolved mysteries and facilitating future research, we propose ELF OpenGo, an open-source reimplementation of the AlphaZero algorithm. ELF OpenGo is the first open-source Go AI to convincingly demonstrate superhuman performance with a perfect (20:0) record against global top professionals. We apply ELF OpenGo to conduct extensive ablation studies, and to identify and analyze numerous interesting phenomena in both the model training and in the gameplay inference procedures. Our code, models, selfplay datasets, and auxiliary data are publicly available.
