High-Accuracy Model-Based Reinforcement Learning, a Survey

  • 2021-07-17 14:01:05
  • Aske Plaat, Walter Kosters, Mike Preuss
  • 2


Deep reinforcement learning has shown remarkable success in the past fewyears. Highly complex sequential decision making problems from game playing androbotics have been solved with deep model-free methods. Unfortunately, thesample complexity of model-free methods is often high. To reduce the number ofenvironment samples, model-based reinforcement learning creates an explicitmodel of the environment dynamics. Achieving high model accuracy is a challengein high-dimensional problems. In recent years, a diverse landscape ofmodel-based methods has been introduced to improve model accuracy, usingmethods such as uncertainty modeling, model-predictive control, latent models,and end-to-end learning and planning. Some of these methods succeed inachieving high accuracy at low sample complexity, most do so either in arobotics or in a games context. In this paper, we survey these methods; weexplain in detail how they work and what their strengths and weaknesses are. Weconclude with a research agenda for future work to make the methods more robustand more widely applicable to other applications.


Quick Read (beta)

loading the full paper ...