Behaviour Suite for Reinforcement Learning

Abstract

This paper introduces the Behaviour Suite for Reinforcement Learning, orbsuite for short. bsuite is a collection of carefully-designed experiments thatinvestigate core capabilities of reinforcement learning (RL) agents with twoobjectives. First, to collect clear, informative and scalable problems thatcapture key issues in the design of general and efficient learning algorithms.Second, to study agent behaviour through their performance on these sharedbenchmarks. To complement this effort, we open sourcegithub.com/deepmind/bsuite, which automates evaluation and analysis of anyagent on bsuite. This library facilitates reproducible and accessible researchon the core issues in RL, and ultimately the design of superior learningalgorithms. Our code is Python, and easy to use within existing projects. Weinclude examples with OpenAI Baselines, Dopamine as well as new referenceimplementations. Going forward, we hope to incorporate more excellentexperiments from the research community, and commit to a periodic review ofbsuite from a committee of prominent researchers.

Quick Read (beta)

loading the full paper ...