Abstract
We present a traffic simulation named DeepTraffic where the planning systems for a subset of the vehicles are handled by a neural network as part of a model-free, off-policy reinforcement learning process. The primary goal of DeepTraffic is to make the hands-on study of deep reinforcement learning accessible to thousands of students, educators, and researchers in order to inspire and fuel the exploration and evaluation of deep Q-learning network variants and hyperparameter configurations through large-scale, open competition. This paper investigates the crowd-sourced hyperparameter tuning of the policy network that resulted from the first iteration of the DeepTraffic competition, where thousands of participants actively searched through the hyperparameter space.