Abstract
Reinforcement Learning and the Evolutionary Strategy are two major approachesin addressing complicated control problems. Both are strong contenders and havetheir own devotee communities. Both groups have been very active in developingnew advances in their own domain and devising, in recent years, leading-edgetechniques to address complex continuous control tasks. Here, in the context ofDeep Reinforcement Learning, we formulate a parallelized version of theProximal Policy Optimization method and a Deep Deterministic Policy Gradientmethod. Moreover, we conduct a thorough comparison between the state-of-the-arttechniques in both camps fro continuous control; evolutionary methods and DeepReinforcement Learning methods. The results show there is no consistent winner.