Reinforcement learning has steadily improved and outperform human in lots oftraditional games since the resurgence of deep neural network. However, thesesuccess is not easy to be copied to autonomous driving because the state spacesin real world are extreme complex and action spaces are continuous and finecontrol is required. Moreover, the autonomous driving vehicles must also keepfunctional safety under the complex environments. To deal with thesechallenges, we first adopt the deep deterministic policy gradient (DDPG)algorithm, which has the capacity to handle complex state and action spaces incontinuous domain. We then choose The Open Racing Car Simulator (TORCS) as ourenvironment to avoid physical damage. Meanwhile, we select a set of appropriatesensor information from TORCS and design our own rewarder. In order to fit DDPGalgorithm to TORCS, we design our network architecture for both actor andcritic inside DDPG paradigm. To demonstrate the effectiveness of our model, Weevaluate on different modes in TORCS and show both quantitative and qualitativeresults.