Continuous direct yaw moment control systems such as torque-vectoring controllers are an essential part of vehicle stabilization. Such controllers have been extensively researched with the central objective of maintaining vehicle stability by providing a consistent, stable cornering response. Careful tuning of the parameters in a torque-vectoring controller can significantly enhance a vehicle's performance and stability. However, without re-tuning of the parameters, the vehicle fails to maintain stability, especially in extreme driving conditions such as low-friction surfaces or high velocities. In this paper, the utility of Reinforcement Learning (RL) based on Deep Deterministic Policy Gradient (DDPG) as a parameter tuning algorithm for a torque-vectoring controller is presented. It is shown that a torque-vectoring controller with parameter tuning via reinforcement learning performs well across a range of driving environments, e.g., a wide range of friction conditions and different velocities, which highlights the advantages of reinforcement learning as an adaptive algorithm for parameter tuning. Moreover, the robustness of the DDPG algorithm is validated in scenarios that lie beyond the training environment of the reinforcement learning algorithm. The simulations were carried out using a four-wheel vehicle model with nonlinear tire characteristics. We compare our DDPG-based parameter tuning against a genetic algorithm and a conventional trial-and-error tuning of the torque-vectoring controller, and the results demonstrate that the reinforcement learning based parameter tuning significantly improves the stability of the vehicle.