ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models

  • 2018-10-31 08:00:32
  • Yueh-Hua Wu, Fan-Yun Sun, Yen-Yu Chang, Shou-De Lin
This work provides a thorough study of how reward scaling can affect the performance of deep reinforcement learning agents. In particular, we seek to answer the question: how does reward scaling affect non-saturating ReLU networks in RL? This question matters because ReLU is one of the most effective activation functions for deep learning models. We also propose an Adaptive Network Scaling (ANS) framework that finds a suitable scale for the rewards during learning to improve performance. We conducted empirical studies to justify the solution.


