Abstract
This work provides a thorough study on how reward scaling can affectperformance of deep reinforcement learning agents. In particular, we would liketo answer the question that how does reward scaling affect non-saturating ReLUnetworks in RL? This question matters because ReLU is one of the most effectiveactivation functions for deep learning models. We also propose an AdaptiveNetwork Scaling framework to find a suitable scale of the rewards duringlearning for better performance. We conducted empirical studies to justify thesolution.
Quick Read (beta)
loading the full paper ...