ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models

Abstract

This work provides a thorough study of how reward scaling can affect the performance of deep reinforcement learning agents. In particular, we ask: how does reward scaling affect non-saturating ReLU networks in RL? This question matters because ReLU is one of the most effective activation functions for deep learning models. We also propose an Adaptive Network Scaling (ANS) framework that finds a suitable scale for the rewards during learning to improve performance. We conducted empirical studies to justify the solution.
