Deep Reinforcement Learning in System Optimization

Abstract

The recent advancements in deep reinforcement learning have opened newhorizons and opportunities to tackle various problems in system optimization.Such problems are generally tailored to delayed, aggregated, and sequentialrewards, which is an inherent behavior in the reinforcement learning setting,where an agent collects rewards while exploring and exploiting the environmentto maximize the long term reward. However, in some cases, it is not clear whydeep reinforcement learning is a good fit for the problem. Sometimes, it doesnot perform better than the state-of-the-art solutions. And in other cases,random search or greedy algorithms could outperform deep reinforcementlearning. In this paper, we review, discuss, and evaluate the recent trends ofusing deep reinforcement learning in system optimization. We propose a set ofessential metrics to guide future works in evaluating the efficacy of usingdeep reinforcement learning in system optimization. Our evaluation includeschallenges, the types of problems, their formulation in the deep reinforcementlearning setting, embedding, the model used, efficiency, and robustness. Weconclude with a discussion on open challenges and potential directions forpushing further the integration of reinforcement learning in systemoptimization.

Quick Read (beta)

loading the full paper ...