Abstract
In this paper we design and evaluate a Deep-Reinforcement Learning agent thatoptimizes routing. Our agent adapts automatically to current traffic conditionsand proposes tailored configurations that attempt to minimize the networkdelay. Experiments show very promising performance. Moreover, this approachprovides important operational advantages with respect to traditionaloptimization algorithms.
Quick Read (beta)
loading the full paper ...