Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation

Abstract

Achieving faster execution with shorter compilation time can enable furtherdiversity and innovation in neural networks. However, the current paradigm ofexecuting neural networks either relies on hand-optimized libraries,traditional compilation heuristics, or very recently, simulated annealing andgenetic algorithms. Our work takes a unique approach by formulating compileroptimizations for neural networks as a reinforcement learning problem, whosesolution takes fewer steps to converge. This solution, dubbed ReLeASE, comeswith a sampling algorithm that leverages clustering to focus the costly samples(real hardware measurements) on representative points, subsuming an entiresubspace. Our adaptive sampling not only reduces the number of samples, butalso improves the quality of samples for better exploration in shorter time. Assuch, experimentation with real hardware shows that reinforcement learning withadaptive sampling provides 4.45x speed up in optimization time over AutoTVM,while also improving inference time of the modern deep networks by 5.6%.Further experiments also confirm that our adaptive sampling can even improveAutoTVM's simulated annealing by 4.00x.

Quick Read (beta)

loading the full paper ...