L2PF -- Learning to Prune Faster

Abstract

Various applications in the field of autonomous driving are based on convolutional neural networks (CNNs), especially for processing camera data. The optimization of such CNNs is a major challenge in continuous development. Newly learned features must be brought into vehicles as quickly as possible, and as such, it is not feasible to spend redundant GPU hours during compression. In this context, we present Learning to Prune Faster (L2PF), a multi-task, try-and-learn method that learns a discrete action selecting the redundant filters of the CNN and a continuous action determining how long the layers have to be fine-tuned. This allows us to significantly speed up the convergence process of learning how to find an embedded-friendly filter-wise pruned CNN. For ResNet20, we have achieved a compression ratio of 3.84× with minimal accuracy degradation. Compared to the state-of-the-art pruning method, we reduced the GPU hours by a factor of 1.71.
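
To make the two kinds of action concrete, the following is a minimal sketch and not the authors' implementation. It assumes a plain chain of 3×3 convolutions without bias, batch normalization, or skip connections (so it does not model ResNet20 itself), and the names `PruneAction`, `conv_params`, and `compression_ratio` are hypothetical. Each layer receives a discrete keep/prune mask over its filters and a continuous fine-tuning duration; the compression ratio is the original weight count divided by the weight count after pruning.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PruneAction:
    """Hypothetical per-layer action, named here for illustration only."""
    keep_mask: List[int]      # discrete part: 1 keeps a filter, 0 prunes it
    fine_tune_epochs: float   # continuous part: fine-tuning time after pruning


def conv_params(c_in: int, c_out: int, k: int = 3) -> int:
    """Number of weights in a k x k convolution without bias."""
    return c_in * c_out * k * k


def compression_ratio(channels: List[int], actions: List[PruneAction], k: int = 3) -> float:
    """Original weight count divided by pruned weight count.

    channels = [c_0, c_1, ..., c_n] are the widths of a plain conv chain;
    actions[i] applies to the conv that maps c_i -> c_{i+1}.
    Removing a filter also removes the matching input channel of the next layer.
    """
    assert len(actions) == len(channels) - 1
    kept = [channels[0]]
    for i, action in enumerate(actions):
        assert len(action.keep_mask) == channels[i + 1]
        kept.append(sum(action.keep_mask))
    original = sum(conv_params(channels[i], channels[i + 1], k) for i in range(len(actions)))
    pruned = sum(conv_params(kept[i], kept[i + 1], k) for i in range(len(actions)))
    return original / pruned


# Toy example: keep half of the filters in each of two layers.
channels = [3, 16, 32]
actions = [
    PruneAction(keep_mask=[1] * 8 + [0] * 8, fine_tune_epochs=1.5),
    PruneAction(keep_mask=[1] * 16 + [0] * 16, fine_tune_epochs=0.5),
]
ratio = compression_ratio(channels, actions)
total_fine_tune = sum(a.fine_tune_epochs for a in actions)
```

In this toy setting the fine-tuning durations summed in `total_fine_tune` stand in for the GPU time that a learned continuous action would try to keep small, while `ratio` stands in for the compression objective.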
