Rank-1 Convolutional Neural Network

Abstract

In this paper, we propose a convolutional neural network(CNN) with 3-D rank-1filters which are composed by the outer product of 1-D filters. After beingtrained, the 3-D rank-1 filters can be decomposed into 1-D filters in the testtime for fast inference. The reason that we train 3-D rank-1 filters in thetraining stage instead of consecutive 1-D filters is that a better gradientflow can be obtained with this setting, which makes the training possible evenin the case where the network with consecutive 1-D filters cannot be trained.The 3-D rank-1 filters are updated by both the gradient flow and the outerproduct of the 1-D filters in every epoch, where the gradient flow tries toobtain a solution which minimizes the loss function, while the outer productoperation tries to make the parameters of the filter to live on a rank-1sub-space. Furthermore, we show that the convolution with the rank-1 filtersresults in low rank outputs, constraining the final output of the CNN also tolive on a low dimensional subspace.

Quick Read (beta)

loading the full paper ...