Convolutional Networks with Adaptive Computation Graphs

Abstract

Do convolutional networks really need a fixed feed-forward structure? Often,a neural network is already confident after a few layers about the high-levelconcept shown in the image. However, due to the fixed network structure, allremaining layers still need to be evaluated. What if the network could jumpright to a layer that is specialized in fine-grained differences of the image'scontent? In this work, we propose Adanets, a family of convolutional networkswith adaptive computation graphs. Following a high-level structure similar toresidual networks (Resnets), the key difference is that for each layer a gatingfunction determines whether to execute the layer or move on to the next one. Inexperiments on CIFAR-10 and ImageNet we demonstrate that Adanets efficientlyallocate computational budget among layers and learn distinct layersspecializing in similar categories. Adanet 50 achieves a top 5 error rate of7.94% on ImageNet using 30% fewer computations than Resnet 34, which onlyachieves 8.58%. Lastly, we study the effect of adaptive computation graphs onthe susceptibility towards adversarial examples. We observe that Adanets show ahigher robustness towards adversarial attacks, complementing other defensessuch as JPEG compression.

Quick Read (beta)

loading the full paper ...