IRLAS: Inverse Reinforcement Learning for Architecture Search

Abstract

In this paper, we propose an inverse reinforcement learning method for architecture search (IRLAS), which trains an agent to search for network structures that are topologically inspired by human-designed networks. Most existing architecture search approaches entirely neglect the topological characteristics of architectures, which results in complicated architectures with high inference latency. Motivated by the fact that human-designed networks are elegant in topology and fast at inference, we propose a mirror stimuli function, inspired by biological cognition theory, to extract the abstract topological knowledge of an expert human-designed network (ResNeXt). To avoid imposing too strong a prior over the search space, we introduce inverse reinforcement learning to train the mirror stimuli function and exploit it as heuristic guidance for architecture search; this guidance generalizes easily to different architecture search algorithms. On CIFAR-10, the best architecture found by IRLAS achieves a 2.60% error rate. In the ImageNet mobile setting, our model achieves a state-of-the-art top-1 accuracy of 75.28% while being 2-4x faster than most auto-generated architectures. A fast version of this model is 10% faster than MobileNetV2 while maintaining higher accuracy.
