InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba

  • 2025-07-23 10:31:35
  • Yuhang Wang, Jun Li, Zhijian Wu, Jifeng Shen, Jianhua Xu, Wankou Yang
  • 0

Abstract

Within the family of convolutional neural networks, InceptionNeXt has shownexcellent competitiveness in image classification and a number of downstreamtasks. Built on parallel one-dimensional strip convolutions, however, itsuffers from limited ability of capturing spatial dependencies along differentdimensions and fails to fully explore spatial modeling in local neighborhood.Besides, inherent locality constraints of convolution operations aredetrimental to effective global context modeling. To overcome theselimitations, we propose a novel backbone architecture termed InceptionMamba inthis study. More specifically, the traditional one-dimensional stripconvolutions are replaced by orthogonal band convolutions in our InceptionMambato achieve cohesive spatial modeling. Furthermore, global contextual modelingcan be achieved via a bottleneck Mamba module, facilitating enhancedcross-channel information fusion and enlarged receptive field. Extensiveevaluations on classification and various downstream tasks demonstrate that theproposed InceptionMamba achieves state-of-the-art performance with superiorparameter and computational efficiency. The source code will be available athttps://github.com/Wake1021/InceptionMamba.

 

Quick Read (beta)

loading the full paper ...