Abstract
Recent years have seen significant advancements in image restoration, largely attributed to the development of modern deep neural networks, such as CNNs and Transformers. However, existing restoration backbones often face a dilemma between global receptive fields and efficient computation, hindering their application in practice. Recently, the Selective Structured State Space Model, especially the improved version Mamba, has shown great potential for long-range dependency modeling with linear complexity, which offers a way to resolve the above dilemma. However, the standard Mamba still faces certain challenges in low-level vision, such as local pixel forgetting and channel redundancy. In this work, we introduce a simple but effective baseline, named MambaIR, which adds both local enhancement and channel attention to improve the vanilla Mamba. In this way, our MambaIR takes advantage of the local pixel similarity and reduces the channel redundancy. Extensive experiments demonstrate the superiority of our method; for example, MambaIR outperforms SwinIR by up to 0.45dB on image super-resolution (SR), using similar computational cost but with a global receptive field. Code is available at \url{https://github.com/csguoh/MambaIR}.