Abstract
Infrared small target detection (IRSTD) remains a long-standing challenge incomplex backgrounds due to low signal-to-clutter ratios (SCR), diverse targetmorphologies, and the absence of distinctive visual cues. While recent deeplearning approaches aim to learn discriminative representations, the intrinsicvariability and weak priors of small targets often lead to unstableperformance. In this paper, we propose a novel end-to-end IRSTD framework,termed LRRNet, which leverages the low-rank property of infrared imagebackgrounds. Inspired by the physical compressibility of cluttered scenes, ourapproach adopts a compression--reconstruction--subtraction (CRS) paradigm todirectly model structure-aware low-rank background representations in the imagedomain, without relying on patch-based processing or explicit matrixdecomposition. To the best of our knowledge, this is the first work to directlylearn low-rank background structures using deep neural networks in anend-to-end manner. Extensive experiments on multiple public datasetsdemonstrate that LRRNet outperforms 38 state-of-the-art methods in terms ofdetection accuracy, robustness, and computational efficiency. Remarkably, itachieves real-time performance with an average speed of 82.34 FPS. Evaluationson the challenging NoisySIRST dataset further confirm the model's resilience tosensor noise. The source code will be made publicly available upon acceptance.