Skull-stripping induces shortcut learning in MRI-based Alzheimer's disease classification

Abstract

Objectives: High classification accuracy of Alzheimer's disease (AD) fromstructural MRI has been achieved using deep neural networks, yet the specificimage features contributing to these decisions remain unclear. In this study,the contributions of T1-weighted (T1w) gray-white matter texture, volumetricinformation, and preprocessing -- particularly skull-stripping -- weresystematically assessed. Methods: A dataset of 990 matched T1w MRIs from AD patients and cognitivelynormal controls from the ADNI database were used. Preprocessing was variedthrough skull-stripping and intensity binarization to isolate texture and shapecontributions. A 3D convolutional neural network was trained on eachconfiguration, and classification performance was compared using exact McNemartests with discrete Bonferroni-Holm correction. Feature relevance was analyzedusing Layer-wise Relevance Propagation, image similarity metrics, and spectralclustering of relevance maps. Results: Despite substantial differences in image content, classificationaccuracy, sensitivity, and specificity remained stable across preprocessingconditions. Models trained on binarized images preserved performance,indicating minimal reliance on gray-white matter texture. Instead, volumetricfeatures -- particularly brain contours introduced through skull-stripping --were consistently used by the models. Conclusions: This behavior reflects a shortcut learning phenomenon, wherepreprocessing artifacts act as potentially unintended cues. The resultingClever Hans effect emphasizes the critical importance of interpretability toolsto reveal hidden biases and to ensure robust and trustworthy deep learning inmedical imaging.

Quick Read (beta)

loading the full paper ...