Abstract
Adversarial attacks pose significant challenges for vision models in criticalfields like healthcare, where reliability is essential. Although adversarialtraining has been well studied in natural images, its application to biomedicaland microscopy data remains limited. Existing self-supervised adversarialtraining methods overlook the hierarchical structure of histopathology images,where patient-slide-patch relationships provide valuable discriminativesignals. To address this, we propose Hierarchical Self-Supervised AdversarialTraining (HSAT), which exploits these properties to craft adversarial examplesusing multi-level contrastive learning and integrate it into adversarialtraining for enhanced robustness. We evaluate HSAT on multiclass histopathologydataset OpenSRH and the results show that HSAT outperforms existing methodsfrom both biomedical and natural image domains. HSAT enhances robustness,achieving an average gain of 54.31% in the white-box setting and reducingperformance drops to 3-4% in the black-box setting, compared to 25-30% for thebaseline. These results set a new benchmark for adversarial training in thisdomain, paving the way for more robust models. Our Code for training andevaluation is available at https://github.com/HashmatShadab/HSAT.