Abstract
Robustness of deep neural networks to input noise remains a criticalchallenge, as naive noise injection often degrades accuracy on clean(uncorrupted) data. We propose a novel training framework that addresses thistrade-off through two complementary objectives. First, we introduce a lossfunction applied at the penultimate layer that explicitly enforces intra-classcompactness and increases the margin to analytically defined decisionboundaries. This enhances feature discriminativeness and class separability forclean data. Second, we propose a class-wise feature alignment mechanism thatbrings noisy data clusters closer to their clean counterparts. Furthermore, weprovide a theoretical analysis demonstrating that improving feature stabilityunder additive Gaussian noise implicitly reduces the curvature of the softmaxloss landscape in input space, as measured by Hessian eigenvalues.This thusnaturally enhances robustness without explicit curvature penalties. Conversely,we also theoretically show that lower curvatures lead to more robust models. Wevalidate the effectiveness of our method on standard benchmarks and our customdataset. Our approach significantly reinforces model robustness to variousperturbations while maintaining high accuracy on clean data, advancing theunderstanding and practice of noise-robust deep learning.