Abstract
Advantages of deep learning over traditional methods have been demonstratedfor radio signal classification in the recent years. However, variousresearchers have discovered that even a small but intentional featureperturbation known as adversarial examples can significantly deteriorate theperformance of the deep learning based radio signal classification. Amongvarious kinds of adversarial examples, universal adversarial perturbation hasgained considerable attention due to its feature of being data independent,hence as a practical strategy to fool the radio signal classification with ahigh success rate. Therefore, in this paper, we investigate a defense systemcalled neural rejection system to propose against universal adversarialperturbations, and evaluate its performance by generating white-box universaladversarial perturbations. We show that the proposed neural rejection system isable to defend universal adversarial perturbations with significantly higheraccuracy than the undefended deep neural network.