DeTrigger: A Gradient-Centric Approach to Backdoor Attack Mitigation in Federated Learning

Abstract

Federated Learning (FL) enables collaborative model training acrossdistributed devices while preserving local data privacy, making it ideal formobile and embedded systems. However, the decentralized nature of FL also opensvulnerabilities to model poisoning attacks, particularly backdoor attacks,where adversaries implant trigger patterns to manipulate model predictions. Inthis paper, we propose DeTrigger, a scalable and efficient backdoor-robustfederated learning framework that leverages insights from adversarial attackmethodologies. By employing gradient analysis with temperature scaling,DeTrigger detects and isolates backdoor triggers, allowing for precise modelweight pruning of backdoor activations without sacrificing benign modelknowledge. Extensive evaluations across four widely used datasets demonstratethat DeTrigger achieves up to 251x faster detection than traditional methodsand mitigates backdoor attacks by up to 98.9%, with minimal impact on globalmodel accuracy. Our findings establish DeTrigger as a robust and scalablesolution to protect federated learning environments against sophisticatedbackdoor threats.

Quick Read (beta)

loading the full paper ...