Overload: Latency Attacks on Object Detection for Edge Devices

  • 2024-04-26 18:23:06
  • Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-rung Lee

Abstract

The deployment of deep learning-based applications has become an essential task owing to the increasing demand for intelligent services. In this paper, we investigate latency attacks on deep learning applications. Unlike common adversarial attacks that aim at misclassification, the goal of a latency attack is to increase the inference time, which may stop applications from responding to requests within a reasonable time. This kind of attack is ubiquitous across various applications, and we use object detection to demonstrate how it works. We also design a framework named Overload to generate latency attacks at scale. Our method is based on a newly formulated optimization problem and a novel technique called spatial attention. The attack escalates the computing cost required at inference time and consequently extends the inference time of object detection. It presents a significant threat, especially to systems with limited computing resources. We conducted experiments using YOLOv5 models on Nvidia NX. Compared to existing methods, our method is simpler and more effective. The experimental results show that, under latency attacks, the inference time of a single image can be ten times longer than in the normal setting. Moreover, our findings pose a potential new threat to all object detection tasks requiring non-maximum suppression (NMS), as our attack is NMS-agnostic.
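
To illustrate why the number of candidate boxes matters for inference time, the following is a minimal sketch of textbook greedy NMS that counts its IoU evaluations. It is not the Overload framework and not the YOLOv5 implementation; the function names (`iou`, `greedy_nms`, `grid_boxes`) and the grid of non-overlapping boxes are hypothetical, chosen only to show a worst case in which nothing is suppressed and the work grows quadratically with the number of boxes.

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Textbook greedy NMS. Returns (kept indices, number of IoU evaluations)."""
    # Visit boxes from highest to lowest confidence score.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept, comparisons = [], 0
    while order:
        best = order.pop(0)
        kept.append(best)
        survivors = []
        for i in order:
            comparisons += 1
            # Keep a box only if it does not overlap the current best too much.
            if iou(boxes[best], boxes[i]) <= iou_threshold:
                survivors.append(i)
        order = survivors
    return kept, comparisons


def grid_boxes(n_side, size=10.0, gap=5.0):
    """Hypothetical worst case: n_side * n_side boxes that never overlap."""
    step = size + gap
    return [
        (c * step, r * step, c * step + size, r * step + size)
        for r in range(n_side)
        for c in range(n_side)
    ]


if __name__ == "__main__":
    for n_side in (2, 10, 30):
        boxes = grid_boxes(n_side)
        scores = [1.0 - 1e-4 * i for i in range(len(boxes))]
        kept, comparisons = greedy_nms(boxes, scores)
        print(len(boxes), "boxes ->", len(kept), "kept,", comparisons, "IoU evaluations")
```

When no box suppresses any other, n candidates cost n(n-1)/2 IoU evaluations in this sketch, so inputs that make a detector emit many surviving candidates inflate the post-processing cost, which is the kind of effect a latency attack exploits.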

 
