UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection

  • 2024-08-14 11:06:39
  • Mu Chen, Minghan Chen, Yi Yang
  • 0

Abstract

This paper focuses on Human-Object Interaction (HOI) detection, addressingthe challenge of identifying and understanding the interactions between humansand objects within a given image or video frame. Spearheaded by DetectionTransformer (DETR), recent developments lead to significant improvements byreplacing traditional region proposals by a set of learnable queries. However,despite the powerful representation capabilities provided by Transformers,existing Human-Object Interaction (HOI) detection methods still yield lowconfidence levels when dealing with complex interactions and are prone tooverlooking interactive actions. To address these issues, we propose a novelapproach \textsc{UAHOI}, Uncertainty-aware Robust Human-Object InteractionLearning that explicitly estimates prediction uncertainty during the trainingprocess to refine both detection and interaction predictions. Our model notonly predicts the HOI triplets but also quantifies the uncertainty of thesepredictions. Specifically, we model this uncertainty through the variance ofpredictions and incorporate it into the optimization objective, allowing themodel to adaptively adjust its confidence threshold based on predictionvariance. This integration helps in mitigating the adverse effects of incorrector ambiguous predictions that are common in traditional methods without anyhand-designed components, serving as an automatic confidence threshold. Ourmethod is flexible to existing HOI detection methods and demonstrates improvedaccuracy. We evaluate \textsc{UAHOI} on two standard benchmarks in the field:V-COCO and HICO-DET, which represent challenging scenarios for HOI detection.Through extensive experiments, we demonstrate that \textsc{UAHOI} achievessignificant improvements over existing state-of-the-art methods, enhancing boththe accuracy and robustness of HOI detection.

 

Quick Read (beta)

loading the full paper ...