SCL: Towards Accurate Domain Adaptive Object Detection via Gradient Detach Based Stacked Complementary Losses

Abstract

Unsupervised domain adaptive object detection aims to learn a robust detector under domain shift, where the training (source) domain is label-rich with bounding box annotations, while the testing (target) domain is label-agnostic and the feature distributions of the training and testing domains are dissimilar or even totally different. In this paper, we propose a gradient detach based stacked complementary losses (SCL) method that uses the detection objective (cross entropy and smooth L1 regression) as the primary objective and inserts several auxiliary losses at different network stages to utilize information from the complement data (target images), which can be effective in adapting model parameters to both source and target domains. A gradient detach operation is applied between the detection and context sub-networks, which have different objectives, to force the networks to learn more discriminative representations. We argue that conventional training with the primary objective mainly leverages information from the source domain for maximizing likelihood and ignores the complement data in the shallow layers of the network, which leads to insufficient integration across domains. Our proposed method is thus a more syncretic adaptation learning process. We conduct comprehensive experiments on seven datasets, and the results demonstrate that our method outperforms state-of-the-art methods by a large margin. For instance, from Cityscapes to FoggyCityscapes, we achieve 37.9% mAP, outperforming the previous Strong-Weak method by 3.6%.
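As a rough illustration of the loss structure described in the abstract, the Python sketch below computes a detection objective from cross entropy and smooth L1 regression and then adds auxiliary losses on top. The abstract does not say how the losses are combined, so the weighted sum, the function names (`cross_entropy`, `smooth_l1`, `stacked_loss`), the `beta` threshold, and the `aux_weights` parameter are all assumptions made for illustration. The gradient detach operation is not modeled here.

```python
import math


def cross_entropy(logits, target_index):
    """Softmax cross entropy for one example, computed stably via log-sum-exp."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target_index]


def smooth_l1(pred, target, beta=1.0):
    """Smooth L1 averaged over coordinates: quadratic below beta, linear above."""
    total = 0.0
    for p, t in zip(pred, target):
        d = abs(p - t)
        total += 0.5 * d * d / beta if d < beta else d - 0.5 * beta
    return total / len(pred)


def stacked_loss(det_loss, aux_losses, aux_weights):
    """Primary detection loss plus a weighted sum of auxiliary losses.

    The weighted-sum form is an assumption, not taken from the paper.
    """
    if len(aux_losses) != len(aux_weights):
        raise ValueError("aux_losses and aux_weights must have the same length")
    return det_loss + sum(w * l for w, l in zip(aux_weights, aux_losses))


# Hypothetical values: one classification target and one box regression target.
det = cross_entropy([2.0, 0.5, -1.0], 0) + smooth_l1(
    [0.1, 0.2, 0.9, 1.1], [0.0, 0.0, 1.0, 1.0]
)
# Hypothetical auxiliary loss values from three network stages, equally weighted.
total = stacked_loss(det, [0.7, 0.4, 0.3], [1.0, 1.0, 1.0])
```

In this sketch the auxiliary terms only add to the scalar objective. In an actual detector each auxiliary loss would be computed from target-domain features at its own network stage.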
