MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection

  • 2021-04-06 17:40:31
  • Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao
  • 0

Abstract

Many objects do not appear frequently enough in complex scenes (e.g., certainhandbags in living rooms) for training an accurate object detector, but areoften found frequently by themselves (e.g., in product images). Yet, theseobject-centric images are not effectively leveraged for improving objectdetection in scene-centric images. In this paper, we propose Mosaic ofObject-centric images as Scene-centric images (MosaicOS), a simple and novelframework that is surprisingly effective at tackling the challenges oflong-tailed object detection. Keys to our approach are three-fold: (i) pseudoscene-centric image construction from object-centric images for mitigatingdomain differences, (ii) high-quality bounding box imputation using theobject-centric images' class labels, and (iii) a multi-stage trainingprocedure. On LVIS object detection (and instance segmentation), MosaicOS leadsto a massive 60% (and 23%) relative improvement in average precision for rareobject categories. We also show that our framework can be compatibly used withother existing approaches to achieve even further gains.

 

Quick Read (beta)

loading the full paper ...