Modeling Visual Context is Key to Augmenting Object Detection Datasets

  • 2018-07-19 13:50:42
  • Nikita Dvornik, Julien Mairal, Cordelia Schmid
  • 5

Abstract

Performing data augmentation for learning deep neural networks is well knownto be important for training visual recognition systems. By artificiallyincreasing the number of training examples, it helps reducing overfitting andimproves generalization. For object detection, classical approaches for dataaugmentation consist of generating images obtained by basic geometricaltransformations and color changes of original training images. In this work, wego one step further and leverage segmentation annotations to increase thenumber of object instances present on training data. For this approach to besuccessful, we show that modeling appropriately the visual context surroundingobjects is crucial to place them in the right environment. Otherwise, we showthat the previous strategy actually hurts. With our context model, we achievesignificant mean average precision improvements when few labeled examples areavailable on the VOC'12 benchmark.

 

Quick Read (beta)

loading the full paper ...