Active Learning for Deep Detection Neural Networks

Abstract

The cost of drawing object bounding boxes (i.e., labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative for improving the accuracy of the detection network. In this paper, we propose a method to perform active learning of object detectors based on convolutional neural networks. We propose a new image-level scoring process to rank unlabeled images for their automatic selection, which clearly outperforms classical scores. The proposed method can be applied to videos and sets of still images. In the former case, temporal selection rules can complement our scoring process. As a relevant use case, we extensively study the performance of our method on the task of pedestrian detection. Overall, the experiments show that the proposed method performs better than random selection. Our code is publicly available at www.gitlab.com/haghdam/deep_active_learning.
