A Comprehensive Analysis of Deep Regression

  • 2020-09-24 15:10:03
  • Stéphane Lathuilière, Pablo Mesejo, Xavier Alameda-Pineda, Radu Horaud

Abstract

Deep learning revolutionized data science, and recently its popularity has grown exponentially, as did the number of papers employing deep networks. Vision tasks, such as human pose estimation, did not escape this trend. There is a large number of deep models, where small changes in the network architecture, or in the data pre-processing, together with the stochastic nature of the optimization procedures, produce notably different results, making it extremely difficult to sift out methods that significantly outperform others. This situation motivates the current study, in which we perform a systematic evaluation and statistical analysis of vanilla deep regression, i.e. convolutional neural networks with a linear regression top layer. This is the first comprehensive analysis of deep regression techniques. We perform experiments on four vision problems, and report confidence intervals for the median performance as well as the statistical significance of the results, if any. Surprisingly, the variability due to different data pre-processing procedures generally eclipses the variability due to modifications in the network architecture. Our results reinforce the hypothesis according to which, in general, a general-purpose network (e.g. VGG-16 or ResNet-50) adequately tuned can yield results close to the state-of-the-art without having to resort to more complex and ad-hoc regression models.

 
