Adaloss: Adaptive Loss Function for Landmark Localization

Abstract

Landmark localization is a challenging problem in computer vision with amultitude of applications. Recent deep learning based methods have shownimproved results by regressing likelihood maps instead of regressing thecoordinates directly. However, setting the precision of these regressiontargets during the training is a cumbersome process since it creates atrade-off between trainability vs localization accuracy. Using precise targetsintroduces a significant sampling bias and hence makes the training moredifficult, whereas using imprecise targets results in inaccurate landmarkdetectors. In this paper, we introduce "Adaloss", an objective function thatadapts itself during the training by updating the target precision based on thetraining statistics. This approach does not require setting problem-specificparameters and shows improved stability in training and better localizationaccuracy during inference. We demonstrate the effectiveness of our proposedmethod in three different applications of landmark localization: 1) thechallenging task of precisely detecting catheter tips in medical X-ray images,2) localizing surgical instruments in endoscopic images, and 3) localizingfacial features on in-the-wild images where we show state-of-the-art results onthe 300-W benchmark dataset.

Quick Read (beta)

loading the full paper ...