Tracking Objects as Points

  • 2020-04-02 17:58:40
  • Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl
  • 122

Abstract

Tracking has traditionally been the art of following interest points throughspace and time. This changed with the rise of powerful deep networks. Nowadays,tracking is dominated by pipelines that perform object detection followed bytemporal association, also known as tracking-by-detection. In this paper, wepresent a simultaneous detection and tracking algorithm that is simpler,faster, and more accurate than the state of the art. Our tracker, CenterTrack,applies a detection model to a pair of images and detections from the priorframe. Given this minimal input, CenterTrack localizes objects and predictstheir associations with the previous frame. That's it. CenterTrack is simple,online (no peeking into the future), and real-time. It achieves 67.3% MOTA onthe MOT17 challenge at 22 FPS and 89.4% MOTA on the KITTI tracking benchmark at15 FPS, setting a new state of the art on both datasets. CenterTrack is easilyextended to monocular 3D tracking by regressing additional 3D attributes. Usingmonocular video input, it achieves 28.3% [email protected] on the newly releasednuScenes 3D tracking benchmark, substantially outperforming the monocularbaseline on this benchmark while running at 28 FPS.

 

Quick Read (beta)

loading the full paper ...