CoMotion: Concurrent Multi-person 3D Motion

  • 2025-04-16 16:40:15
  • Alejandro Newell, Peiyun Hu, Lahav Lipson, Stephan R. Richter, Vladlen Koltun
  • 0

Abstract

We introduce an approach for detecting and tracking detailed 3D poses ofmultiple people from a single monocular camera stream. Our system maintainstemporally coherent predictions in crowded scenes filled with difficult posesand occlusions. Our model performs both strong per-frame detection and alearned pose update to track people from frame to frame. Rather than matchdetections across time, poses are updated directly from a new input image,which enables online tracking through occlusion. We train on numerous image andvideo datasets leveraging pseudo-labeled annotations to produce a model thatmatches state-of-the-art systems in 3D pose estimation accuracy while beingfaster and more accurate in tracking multiple people through time. Code andweights are provided at https://github.com/apple/ml-comotion

 

Quick Read (beta)

loading the full paper ...