Estimating Body and Hand Motion in an Ego-sensed World

  • 2024-10-04 18:59:57
  • Brent Yi, Vickie Ye, Maya Zheng, Lea Müller, Georgios Pavlakos, Yi Ma, Jitendra Malik, Angjoo Kanazawa
  • 0

Abstract

We present EgoAllo, a system for human motion estimation from a head-mounteddevice. Using only egocentric SLAM poses and images, EgoAllo guides samplingfrom a conditional diffusion model to estimate 3D body pose, height, and handparameters that capture the wearer's actions in the allocentric coordinateframe of the scene. To achieve this, our key insight is in representation: wepropose spatial and temporal invariance criteria for improving modelperformance, from which we derive a head motion conditioning parameterizationthat improves estimation by up to 18%. We also show how the bodies estimated byour system can improve the hands: the resulting kinematic and temporalconstraints result in over 40% lower hand estimation errors compared to noisymonocular estimates. Project page: https://egoallo.github.io/

 

Quick Read (beta)

loading the full paper ...