Human Motion Diffusion Model

  • 2022-09-29 17:27:53
  • Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Amit H. Bermano, Daniel Cohen-Or
  • 490

Abstract

Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it. Therefore, current generative solutions are either low-quality or limited in expressiveness. Diffusion models, which have already shown remarkable generative capabilities in other domains, are promising candidates for human motion due to their many-to-many nature, but they tend to be resource hungry and hard to control. In this paper, we introduce Motion Diffusion Model (MDM), a carefully adapted classifier-free diffusion-based generative model for the human motion domain. MDM is transformer-based, combining insights from motion generation literature. A notable design-choice is the prediction of the sample, rather than the noise, in each diffusion step. This facilitates the use of established geometric losses on the locations and velocities of the motion, such as the foot contact loss. As we demonstrate, MDM is a generic approach, enabling different modes of conditioning, and different generation tasks. We show that our model is trained with lightweight resources and yet achieves state-of-the-art results on leading benchmarks for text-to-motion and action-to-motion. https://guytevet.github.io/mdm-page/
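
To illustrate why predicting the sample rather than the noise makes geometric losses convenient, here is a minimal NumPy sketch. It is not the authors' implementation: the function name `sample_prediction_losses`, the array layout (frames, joints, 3), the foot joint indices, and the binary contact mask are all assumptions made for this example. The point is only that, once the network outputs a clean motion estimate, position, velocity, and foot contact terms can be computed on it directly.

```python
import numpy as np


def mse(a, b):
    """Mean squared error over all elements."""
    return float(np.mean((a - b) ** 2))


def sample_prediction_losses(x0_pred, x0_true, foot_idx, contact):
    """Geometric losses on a predicted clean motion (not on predicted noise).

    x0_pred, x0_true: (frames, joints, 3) joint positions (assumed layout).
    foot_idx:         indices of the foot joints (hypothetical skeleton).
    contact:          (frames - 1, len(foot_idx)) binary mask, 1 where the
                      foot is assumed to be touching the ground.
    """
    # Position term: the predicted sample should match the ground truth.
    pos = mse(x0_pred, x0_true)

    # Velocity term: frame-to-frame differences should match as well.
    vel_pred = x0_pred[1:] - x0_pred[:-1]
    vel_true = x0_true[1:] - x0_true[:-1]
    vel = mse(vel_pred, vel_true)

    # Foot contact term: a foot marked as in contact should not move.
    foot_vel = vel_pred[:, foot_idx, :] * contact[..., None]
    foot = float(np.mean(foot_vel ** 2))

    return {"pos": pos, "vel": vel, "foot": foot}


# Toy usage with random data standing in for a real motion clip.
rng = np.random.default_rng(0)
x0_true = rng.normal(size=(8, 4, 3))
x0_pred = x0_true + 0.1 * rng.normal(size=x0_true.shape)
losses = sample_prediction_losses(x0_pred, x0_true, [2, 3], np.ones((7, 2)))
```

With a noise-predicting network, the same terms would first require converting the predicted noise back into a motion estimate at every step; predicting the sample makes that estimate available directly.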

 
