Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions

  • 2024-10-16 18:48:50
  • Zhenyu Jiang, Yuqi Xie, Jinhan Li, Ye Yuan, Yifeng Zhu, Yuke Zhu
  • 0

Abstract

Humanoid robots, with their human-like embodiment, have the potential tointegrate seamlessly into human environments. Critical to their coexistence andcooperation with humans is the ability to understand natural languagecommunications and exhibit human-like behaviors. This work focuses ongenerating diverse whole-body motions for humanoid robots from languagedescriptions. We leverage human motion priors from extensive human motiondatasets to initialize humanoid motions and employ the commonsense reasoningcapabilities of Vision Language Models (VLMs) to edit and refine these motions.Our approach demonstrates the capability to produce natural, expressive, andtext-aligned humanoid motions, validated through both simulated and real-worldexperiments. More videos can be found athttps://ut-austin-rpl.github.io/Harmon/.

 

Quick Read (beta)

loading the full paper ...