ARFlow: Human Action-Reaction Flow Matching with Physical Guidance

  • 2025-06-02 08:31:13
  • Wentao Jiang, Jingya Wang, Kaiyang Ji, Baoxiong Jia, Siyuan Huang, Ye Shi
  • 0

Abstract

Human action-reaction synthesis, a fundamental challenge in modeling causalhuman interactions, plays a critical role in applications ranging from virtualreality to social robotics. While diffusion-based models have demonstratedpromising performance, they exhibit two key limitations for interactionsynthesis: reliance on complex noise-to-reaction generators with intricateconditional mechanisms, and frequent physical violations in generated motions.To address these issues, we propose Action-Reaction Flow Matching (ARFlow), anovel framework that establishes direct action-to-reaction mappings,eliminating the need for complex conditional mechanisms. Our approachintroduces a physical guidance mechanism specifically designed for FlowMatching (FM) that effectively prevents body penetration artifacts duringsampling. Moreover, we discover the bias of traditional flow matching samplingalgorithm and employ a reprojection method to revise the sampling direction ofFM. To further enhance the reaction diversity, we incorporate randomness intothe sampling process. Extensive experiments on NTU120, Chi3D and InterHumandatasets demonstrate that ARFlow not only outperforms existing methods in termsof Fr\'echet Inception Distance and motion diversity but also significantlyreduces body collisions, as measured by our new Intersection Volume andIntersection Frequency metrics.

 

Quick Read (beta)

loading the full paper ...