Abstract
Simulating hair dynamics that generalize across arbitrary hairstyles, body shapes, and motions is a critical challenge. Our novel two-stage neural solution is the first to leverage Transformer-based architectures for such broad generalization. We propose a Transformer-powered static network that predicts static draped shapes for any hairstyle, effectively resolving hair-body penetrations and preserving hair fidelity. Subsequently, a dynamic network with a novel cross-attention mechanism fuses static hair features with kinematic input to generate expressive dynamics and complex secondary motions. This dynamic network also allows for efficient fine-tuning on challenging motion sequences, such as abrupt head movements. Our method offers real-time inference for both static single-frame drapes and dynamic drapes over pose sequences. Guided by physics-informed losses, it produces high-fidelity, generalizable hair dynamics across various styles and can resolve penetrations even for complex, unseen long hairstyles, highlighting its broad generalization.
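As a rough illustration of how cross-attention can fuse two feature streams, the following minimal sketch computes standard scaled dot-product cross-attention in plain Python. It is not the network described here: the choice of static hair features as queries and kinematic features as keys and values is an assumption, the function names `softmax` and `cross_attention` are hypothetical, and the learned projection matrices of a real Transformer layer are omitted.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention without learned projections.

    queries: list of vectors (assumed here: static hair feature tokens)
    keys, values: lists of vectors (assumed here: kinematic input tokens)
    Returns one fused vector per query.
    """
    d = len(keys[0])  # key dimension used for scaling
    fused = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        fused.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return fused

# Hypothetical toy inputs: one hair token attending over two kinematic tokens.
hair_tokens = [[1.0, 0.0]]
kin_keys = [[0.0, 0.0], [0.0, 0.0]]
kin_values = [[2.0, 4.0], [4.0, 8.0]]
fused_tokens = cross_attention(hair_tokens, kin_keys, kin_values)
```

In this toy case the keys are identical, so the attention weights are uniform and the fused vector is simply the mean of the two value vectors.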