Abstract
We present RigAnything, a novel autoregressive transformer-based model that makes 3D assets rig-ready by probabilistically generating joints and skeleton topologies and assigning skinning weights in a template-free manner. Unlike most existing auto-rigging methods, which rely on predefined skeleton templates and are limited to specific categories such as humanoids, RigAnything approaches the rigging problem in an autoregressive manner, iteratively predicting the next joint based on the global input shape and the previous prediction. While autoregressive models are typically used to generate sequential data, RigAnything extends their application to effectively learn and represent skeletons, which are inherently tree structures. To achieve this, we organize the joints in breadth-first search (BFS) order, enabling the skeleton to be defined as a sequence of 3D locations and parent indices. Furthermore, our model improves the accuracy of position prediction by leveraging diffusion modeling, ensuring precise and consistent placement of joints within the hierarchy. This formulation allows the autoregressive model to efficiently capture both spatial and hierarchical relationships within the skeleton. Trained end-to-end on both RigNet and Objaverse datasets, RigAnything demonstrates state-of-the-art performance across diverse object types, including humanoids, quadrupeds, marine creatures, insects, and many more, surpassing prior methods in quality, robustness, generalizability, and efficiency. Please check our website for more details: https://www.liuisabella.com/RigAnything.
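To illustrate the BFS-ordered representation described in the abstract, the following is a minimal sketch of how a skeleton tree could be flattened into a sequence of 3D locations and parent indices. It is not the authors' implementation: the function name `bfs_serialize`, the dictionary-based input layout, the toy joint names, and the convention of giving the root a parent index of -1 are all assumptions made for this example.

```python
from collections import deque

def bfs_serialize(positions, children, root):
    """Flatten a skeleton tree into BFS order.

    positions: dict mapping joint name -> (x, y, z)
    children:  dict mapping joint name -> list of child joint names
    Returns a list of (position, parent_index) pairs, where parent_index
    points into the returned list. The root gets -1 (assumed convention).
    """
    sequence = []
    queue = deque([(root, -1)])
    while queue:
        joint, parent_idx = queue.popleft()
        idx = len(sequence)  # index this joint receives in the sequence
        sequence.append((positions[joint], parent_idx))
        for child in children.get(joint, []):
            queue.append((child, idx))
    return sequence

# Hypothetical toy skeleton, used only for illustration.
positions = {
    "hips": (0.0, 1.0, 0.0),
    "spine": (0.0, 1.3, 0.0),
    "l_leg": (-0.1, 0.5, 0.0),
    "r_leg": (0.1, 0.5, 0.0),
    "head": (0.0, 1.7, 0.0),
}
children = {"hips": ["spine", "l_leg", "r_leg"], "spine": ["head"]}

seq = bfs_serialize(positions, children, "hips")
parent_indices = [parent for _, parent in seq]
```

In this ordering every joint's parent index is smaller than the joint's own index, so a parent always appears earlier in the sequence than its children. That property is what lets a next-joint predictor refer only to joints it has already produced.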