Abstract
Three-dimensional human body models are widely used in the analysis of human pose and motion. Existing models, however, are learned from minimally-clothed humans and thus do not capture the complexity of dressed humans in common images and videos. To address this, we learn a generative 3D mesh model of clothing from 3D scans of people with varying pose. Going beyond previous work, our generative model is conditioned on different clothing types, giving the ability to dress different body shapes in a variety of clothing. To do so, we train a conditional Mesh-VAE-GAN on clothing displacements from a 3D SMPL body model. This generative clothing model enables us to sample various types of clothing, in novel poses, on top of SMPL. With a focus on clothing geometry, the model captures both global shape and local structure, effectively extending the SMPL model to add clothing. To our knowledge, this is the first conditional VAE-GAN that works on 3D meshes. For clothing specifically, it is the first such model that directly dresses 3D human body meshes and generalizes to different poses.