Parametric models of humans, faces, hands and animals have been widely usedfor a range of tasks such as image-based reconstruction, shape correspondenceestimation, and animation. Their key strength is the ability to factor surfacevariations into shape and pose dependent components. Learning such modelsrequires lots of expert knowledge and hand-defined object-specific constraints,making the learning approach unscalable to novel objects. In this paper, wepresent a simple yet effective approach to learn disentangled shape and poserepresentations in an unsupervised setting. We use a combination ofself-consistency and cross-consistency constraints to learn pose and shapespace from registered meshes. We additionally incorporate as-rigid-as-possibledeformation(ARAP) into the training loop to avoid degenerate solutions. Wedemonstrate the usefulness of learned representations through a number of tasksincluding pose transfer and shape retrieval. The experiments on datasets of 3Dhumans, faces, hands and animals demonstrate the generality of our approach.Code is made available athttps://virtualhumans.mpi-inf.mpg.de/unsup_shape_pose/.