Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Abstract

We propose a method to learn 3D deformable object categories from rawsingle-view images, without external supervision. The method is based on anautoencoder that factors each input image into depth, albedo, viewpoint andillumination. In order to disentangle these components without supervision, weuse the fact that many object categories have, at least in principle, asymmetric structure. We show that reasoning about illumination allows us toexploit the underlying object symmetry even if the appearance is not symmetricdue to shading. Furthermore, we model objects that are probably, but notcertainly, symmetric by predicting a symmetry probability map, learnedend-to-end with the other components of the model. Our experiments show thatthis method can recover very accurately the 3D shape of human faces, cat facesand cars from single-view images, without any supervision or a prior shapemodel. On benchmarks, we demonstrate superior accuracy compared to anothermethod that uses supervision at the level of 2D image correspondences.

Quick Read (beta)

loading the full paper ...