Novel View Synthesis (NVS) is concerned with the generation of novel views of a scene from one or more source images. NVS requires explicit reasoning about 3D object structure and unseen parts of the scene. As a result, current approaches rely on supervised training with either 3D models or multiple target images. We present Unsupervised Continuous Object Representation Networks (UniCORN), which encode the geometry and appearance of a 3D scene using a neural 3D representation. Our model is trained with only two source images per object, requiring no ground truth 3D models or target view supervision. Despite being unsupervised, UniCORN achieves comparable results to the state-of-the-art on challenging tasks, including novel view synthesis and single-view 3D reconstruction.