Abstract
Existing methods for relightable view synthesis -- using a set of images ofan object under unknown lighting to recover a 3D representation that can berendered from novel viewpoints under a target illumination -- are based oninverse rendering, and attempt to disentangle the object geometry, materials,and lighting that explain the input images. Furthermore, this typicallyinvolves optimization through differentiable Monte Carlo rendering, which isbrittle and computationally-expensive. In this work, we propose a simplerapproach: we first relight each input image using an image diffusion modelconditioned on lighting and then reconstruct a Neural Radiance Field (NeRF)with these relit images, from which we render novel views under the targetlighting. We demonstrate that this strategy is surprisingly competitive andachieves state-of-the-art results on multiple relighting benchmarks. Please seeour project page at https://illuminerf.github.io/.