Abstract
Recent approaches for 3D relighting have shown promise in integrating 2D image relighting generative priors to alter the appearance of a 3D representation while preserving the underlying structure. Nevertheless, generative priors used for 2D relighting that directly relight from an input image either do not take advantage of intrinsic properties of the subject that can be inferred or cannot consider multi-view data at scale, leading to subpar relighting. In this paper, we propose LightSwitch, a novel finetuned material-relighting diffusion framework that efficiently relights an arbitrary number of input images to a target lighting condition while incorporating cues from inferred intrinsic properties. By using multi-view and material information cues together with a scalable denoising scheme, our method consistently and efficiently relights dense multi-view data of objects with diverse material compositions. We show that our 2D relighting prediction quality exceeds previous state-of-the-art relighting priors that directly relight from images. We further demonstrate that LightSwitch matches or outperforms state-of-the-art diffusion inverse rendering methods in relighting synthetic and real objects in as little as 2 minutes.