CAT3D: Create Anything in 3D with Multi-View Diffusion Models

  • 2024-05-16 18:59:05
  • Ruiqi Gao, Aleksander Holynski, Philipp Henzler, Arthur Brussee, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron, Ben Poole
  • 0

Abstract

Advances in 3D reconstruction have enabled high-quality 3D capture, butrequire a user to collect hundreds to thousands of images to create a 3D scene.We present CAT3D, a method for creating anything in 3D by simulating thisreal-world capture process with a multi-view diffusion model. Given any numberof input images and a set of target novel viewpoints, our model generateshighly consistent novel views of a scene. These generated views can be used asinput to robust 3D reconstruction techniques to produce 3D representations thatcan be rendered from any viewpoint in real-time. CAT3D can create entire 3Dscenes in as little as one minute, and outperforms existing methods for singleimage and few-view 3D scene creation. See our project page for results andinteractive demos at https://cat3d.github.io .

 

Quick Read (beta)

loading the full paper ...