Abstract
Multi-view 3D reconstruction remains a core challenge in computer vision,particularly in applications requiring accurate and scalable representationsacross diverse perspectives. Current leading methods such as DUSt3R employ afundamentally pairwise approach, processing images in pairs and necessitatingcostly global alignment procedures to reconstruct from multiple views. In thiswork, we propose Fast 3D Reconstruction (Fast3R), a novel multi-viewgeneralization to DUSt3R that achieves efficient and scalable 3D reconstructionby processing many views in parallel. Fast3R's Transformer-based architectureforwards N images in a single forward pass, bypassing the need for iterativealignment. Through extensive experiments on camera pose estimation and 3Dreconstruction, Fast3R demonstrates state-of-the-art performance, withsignificant improvements in inference speed and reduced error accumulation.These results establish Fast3R as a robust alternative for multi-viewapplications, offering enhanced scalability without compromising reconstructionaccuracy.