Abstract
Dense matching methods like DUSt3R regress pairwise pointmaps for 3D reconstruction. However, the reliance on pairwise prediction and the limited generalization capability inherently restrict the global geometric consistency. In this work, we introduce Test3R, a surprisingly simple test-time learning technique that significantly boosts geometric accuracy. Using image triplets ($I_1, I_2, I_3$), Test3R generates reconstructions from pairs ($I_1, I_2$) and ($I_1, I_3$). The core idea is to optimize the network at test time via a self-supervised objective: maximizing the geometric consistency between these two reconstructions relative to the common image $I_1$. This ensures the model produces cross-pair consistent outputs, regardless of the inputs. Extensive experiments demonstrate that our technique significantly outperforms previous state-of-the-art methods on the 3D reconstruction and multi-view depth estimation tasks. Moreover, it is universally applicable and nearly cost-free, making it easily applied to other models and implemented with minimal test-time training overhead and parameter footprint. Code is available at https://github.com/nopQAQ/Test3R.
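
The cross-pair consistency objective described above can be illustrated with a minimal sketch. It assumes the two pointmaps predicted for the common image $I_1$ (one from the pair ($I_1, I_2$), one from ($I_1, I_3$)) are already expressed in the same coordinate frame and flattened to lists of 3D points. The function name `cross_pair_consistency_loss` and the choice of a mean L1 distance are illustrative assumptions, not taken from the Test3R code; maximizing consistency is expressed here as minimizing this discrepancy.

```python
from typing import Sequence

Point = Sequence[float]  # one 3D point (x, y, z) of a pointmap


def cross_pair_consistency_loss(
    pointmap_12: Sequence[Point],
    pointmap_13: Sequence[Point],
) -> float:
    """Mean L1 distance between two pointmaps of the same reference image.

    pointmap_12: points of I_1 predicted from the pair (I_1, I_2).
    pointmap_13: points of I_1 predicted from the pair (I_1, I_3).
    Both are assumed (hypothetically) to be flattened per-pixel lists
    in the same order and the same coordinate frame.
    """
    if len(pointmap_12) != len(pointmap_13):
        raise ValueError("pointmaps must cover the same pixels")
    if not pointmap_12:
        raise ValueError("pointmaps must not be empty")

    total = 0.0
    for p, q in zip(pointmap_12, pointmap_13):
        # L1 distance between the two 3D predictions for one pixel.
        total += sum(abs(a - b) for a, b in zip(p, q))
    # Average over pixels; zero means the two pairs agree exactly.
    return total / len(pointmap_12)


# Toy example: two pixels, predictions disagree slightly.
pm_12 = [(0.0, 0.0, 1.0), (1.0, 0.0, 2.0)]
pm_13 = [(0.0, 0.0, 1.5), (1.0, 0.5, 2.0)]
loss = cross_pair_consistency_loss(pm_12, pm_13)
```

In a test-time setting, a quantity of this kind would be minimized with respect to a small set of trainable parameters for each scene, so that predictions for $I_1$ agree across pairs; the optimizer and the choice of which parameters to update are not specified in the abstract and are left out of this sketch.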