Abstract
3D Gaussian Splatting (GS) significantly struggles to accurately representthe underlying 3D scene geometry, resulting in inaccuracies and floatingartifacts when rendering depth maps. In this paper, we address this limitation,undertaking a comprehensive analysis of the integration of depth priorsthroughout the optimization process of Gaussian primitives, and present a novelstrategy for this purpose. This latter dynamically exploits depth cues from areadily available stereo network, processing virtual stereo pairs rendered bythe GS model itself during training and achieving consistent self-improvementof the scene representation. Experimental results on three popular datasets,breaking ground as the first to assess depth accuracy for these models,validate our findings.