Abstract
Accurately estimating depth in 360-degree imagery is crucial for virtualreality, autonomous navigation, and immersive media applications. Existingdepth estimation methods designed for perspective-view imagery fail whenapplied to 360-degree images due to different camera projections anddistortions, whereas 360-degree methods perform inferior due to the lack oflabeled data pairs. We propose a new depth estimation framework that utilizesunlabeled 360-degree data effectively. Our approach uses state-of-the-artperspective depth estimation models as teacher models to generate pseudo labelsthrough a six-face cube projection technique, enabling efficient labeling ofdepth in 360-degree images. This method leverages the increasing availabilityof large datasets. Our approach includes two main stages: offline maskgeneration for invalid regions and an online semi-supervised joint trainingregime. We tested our approach on benchmark datasets such as Matterport3D andStanford2D3D, showing significant improvements in depth estimation accuracy,particularly in zero-shot scenarios. Our proposed training pipeline can enhanceany 360 monocular depth estimator and demonstrates effective knowledge transferacross different camera projections and data types. See our project page forresults: https://albert100121.github.io/Depth-Anywhere/