Abstract
We introduce Princeton365, a large-scale diverse dataset of 365 videos withaccurate camera pose. Our dataset bridges the gap between accuracy and datadiversity in current SLAM benchmarks by introducing a novel ground truthcollection framework that leverages calibration boards and a 360-camera. Wecollect indoor, outdoor, and object scanning videos with synchronized monocularand stereo RGB video outputs as well as IMU. We further propose a new scenescale-aware evaluation metric for SLAM based on the the optical flow induced bythe camera pose estimation error. In contrast to the current metrics, our newmetric allows for comparison between the performance of SLAM methods acrossscenes as opposed to existing metrics such as Average Trajectory Error (ATE),allowing researchers to analyze the failure modes of their methods. We alsopropose a challenging Novel View Synthesis benchmark that covers cases notcovered by current NVS benchmarks, such as fully non-Lambertian scenes with360-degree camera trajectories. Please visithttps://princeton365.cs.princeton.edu for the dataset, code, videos, andsubmission.