Learning Correspondence from the Cycle-Consistency of Time

  • 2019-03-18 17:36:00
  • Xiaolong Wang, Allan Jabri, Alexei A. Efros

Abstract

We introduce a self-supervised method for learning visual correspondence from unlabeled video. The main idea is to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch. At training time, our model learns a feature map representation to be useful for performing cycle-consistent tracking. At test time, we use the acquired representation to find nearest neighbors across space and time. We demonstrate the generalizability of the representation -- without finetuning -- across a range of visual correspondence tasks, including video object segmentation, keypoint tracking, and optical flow. Our approach outperforms previous self-supervised methods and performs competitively with strongly supervised methods.
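
To make the cycle-consistency idea concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: features are assumed to be precomputed per frame as an array of shape (T, N, D), tracking uses a hard cosine-similarity nearest neighbor (a trainable model would likely need a differentiable tracker), and the names `nearest_neighbor`, `cycle_track`, and `cycle_error` are hypothetical.

```python
import numpy as np

def l2_normalize(x, eps=1e-8):
    # Scale vectors along the last axis to unit length.
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def nearest_neighbor(query, feats):
    # Index of the row of feats (N, D) most similar to query (D,)
    # under cosine similarity.
    sims = l2_normalize(feats) @ l2_normalize(query)
    return int(np.argmax(sims))

def cycle_track(features, start):
    # features: (T, N, D) array, one D-dim feature per location per frame.
    # Track location `start` forward from frame 0 to frame T-1, then
    # backward to frame 0; return the index reached in frame 0.
    idx = start
    num_frames = features.shape[0]
    for t in range(1, num_frames):
        idx = nearest_neighbor(features[t - 1, idx], features[t])
    for t in range(num_frames - 2, -1, -1):
        idx = nearest_neighbor(features[t + 1, idx], features[t])
    return idx

def cycle_error(features, coords, start):
    # Distance between the start location and where the cycle ends.
    # coords: (N, 2) spatial coordinates of the N locations.
    # A value of zero means the track is cycle-consistent.
    end = cycle_track(features, start)
    return float(np.linalg.norm(coords[end] - coords[start]))
```

In this sketch, a representation that supports good correspondence yields a small `cycle_error`, which is the quantity cycle-consistent training would aim to reduce. The same `nearest_neighbor` lookup illustrates the test-time use described above, where matches are found across space and time.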
