Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation

  • 2018-03-14 12:37:50
  • Xiaoxiao Li, Chen Change Loy
  • 0

Abstract

The problem of video object segmentation can become extremely challengingwhen multiple instances co-exist. While each instance may exhibit large scaleand pose variations, the problem is compounded when instances occlude eachother causing failures in tracking. In this study, we formulate a deeprecurrent network that is capable of segmenting and tracking objects in videosimultaneously by their temporal continuity, yet able to re-identify them whenthey re-appear after a prolonged occlusion. We combine both temporalpropagation and re-identification functionalities into a single framework thatcan be trained end-to-end. In particular, we present a re-identification modulewith template expansion to retrieve missing objects despite their largeappearance changes. In addition, we contribute a new attention-based recurrentmask propagation approach that is robust to distractors not belonging to thetarget segment. Our approach achieves a new state-of-the-art global mean(Region Jaccard and Boundary F measure) of 68.2 on the challenging DAVIS 2017benchmark (test-dev set), outperforming the winning solution which achieves aglobal mean of 66.1 on the same partition.

 

Quick Read (beta)

loading the full paper ...