General Purpose Image Encoder DINOv2 for Medical Image Registration

  • 2024-02-24 02:15:30
  • Xinrui Song, Xuanang Xu, Pingkun Yan
  • 0

Abstract

Existing medical image registration algorithms rely on either datasetspecific training or local texture-based features to align images. The formercannot be reliably implemented without large modality-specific trainingdatasets, while the latter lacks global semantics thus could be easily trappedat local minima. In this paper, we present a training-free deformable imageregistration method, DINO-Reg, leveraging a general purpose image encoderDINOv2 for image feature extraction. The DINOv2 encoder was trained using theImageNet data containing natural images. We used the pretrained DINOv2 withoutany finetuning. Our method feeds the DINOv2 encoded features into a discreteoptimizer to find the optimal deformable registration field. We conducted aseries of experiments to understand the behavior and role of such a generalpurpose image encoder in the application of image registration. Combined withhandcrafted features, our method won the first place in the recent OncoRegChallenge. To our knowledge, this is the first application of general visionfoundation models in medical image registration.

 

Quick Read (beta)

loading the full paper ...