Dino-Reg: Efficient Multimodal Image Registration with Distilled Features.

IF 8.9 1区医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

IEEE Transactions on Medical Imaging Pub Date : 2025-05-06 DOI:10.1109/tmi.2025.3567247

Xinrui Song,Xuanang Xu,Jiajin Zhang,Diego Machado Reyes,Pingkun Yan

{"title":"Dino-Reg: Efficient Multimodal Image Registration with Distilled Features.","authors":"Xinrui Song,Xuanang Xu,Jiajin Zhang,Diego Machado Reyes,Pingkun Yan","doi":"10.1109/tmi.2025.3567247","DOIUrl":null,"url":null,"abstract":"Medical image registration is a crucial process for aligning anatomical structures, enabling applications such as atlas mapping, longitudinal analysis, and multimodal data fusion. This paper introduces DINO-Reg, an adaptation-free registration method leveraging the vision foundation model, DINOv2, to extract features for deformable 3D medical image alignment. Although DINOv2 was originally trained on natural images, our study links the vision foundation model with medical image registration and demonstrates that the generic image encoder could readily generalize to medical images with state-of-the-art performance. We further propose DINO-Reg-Eco, a knowledge-distilled version using a UNet-structured 3D convolutional neural network (CNN) for feature extraction. The Eco model reduces encoding time by 99% while maintaining state-of-the-art performance, which is essential for resource-limited settings and significantly lowers the carbon footprint associated with intensive computational demands. Benchmarking across diverse datasets shows that both methods outperform existing supervised and unsupervised approaches without fine-tuning, demonstrating the transformative potential of foundation models in medical image registration. Our code is open-sourced at https: //github.com/RPIDIAL/DINO-Reg.","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"38 1","pages":""},"PeriodicalIF":8.9000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Medical Imaging","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/tmi.2025.3567247","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Medical image registration is a crucial process for aligning anatomical structures, enabling applications such as atlas mapping, longitudinal analysis, and multimodal data fusion. This paper introduces DINO-Reg, an adaptation-free registration method leveraging the vision foundation model, DINOv2, to extract features for deformable 3D medical image alignment. Although DINOv2 was originally trained on natural images, our study links the vision foundation model with medical image registration and demonstrates that the generic image encoder could readily generalize to medical images with state-of-the-art performance. We further propose DINO-Reg-Eco, a knowledge-distilled version using a UNet-structured 3D convolutional neural network (CNN) for feature extraction. The Eco model reduces encoding time by 99% while maintaining state-of-the-art performance, which is essential for resource-limited settings and significantly lowers the carbon footprint associated with intensive computational demands. Benchmarking across diverse datasets shows that both methods outperform existing supervised and unsupervised approaches without fine-tuning, demonstrating the transformative potential of foundation models in medical image registration. Our code is open-sourced at https: //github.com/RPIDIAL/DINO-Reg.

查看原文本刊更多论文

Dino-Reg：高效的多模态图像配准与蒸馏特征。

医学图像配准是对齐解剖结构的关键过程，可实现地图集测绘、纵向分析和多模态数据融合等应用。本文介绍了一种利用视觉基础模型DINOv2提取可变形三维医学图像特征的无自适应配准方法DINO-Reg。虽然DINOv2最初是在自然图像上训练的，但我们的研究将视觉基础模型与医学图像配准联系起来，并证明了通用图像编码器可以很容易地推广到具有最先进性能的医学图像。我们进一步提出了DINO-Reg-Eco，这是一种知识提炼版本，使用unet结构的3D卷积神经网络（CNN）进行特征提取。Eco模型将编码时间减少了99%，同时保持了最先进的性能，这对于资源有限的环境至关重要，并且显著降低了与密集计算需求相关的碳足迹。跨不同数据集的基准测试表明，两种方法都优于现有的有监督和无监督方法，无需微调，证明了基础模型在医学图像配准中的变革潜力。我们的代码是在https: //github.com/RPIDIAL/DINO-Reg上开源的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Medical Imaging 医学-成像科学与照相技术

CiteScore

21.80

自引率

5.70%

发文量

637

审稿时长

5.6 months

期刊介绍： The IEEE Transactions on Medical Imaging (T-MI) is a journal that welcomes the submission of manuscripts focusing on various aspects of medical imaging. The journal encourages the exploration of body structure, morphology, and function through different imaging techniques, including ultrasound, X-rays, magnetic resonance, radionuclides, microwaves, and optical methods. It also promotes contributions related to cell and molecular imaging, as well as all forms of microscopy. T-MI publishes original research papers that cover a wide range of topics, including but not limited to novel acquisition techniques, medical image processing and analysis, visualization and performance, pattern recognition, machine learning, and other related methods. The journal particularly encourages highly technical studies that offer new perspectives. By emphasizing the unification of medicine, biology, and imaging, T-MI seeks to bridge the gap between instrumentation, hardware, software, mathematics, physics, biology, and medicine by introducing new analysis methods. While the journal welcomes strong application papers that describe novel methods, it directs papers that focus solely on important applications using medically adopted or well-established methods without significant innovation in methodology to other journals. T-MI is indexed in Pubmed® and Medline®, which are products of the United States National Library of Medicine.