{"title":"Evaluation of Unsupervised Deformable Image Registration Using CNN and ViT on 4D-CT.","authors":"Peizhi Chen, Jialan Wang, Yifan Guo, Yan Wang","doi":"10.2174/0115734056385097251010051841","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Deformable image registration is essential in medical image analysis. The state-of-the-art approaches are unsupervised methods based on convolutional neural networks (CNN) and vision transformers (ViT). While CNNs perform well in extracting local features, ViTs perform better in extracting global features.</p><p><strong>Objective: </strong>This study aimed to compare the performance of CNN and ViT in unsupervised deformable image registration.</p><p><strong>Method: </strong>We have proposed a unified registration framework and evaluated both architectures. Experiments have been conducted using 4D-CT.</p><p><strong>Results: </strong>The results have shown ViT-based registration to achieve superior performance compared to CNN-based methods.</p><p><strong>Conclusion: </strong>The findings have indicated vision transformer architectures to be more effective than convolutional networks for unsupervised deformable registration on 4D-CT data. Foundation Item: This work is supported by the National Natural Science Foundation of China (No.61801413).</p>","PeriodicalId":54215,"journal":{"name":"Current Medical Imaging Reviews","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Medical Imaging Reviews","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2174/0115734056385097251010051841","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Deformable image registration is essential in medical image analysis. The state-of-the-art approaches are unsupervised methods based on convolutional neural networks (CNN) and vision transformers (ViT). While CNNs perform well in extracting local features, ViTs perform better in extracting global features.
Objective: This study aimed to compare the performance of CNN and ViT in unsupervised deformable image registration.
Method: We have proposed a unified registration framework and evaluated both architectures. Experiments have been conducted using 4D-CT.
Results: The results have shown ViT-based registration to achieve superior performance compared to CNN-based methods.
Conclusion: The findings have indicated vision transformer architectures to be more effective than convolutional networks for unsupervised deformable registration on 4D-CT data. Foundation Item: This work is supported by the National Natural Science Foundation of China (No.61801413).
期刊介绍:
Current Medical Imaging Reviews publishes frontier review articles, original research articles, drug clinical trial studies and guest edited thematic issues on all the latest advances on medical imaging dedicated to clinical research. All relevant areas are covered by the journal, including advances in the diagnosis, instrumentation and therapeutic applications related to all modern medical imaging techniques.
The journal is essential reading for all clinicians and researchers involved in medical imaging and diagnosis.