The registration of intraoperative x-ray images with preoperative computed tomography (CT) images is an important step in image-guided surgery. However, existing regression-based methods lack an interpretable and stable mechanism for fusing information from intraoperative x-ray images and preoperative CT volumes, and their feature extraction and fusion strategies limit the accuracy of pose regression.
The objective of this study is to develop a method that leverages both x-ray and CT images to rapidly and robustly estimate an accurate initial registration within a broad search space. The approach integrates the strengths of learning-based and traditional registration methods, yielding registration results over a wide search space at high speed.
We introduce a regression-based registration framework to address these issues. We constrain the feature fusion process by training the network to reconstruct, in the embedding space, the high-dimensional feature representation of the preoperative CT volume from the input single-view x-ray, thereby improving the interpretability of feature extraction. To promote effective fusion and better extraction of local texture features and global information, we propose a lightweight cross-attention mechanism named lite cross-attention (LCAT). Furthermore, to meet intraoperative accuracy requirements, we refine the regressed pose with the intensity-based covariance matrix adaptation evolution strategy (CMA-ES).
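To make the fusion step concrete, the following is a minimal NumPy sketch of a cross-attention operation in the spirit of LCAT: queries come from x-ray features, keys and values from CT features, and a single shared low-dimensional projection keeps the module lightweight. The projection matrices, dimensions, and function name are illustrative stand-ins for learned parameters, not the paper's actual architecture.

```python
import numpy as np

def lite_cross_attention(xray_feats, ct_feats, d_k=32, seed=0):
    """Hypothetical sketch of a lightweight cross-attention step.

    Queries are projected from x-ray features; keys and values share one
    low-dimensional projection of the CT features. Random weights stand in
    for learned parameters.
    """
    rng = np.random.default_rng(seed)
    d_x = xray_feats.shape[-1]
    d_c = ct_feats.shape[-1]
    w_q = rng.standard_normal((d_x, d_k)) / np.sqrt(d_x)
    w_kv = rng.standard_normal((d_c, d_k)) / np.sqrt(d_c)
    q = xray_feats @ w_q                       # (N_x, d_k)
    k = ct_feats @ w_kv                        # (N_c, d_k)
    v = ct_feats @ w_kv                        # values share the projection
    scores = q @ k.T / np.sqrt(d_k)            # scaled dot-product, (N_x, N_c)
    attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
    attn /= attn.sum(axis=-1, keepdims=True)   # row-wise softmax
    return attn @ v                            # fused features, (N_x, d_k)

fused = lite_cross_attention(np.ones((16, 64)), np.ones((100, 64)))
print(fused.shape)  # (16, 32)
```

Sharing one projection for keys and values halves the parameter count of a standard cross-attention block, which is one plausible reading of "lite" here.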
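The refinement stage can be illustrated with a simplified evolution-strategy loop around the regressed pose. This sketch uses an isotropic sampling distribution with geometric step-size decay rather than full covariance adaptation, so it is a stand-in for CMA-ES, not the method itself; the `similarity` callback represents a hypothetical intensity-based score (higher is better) between the CT projection at a candidate pose and the intraoperative x-ray.

```python
import numpy as np

def refine_pose(init_pose, similarity, iters=80, sigma0=1.0, popsize=16, seed=0):
    """Simplified evolution-strategy refinement around a regressed pose.

    A stand-in for CMA-ES: isotropic sampling, weighted recombination of
    the best-ranked half of the population, and geometric step-size decay.
    """
    rng = np.random.default_rng(seed)
    mean = np.asarray(init_pose, dtype=float)
    sigma = sigma0
    mu = popsize // 2
    # Log-rank recombination weights, as in standard (mu/mu_w, lambda)-ES.
    weights = np.log(mu + 0.5) - np.log(np.arange(1, mu + 1))
    weights /= weights.sum()
    for _ in range(iters):
        samples = mean + sigma * rng.standard_normal((popsize, mean.size))
        scores = np.array([similarity(s) for s in samples])
        elite = samples[np.argsort(-scores)[:mu]]  # best-ranked candidates
        mean = weights @ elite                     # weighted recombination
        sigma *= 0.97                              # simple geometric cooling
    return mean

# Toy objective: negative distance to a "true" 6-DoF pose (3 rot + 3 trans).
true_pose = np.array([1.0, -1.0, 0.5, 2.0, -1.5, 1.0])
est = refine_pose(np.zeros(6), lambda p: -np.linalg.norm(p - true_pose))
print(np.round(est, 2))
```

In practice the similarity would be computed between a digitally reconstructed radiograph rendered at each candidate pose and the acquired x-ray, which is why a derivative-free optimizer such as CMA-ES is a natural fit.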
Our approach is verified on both real and simulated x-ray data. Experimental results show that, compared with existing learning-based registration methods, our method achieves a median rotation error of 1.9° and a median translation error of 5.6 mm under a large search range. On 52 real x-ray images, the median rotation error is 1.6° and the median translation error is 3.8 mm, benefiting from the smaller search range. We also verify the contributions of the LCAT and embedding reconstruction modules in our registration framework: without them, registration performance degrades to approximately the level of random initialization.
In our experiments, the proposed method achieves higher accuracy and a larger capture range than existing methods on both simulated and real x-ray images. These encouraging results indicate the potential of our method for future clinical application.