MFCTrans: Multi-scale Feature Connection Transformer for Deformable Medical Image Registration

IF 4.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Longji Wang, Zhiyue Yan, Wenming Cao, Jianhua Ji
{"title":"MFCTrans: Multi-scale Feature Connection Transformer for Deformable Medical Image Registration","authors":"Longji Wang, Zhiyue Yan, Wenming Cao, Jianhua Ji","doi":"10.1007/s12559-023-10239-z","DOIUrl":null,"url":null,"abstract":"<p>Deformable Medical Image Registration (DMIR) aims to establish precise anatomical alignment of multiple medical images. However, the existing U-shape networks encounter difficulties in efficiently transferring multi-scale feature information from the encoder to the decoder. To address this issue, we propose a novel backbone network called MFCTrans, which constructs effective feature connection in DMIR. Drawing inspiration from the attention mechanism observed in the human cognitive system, our proposed method employs a Feature Fusion and Assignment Transformer (FFAT) module and a Spatial Cross Attention Fusion (SCAF) module. The former facilitates the fusion of multi-channel features, while the latter guides the integration of multi-scale information. A Multiple Residual (MR) branch is also deployed between the encoder and FFAT to improve the network’s generalization. We conduct extensive qualitative and quantitative evaluations on the OASIS and LPBA40 datasets. The proposed method achieves higher Dice scores than Transmorph by 1.3% and 2.0% on the respective datasets while maintaining a comparable voxel folding percentage. Ablation studies analyze the impacts and efficiency of each component in the proposed method. In summary, our proposed network offers a promising framework for achieving high-quality medical image registration and holds significant potential for applications in computer vision and cognitive computation.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":"88 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12559-023-10239-z","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Deformable Medical Image Registration (DMIR) aims to establish precise anatomical alignment of multiple medical images. However, the existing U-shape networks encounter difficulties in efficiently transferring multi-scale feature information from the encoder to the decoder. To address this issue, we propose a novel backbone network called MFCTrans, which constructs effective feature connection in DMIR. Drawing inspiration from the attention mechanism observed in the human cognitive system, our proposed method employs a Feature Fusion and Assignment Transformer (FFAT) module and a Spatial Cross Attention Fusion (SCAF) module. The former facilitates the fusion of multi-channel features, while the latter guides the integration of multi-scale information. A Multiple Residual (MR) branch is also deployed between the encoder and FFAT to improve the network’s generalization. We conduct extensive qualitative and quantitative evaluations on the OASIS and LPBA40 datasets. The proposed method achieves higher Dice scores than Transmorph by 1.3% and 2.0% on the respective datasets while maintaining a comparable voxel folding percentage. Ablation studies analyze the impacts and efficiency of each component in the proposed method. In summary, our proposed network offers a promising framework for achieving high-quality medical image registration and holds significant potential for applications in computer vision and cognitive computation.

Abstract Image

MFCTrans:用于可变形医学图像配准的多尺度特征连接变换器
可变形医学图像配准(DMIR)旨在为多幅医学图像建立精确的解剖配准。然而,现有的 U 型网络在将多尺度特征信息从编码器有效传输到解码器时遇到了困难。为了解决这个问题,我们提出了一种名为 MFCTrans 的新型骨干网络,它能在 DMIR 中构建有效的特征连接。从人类认知系统中观察到的注意力机制中汲取灵感,我们提出的方法采用了特征融合与分配转换器(FFAT)模块和空间交叉注意力融合(SCAF)模块。前者有助于多通道特征的融合,后者则指导多尺度信息的整合。在编码器和 FFAT 之间还部署了多重残差(MR)分支,以提高网络的泛化能力。我们在 OASIS 和 LPBA40 数据集上进行了广泛的定性和定量评估。在相应的数据集上,所提出的方法比 Transmorph 的 Dice 分数分别高出 1.3% 和 2.0%,同时保持了相当的体素折叠率。消融研究分析了拟议方法中每个组件的影响和效率。总之,我们提出的网络为实现高质量的医学图像配准提供了一个前景广阔的框架,在计算机视觉和认知计算领域具有巨大的应用潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cognitive Computation
Cognitive Computation COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-NEUROSCIENCES
CiteScore
9.30
自引率
3.70%
发文量
116
审稿时长
>12 weeks
期刊介绍: Cognitive Computation is an international, peer-reviewed, interdisciplinary journal that publishes cutting-edge articles describing original basic and applied work involving biologically-inspired computational accounts of all aspects of natural and artificial cognitive systems. It provides a new platform for the dissemination of research, current practices and future trends in the emerging discipline of cognitive computation that bridges the gap between life sciences, social sciences, engineering, physical and mathematical sciences, and humanities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信