Ying Zhang, Rencan Nie, Jinde Cao, Chaozhen Ma, Chengchao Wang
{"title":"SS-SSAN: a self-supervised subspace attentional network for multi-modal medical image fusion","authors":"Ying Zhang, Rencan Nie, Jinde Cao, Chaozhen Ma, Chengchao Wang","doi":"10.1007/s10462-023-10529-w","DOIUrl":null,"url":null,"abstract":"<div><p>Multi-modal medical image fusion (MMIF) is used to merge multiple modes of medical images for better imaging quality and more comprehensive information, such that enhancing the reliability of clinical diagnosis. Since different types of medical images have different imaging mechanisms and focus on different pathological tissues, how to accurately fuse the information from various medical images has become an obstacle in image fusion research. In this paper, we propose a self-supervised subspace attentional framework for multi-modal image fusion, which is constructed by two sub-networks, i.e., the feature extract network and the feature fusion network. We implement a self-supervised strategy that facilitates the framework adaptively extracts the features of source images with the reconstruction of the fused image. Specifically, we adopt a subspace attentional Siamese Weighted Auto-Encoder as a feature extractor to extract the source image features including local and global features at first. Then, the extracted features are given into a weighted fusion decoding network to reconstruct the fused result, and the shallow features from the extractor are used to assist reconstruct the fused image. Finally, the feature extractor adaptively extracts the optimal features according to the fused results by simultaneously training the two sub-networks. Furthermore, to achieve better fusion results, we design a novel weight estimation in the weighted fidelity loss that measures the importance of each pixel by calculating a mixture of salient features and local contrast features of the image. Experiments demonstrate that our method gives the best results compared with other state-of-the-art fusion approaches.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"56 1","pages":"421 - 443"},"PeriodicalIF":10.7000,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-023-10529-w","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0
Abstract
Multi-modal medical image fusion (MMIF) merges multiple modalities of medical images to achieve better imaging quality and more comprehensive information, thereby enhancing the reliability of clinical diagnosis. Since different types of medical images have different imaging mechanisms and focus on different pathological tissues, accurately fusing the information from various medical images remains an obstacle in image fusion research. In this paper, we propose a self-supervised subspace attentional framework for multi-modal image fusion, constructed from two sub-networks: a feature extraction network and a feature fusion network. We implement a self-supervised strategy that lets the framework adaptively extract the features of the source images through reconstruction of the fused image. Specifically, we first adopt a subspace attentional Siamese Weighted Auto-Encoder as a feature extractor to extract the source image features, including local and global features. The extracted features are then fed into a weighted fusion decoding network to reconstruct the fused result, and the shallow features from the extractor are used to assist in reconstructing the fused image. Finally, by training the two sub-networks simultaneously, the feature extractor adaptively extracts the optimal features according to the fused results. Furthermore, to achieve better fusion results, we design a novel weight estimation in the weighted fidelity loss that measures the importance of each pixel by computing a mixture of salient features and local contrast features of the image. Experiments demonstrate that our method gives the best results compared with other state-of-the-art fusion approaches.
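To make the two-sub-network design concrete, the following is a minimal PyTorch sketch of a Siamese extractor with a subspace attention module feeding a weighted fusion decoder that also receives the shallow skip features. All module names, channel widths, and the exact attention layout are assumptions for illustration only; the abstract does not specify the paper's actual architecture.

```python
# Minimal sketch of the two-sub-network framework described above.
# Module names, channel widths, and the attention design are assumptions.
import torch
import torch.nn as nn

class SubspaceAttention(nn.Module):
    """Hypothetical channel-subspace attention: projects pooled features
    into a low-dimensional subspace and re-weights channels from it."""
    def __init__(self, channels, subspace=8):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.mlp = nn.Sequential(
            nn.Linear(channels, subspace), nn.ReLU(inplace=True),
            nn.Linear(subspace, channels), nn.Sigmoid())

    def forward(self, x):
        w = self.mlp(self.pool(x).flatten(1))        # (B, C) channel weights
        return x * w.unsqueeze(-1).unsqueeze(-1)

class Extractor(nn.Module):
    """Siamese encoder: both modalities share these weights."""
    def __init__(self, ch=32):
        super().__init__()
        self.shallow = nn.Sequential(
            nn.Conv2d(1, ch, 3, padding=1), nn.ReLU(inplace=True))
        self.deep = nn.Sequential(
            nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(inplace=True),
            SubspaceAttention(ch))

    def forward(self, x):
        s = self.shallow(x)                          # shallow features (skip path)
        return s, self.deep(s)                       # plus attended deep features

class FusionDecoder(nn.Module):
    """Fusion decoder: merges both sources' deep features and reconstructs
    the fused image, assisted by the shallow skip features."""
    def __init__(self, ch=32):
        super().__init__()
        self.merge = nn.Conv2d(2 * ch, ch, 1)
        self.recon = nn.Sequential(
            nn.Conv2d(2 * ch, ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch, 1, 3, padding=1), nn.Sigmoid())

    def forward(self, f1, f2, s1, s2):
        m = self.merge(torch.cat([f1, f2], dim=1))
        skip = 0.5 * (s1 + s2)                       # shallow features assist reconstruction
        return self.recon(torch.cat([m, skip], dim=1))

extractor, decoder = Extractor(), FusionDecoder()
a, b = torch.rand(1, 1, 256, 256), torch.rand(1, 1, 256, 256)  # two modalities
s1, f1 = extractor(a)
s2, f2 = extractor(b)
fused = decoder(f1, f2, s1, s2)                      # (1, 1, 256, 256) fused image
```

Training both modules end-to-end against a reconstruction objective is what makes the scheme self-supervised: no ground-truth fused image is needed.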
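The weighted fidelity loss can likewise be sketched. Below, a per-pixel weight map mixes a global-saliency proxy with a local-contrast (windowed standard deviation) term; the specific formulas, window size, and mixing coefficient `alpha` are assumptions in the spirit of the description, not the paper's exact definition.

```python
# Hedged sketch of pixel-importance weights mixing saliency and local
# contrast, and of a weighted fidelity loss built from them.
import torch
import torch.nn.functional as F

def weight_map(img, alpha=0.5, win=9):
    """img: (B, 1, H, W) in [0, 1]. Returns per-pixel importance weights."""
    mean = F.avg_pool2d(img, win, stride=1, padding=win // 2)
    saliency = (img - img.mean(dim=(2, 3), keepdim=True)).abs()   # global saliency proxy
    var = F.avg_pool2d(img ** 2, win, stride=1, padding=win // 2) - mean ** 2
    contrast = var.clamp(min=0).sqrt()                            # local std as contrast
    w = alpha * saliency + (1 - alpha) * contrast
    return w / (w.amax(dim=(2, 3), keepdim=True) + 1e-8)          # normalize to [0, 1]

def weighted_fidelity_loss(fused, src1, src2):
    """Each source contributes to the fidelity term in proportion to its
    per-pixel importance."""
    w1, w2 = weight_map(src1), weight_map(src2)
    return (w1 * (fused - src1) ** 2 + w2 * (fused - src2) ** 2).mean()
```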
Journal Introduction
Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.