{"title":"CM-CSAMFNet:一种用于多模态医学图像融合的跨模态通道和空间关注模块融合网络","authors":"Yixiang Lu, Changqing Xu, Jingyun Gong, Qingwei Gao, Dong Sun, De Zhu","doi":"10.1016/j.sigpro.2025.110288","DOIUrl":null,"url":null,"abstract":"<div><div>The fusion technology of functional and structural images contributes to clinical diagnosis by integrating complementary information from different modalities. However, traditional state-of-the-art fusion methods and convolutional networks still require manually designed fusion strategies, which are inefficient and struggle to effectively merge complementary information across modalities. In addition, multiscale fusion methods suffer from excessive model parameters and inadequate consideration of long-range dependencies. To overcome these limitations, an attention-based end-to-end framework (CM-CSAMFNet) is proposed for medical image fusion using a multiscale autoencoder architecture. To design a learnable fusion strategy, we introduce a convolutional block attention module fusion network (CBAMFNet), which leverages cross-modal channel and spatial attention mechanisms to replace conventional fusion approaches. To reduce the number of parameters in multiscale network, the entire network employs ghost convolution techniques, which require only a small number of convolutional operations while extensively utilizing linear computations. Furthermore, to fully account for long-range dependencies, a cross-domain attention mechanism named the cross-modal residual convolutional block attention module (RCBAM) is proposed. This mechanism aims to comprehensively integrate locally complementary features and enhance global brightness information. More specifically, the cross-domain attention module incorporates spatial and channel attention mechanisms to integrate long-range dependencies within and across different modalities. Compared to existing approaches, the proposed fusion algorithm achieves superior performance in SPECT–MRI and PET–MRI image fusion tasks, as evaluated by both subjective and objective metrics. The code of the proposed method is available at <span><span>https://github.com/ahu-dsp/CM-CSAMFNet</span><svg><path></path></svg></span></div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"239 ","pages":"Article 110288"},"PeriodicalIF":3.6000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CM-CSAMFNet: A cross-modality channel and spatial attention module fusion network for multimodal medical image fusion\",\"authors\":\"Yixiang Lu, Changqing Xu, Jingyun Gong, Qingwei Gao, Dong Sun, De Zhu\",\"doi\":\"10.1016/j.sigpro.2025.110288\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The fusion technology of functional and structural images contributes to clinical diagnosis by integrating complementary information from different modalities. However, traditional state-of-the-art fusion methods and convolutional networks still require manually designed fusion strategies, which are inefficient and struggle to effectively merge complementary information across modalities. In addition, multiscale fusion methods suffer from excessive model parameters and inadequate consideration of long-range dependencies. To overcome these limitations, an attention-based end-to-end framework (CM-CSAMFNet) is proposed for medical image fusion using a multiscale autoencoder architecture. To design a learnable fusion strategy, we introduce a convolutional block attention module fusion network (CBAMFNet), which leverages cross-modal channel and spatial attention mechanisms to replace conventional fusion approaches. To reduce the number of parameters in multiscale network, the entire network employs ghost convolution techniques, which require only a small number of convolutional operations while extensively utilizing linear computations. Furthermore, to fully account for long-range dependencies, a cross-domain attention mechanism named the cross-modal residual convolutional block attention module (RCBAM) is proposed. This mechanism aims to comprehensively integrate locally complementary features and enhance global brightness information. More specifically, the cross-domain attention module incorporates spatial and channel attention mechanisms to integrate long-range dependencies within and across different modalities. Compared to existing approaches, the proposed fusion algorithm achieves superior performance in SPECT–MRI and PET–MRI image fusion tasks, as evaluated by both subjective and objective metrics. The code of the proposed method is available at <span><span>https://github.com/ahu-dsp/CM-CSAMFNet</span><svg><path></path></svg></span></div></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"239 \",\"pages\":\"Article 110288\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168425004025\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425004025","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
CM-CSAMFNet: A cross-modality channel and spatial attention module fusion network for multimodal medical image fusion
The fusion technology of functional and structural images contributes to clinical diagnosis by integrating complementary information from different modalities. However, traditional state-of-the-art fusion methods and convolutional networks still require manually designed fusion strategies, which are inefficient and struggle to effectively merge complementary information across modalities. In addition, multiscale fusion methods suffer from excessive model parameters and inadequate consideration of long-range dependencies. To overcome these limitations, an attention-based end-to-end framework (CM-CSAMFNet) is proposed for medical image fusion using a multiscale autoencoder architecture. To design a learnable fusion strategy, we introduce a convolutional block attention module fusion network (CBAMFNet), which leverages cross-modal channel and spatial attention mechanisms to replace conventional fusion approaches. To reduce the number of parameters in multiscale network, the entire network employs ghost convolution techniques, which require only a small number of convolutional operations while extensively utilizing linear computations. Furthermore, to fully account for long-range dependencies, a cross-domain attention mechanism named the cross-modal residual convolutional block attention module (RCBAM) is proposed. This mechanism aims to comprehensively integrate locally complementary features and enhance global brightness information. More specifically, the cross-domain attention module incorporates spatial and channel attention mechanisms to integrate long-range dependencies within and across different modalities. Compared to existing approaches, the proposed fusion algorithm achieves superior performance in SPECT–MRI and PET–MRI image fusion tasks, as evaluated by both subjective and objective metrics. The code of the proposed method is available at https://github.com/ahu-dsp/CM-CSAMFNet
期刊介绍:
Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing.
Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.