SFMamba: a novel spatial-frequency collaborative learning for multimodal medical image fusion with Mamba

Zhaijuan Ding, Zhaisheng Ding, Yunzhe Men, Yanyu Liu, Shengyang Luan, Shufang Tian

Applied Intelligence, Vol. 56, No. 7 (IF 3.5), published 2026-05-07
DOI: 10.1007/s10489-026-07252-8
Article: https://link.springer.com/article/10.1007/s10489-026-07252-8
JCR: Q2 (Computer Science, Artificial Intelligence)
Citations: 0
Abstract
The objective of medical image fusion is to discern and combine complementary features extracted from multimodal medical images, producing fused representations that are more interpretable and informative, which in turn improves diagnostic accuracy and efficiency. While Transformer-based fusion methods excel at capturing long-range dependencies and enable parallel computation, their computational cost rises sharply with input size, often reaching quadratic complexity. To address this challenge, we propose a novel spatial-frequency domain fusion network, termed SFMamba, which couples a frequency transformation with the Mamba model to fully exploit both spatial and frequency information. An efficient Mamba branch incorporates a spatial-frequency state-space (SFSS) model, reducing the computational burden to linear or near-linear complexity. The selective-band feature extraction (SBFE) branch is built on a discrete wavelet pyramid, designed to capture multi-scale frequency components consistently across source images. To fuse multimodal complementary information dynamically and effectively, we introduce a multi-domain feature fusion (MDFF) module that further improves fusion performance. Training follows a multi-teacher learning strategy (MTLS) that uses pre-trained convolutional and Transformer-based fusion methods to generate multiple pseudo-labels, guiding the network to inherit fused knowledge from these priors. Extensive experiments demonstrate that SFMamba achieves state-of-the-art performance in both subjective and objective evaluations. The code for SFMamba is available at https://github.com/DZSYUNNAN/SFMamba.
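The discrete wavelet pyramid underlying the SBFE branch can be illustrated with a one-level 2D Haar transform, which splits an image into a low-frequency approximation and three detail sub-bands; stacking this recursively on the approximation yields the pyramid. This is a generic sketch of the standard Haar decomposition, not the paper's implementation; the function name and plain-list image representation are assumptions for illustration.

```python
def haar_dwt2(img):
    """One-level 2D Haar transform on an even-sized grayscale image
    (a list of rows): returns the low-frequency sub-band LL and the
    horizontal, vertical, and diagonal detail sub-bands LH, HL, HH."""
    h, w = len(img), len(img[0])
    LL, LH, HL, HH = ([[0.0] * (w // 2) for _ in range(h // 2)] for _ in range(4))
    for i in range(0, h, 2):
        for j in range(0, w, 2):
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            LL[i // 2][j // 2] = (a + b + c + d) / 2.0  # coarse approximation
            LH[i // 2][j // 2] = (a + b - c - d) / 2.0  # horizontal detail
            HL[i // 2][j // 2] = (a - b + c - d) / 2.0  # vertical detail
            HH[i // 2][j // 2] = (a - b - c + d) / 2.0  # diagonal detail
    return LL, LH, HL, HH
```

A multi-level pyramid follows by applying `haar_dwt2` again to the returned `LL` sub-band; the detail sub-bands at each level are the frequency components that a selective-band branch can weight or fuse per modality.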
About the journal
With a focus on research in artificial intelligence and neural networks, this journal addresses real-life problems in manufacturing, defense, management, government, and industry that are too complex to solve through conventional approaches and instead require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.