SpectMamba: Remote sensing change detection network integrating frequency and visual state space model

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Zhiwei Dong, Dapeng Cheng, Jinjiang Li
{"title":"SpectMamba: Remote sensing change detection network integrating frequency and visual state space model","authors":"Zhiwei Dong,&nbsp;Dapeng Cheng,&nbsp;Jinjiang Li","doi":"10.1016/j.eswa.2025.127902","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, the fusion of Convolutional Neural Network (CNNs) and Transformer models, which can simultaneously leverage the former’s efficiency in local feature extraction and the latter’s advantage in capturing long-range dependencies, has achieved complementary strengths and demonstrated superior modeling potential. However, some of the high-frequency subtle changes and periodic structural changes (e.g., regularly arranged clusters of reconstructed buildings) in multispectral remote sensing images are often difficult to detect in the spatial domain; at the same time, the high computational complexity of the Transformer model restricts its practical application. Recently, the state-space model-based Mamba architecture has performed well in the RSCD task, efficiently learning image global information with linear complexity. Based on this, this study hypothesizes that a strategy combining spectral layers with visual state space (VSS) modules can more efficiently parse these challenges in dense prediction tasks. Specifically, we propose the frontier strategy of using a spectral layer for the initial layer and a VSS layer for the deeper layer and verify its effectiveness through extensive experiments. At the same time, we identify and optimize the limitations of VSS in independently processing the high-frequency information output from the spectral layer, and develop Conv-VSS. These techniques are integrated and extended into a network called SpectMamba, which fuses the spectral layer and Conv-VSS to more appropriately capture feature representations by analyzing the feature images in both the frequency domain and spatial features while avoiding the complexity associated with high-dimensional matrix operations in self-attention. Extensive experimental results on three publicly available datasets show that SpectMamba significantly outperforms existing techniques on several performance metrics.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"287 ","pages":"Article 127902"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425015246","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, the fusion of Convolutional Neural Network (CNNs) and Transformer models, which can simultaneously leverage the former’s efficiency in local feature extraction and the latter’s advantage in capturing long-range dependencies, has achieved complementary strengths and demonstrated superior modeling potential. However, some of the high-frequency subtle changes and periodic structural changes (e.g., regularly arranged clusters of reconstructed buildings) in multispectral remote sensing images are often difficult to detect in the spatial domain; at the same time, the high computational complexity of the Transformer model restricts its practical application. Recently, the state-space model-based Mamba architecture has performed well in the RSCD task, efficiently learning image global information with linear complexity. Based on this, this study hypothesizes that a strategy combining spectral layers with visual state space (VSS) modules can more efficiently parse these challenges in dense prediction tasks. Specifically, we propose the frontier strategy of using a spectral layer for the initial layer and a VSS layer for the deeper layer and verify its effectiveness through extensive experiments. At the same time, we identify and optimize the limitations of VSS in independently processing the high-frequency information output from the spectral layer, and develop Conv-VSS. These techniques are integrated and extended into a network called SpectMamba, which fuses the spectral layer and Conv-VSS to more appropriately capture feature representations by analyzing the feature images in both the frequency domain and spatial features while avoiding the complexity associated with high-dimensional matrix operations in self-attention. Extensive experimental results on three publicly available datasets show that SpectMamba significantly outperforms existing techniques on several performance metrics.
spectrmamba:结合频率和视觉状态空间模型的遥感变化检测网络
近年来,卷积神经网络(Convolutional Neural Network, cnn)与Transformer模型的融合实现了优势互补,展现出了卓越的建模潜力,同时利用了前者在局部特征提取方面的效率和后者在捕获远程依赖关系方面的优势。然而,多光谱遥感影像中的一些高频细微变化和周期性结构变化(如有序排列的重建建筑群)往往难以在空间域中检测到;同时,变压器模型的高计算复杂度限制了其实际应用。近年来,基于状态空间模型的Mamba结构在RSCD任务中表现良好,能够有效地学习具有线性复杂度的图像全局信息。基于此,本研究假设将光谱层与视觉状态空间(VSS)模块相结合的策略可以更有效地解析密集预测任务中的这些挑战。具体而言,我们提出了在初始层使用光谱层,在深层使用VSS层的前沿策略,并通过大量实验验证了其有效性。同时,我们发现并优化了VSS在独立处理频谱层输出的高频信息方面的局限性,并开发了convv -VSS。这些技术被集成并扩展到一个名为spectrmamba的网络中,该网络融合了频谱层和convv - vss,通过在频域和空间特征上分析特征图像,更恰当地捕获特征表示,同时避免了自关注中高维矩阵操作带来的复杂性。在三个公开数据集上的大量实验结果表明,spectrmamba在几个性能指标上明显优于现有技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信