状态空间模型与用于高光谱图像分类的变换器相遇

IF 3.4 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Xuefei Shi , Yisi Zhang , Kecheng Liu , Zhaokun Wen , Wenxuan Wang , Tianxiang Zhang , Jiangyun Li
{"title":"状态空间模型与用于高光谱图像分类的变换器相遇","authors":"Xuefei Shi ,&nbsp;Yisi Zhang ,&nbsp;Kecheng Liu ,&nbsp;Zhaokun Wen ,&nbsp;Wenxuan Wang ,&nbsp;Tianxiang Zhang ,&nbsp;Jiangyun Li","doi":"10.1016/j.sigpro.2024.109669","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at <span><span>https://github.com/PPPPPsanG/MamTrans</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"226 ","pages":"Article 109669"},"PeriodicalIF":3.4000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0165168424002895/pdfft?md5=4a576501dea0e507ceb0f2a46f4619c0&pid=1-s2.0-S0165168424002895-main.pdf","citationCount":"0","resultStr":"{\"title\":\"State space models meet transformers for hyperspectral image classification\",\"authors\":\"Xuefei Shi ,&nbsp;Yisi Zhang ,&nbsp;Kecheng Liu ,&nbsp;Zhaokun Wen ,&nbsp;Wenxuan Wang ,&nbsp;Tianxiang Zhang ,&nbsp;Jiangyun Li\",\"doi\":\"10.1016/j.sigpro.2024.109669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at <span><span>https://github.com/PPPPPsanG/MamTrans</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"226 \",\"pages\":\"Article 109669\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0165168424002895/pdfft?md5=4a576501dea0e507ceb0f2a46f4619c0&pid=1-s2.0-S0165168424002895-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168424002895\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168424002895","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

近年来,卷积神经网络和视觉变换器已成为高光谱遥感图像分类任务的主要模型,它们分别利用了计算成本较高的堆积卷积层和自注意机制。最近的研究,如 Mamba 模型,展示了具有高效硬件感知设计的状态空间模型(SSM)在高效建模序列和提取标记隐含特征方面的能力,而这正是准确的高光谱图像(HSI)分类所需要的。因此,基于 SSM 的模型有可能成为遥感高光谱图像分类任务的新架构。然而,由于空间信息和冗余光谱特征的不敏感性,SSM 在对 HSI 进行建模时遇到了挑战。鉴于基于 SSM 的方法在 HSI 分类中鲜有探索,在这项工作中,我们首次探索了基于 SSM 的 HSI 分类模型。我们提出的 MamTrans 方法有效地利用了转换器捕捉空间标记关系的能力和 Mamba 提取标记隐含特征的能力。此外,我们还提出了双向 Mamba 模块,以增强 SSM 在提取人机交互信息中空间特征的空间感知能力。我们提出的 MamTrans 在五个常用的人机交互分类基准中取得了新的一流性能,证明了 MamTrans 的强大泛化能力和基于 SSM 结构的人机交互分类任务的有效性。我们的代码见 https://github.com/PPPPPsanG/MamTrans。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
State space models meet transformers for hyperspectral image classification

In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at https://github.com/PPPPPsanG/MamTrans.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Signal Processing
Signal Processing 工程技术-工程:电子与电气
CiteScore
9.20
自引率
9.10%
发文量
309
审稿时长
41 days
期刊介绍: Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信