状态空间模型与用于高光谱图像分类的变换器相遇

IF 3.4 2区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Signal Processing Pub Date : 2024-08-22 DOI:10.1016/j.sigpro.2024.109669

Xuefei Shi , Yisi Zhang , Kecheng Liu , Zhaokun Wen , Wenxuan Wang , Tianxiang Zhang , Jiangyun Li

{"title":"状态空间模型与用于高光谱图像分类的变换器相遇","authors":"Xuefei Shi , Yisi Zhang , Kecheng Liu , Zhaokun Wen , Wenxuan Wang , Tianxiang Zhang , Jiangyun Li","doi":"10.1016/j.sigpro.2024.109669","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at <span><span>https://github.com/PPPPPsanG/MamTrans</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"226 ","pages":"Article 109669"},"PeriodicalIF":3.4000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0165168424002895/pdfft?md5=4a576501dea0e507ceb0f2a46f4619c0&pid=1-s2.0-S0165168424002895-main.pdf","citationCount":"0","resultStr":"{\"title\":\"State space models meet transformers for hyperspectral image classification\",\"authors\":\"Xuefei Shi , Yisi Zhang , Kecheng Liu , Zhaokun Wen , Wenxuan Wang , Tianxiang Zhang , Jiangyun Li\",\"doi\":\"10.1016/j.sigpro.2024.109669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at <span><span>https://github.com/PPPPPsanG/MamTrans</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"226 \",\"pages\":\"Article 109669\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0165168424002895/pdfft?md5=4a576501dea0e507ceb0f2a46f4619c0&pid=1-s2.0-S0165168424002895-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168424002895\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168424002895","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

近年来，卷积神经网络和视觉变换器已成为高光谱遥感图像分类任务的主要模型，它们分别利用了计算成本较高的堆积卷积层和自注意机制。最近的研究，如 Mamba 模型，展示了具有高效硬件感知设计的状态空间模型（SSM）在高效建模序列和提取标记隐含特征方面的能力，而这正是准确的高光谱图像（HSI）分类所需要的。因此，基于 SSM 的模型有可能成为遥感高光谱图像分类任务的新架构。然而，由于空间信息和冗余光谱特征的不敏感性，SSM 在对 HSI 进行建模时遇到了挑战。鉴于基于 SSM 的方法在 HSI 分类中鲜有探索，在这项工作中，我们首次探索了基于 SSM 的 HSI 分类模型。我们提出的 MamTrans 方法有效地利用了转换器捕捉空间标记关系的能力和 Mamba 提取标记隐含特征的能力。此外，我们还提出了双向 Mamba 模块，以增强 SSM 在提取人机交互信息中空间特征的空间感知能力。我们提出的 MamTrans 在五个常用的人机交互分类基准中取得了新的一流性能，证明了 MamTrans 的强大泛化能力和基于 SSM 结构的人机交互分类任务的有效性。我们的代码见 https://github.com/PPPPPsanG/MamTrans。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

State space models meet transformers for hyperspectral image classification

In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self-attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM’s spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at https://github.com/PPPPPsanG/MamTrans.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Signal Processing 工程技术-工程：电子与电气

CiteScore

9.20

自引率

9.10%

发文量

309

审稿时长

41 days

期刊介绍： Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.