用于高光谱图像分类的自适应令牌混合器

IF 4.7 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Shuhan Lei;Meng Zhang;Yuhang Wang;Nan Tang;Ni Jia;Lihua Fu
{"title":"用于高光谱图像分类的自适应令牌混合器","authors":"Shuhan Lei;Meng Zhang;Yuhang Wang;Nan Tang;Ni Jia;Lihua Fu","doi":"10.1109/JSTARS.2025.3552817","DOIUrl":null,"url":null,"abstract":"MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"8882-8896"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10933583","citationCount":"0","resultStr":"{\"title\":\"Adaptive Token Mixer for Hyperspectral Image Classification\",\"authors\":\"Shuhan Lei;Meng Zhang;Yuhang Wang;Nan Tang;Ni Jia;Lihua Fu\",\"doi\":\"10.1109/JSTARS.2025.3552817\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.\",\"PeriodicalId\":13116,\"journal\":{\"name\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"volume\":\"18 \",\"pages\":\"8882-8896\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10933583\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10933583/\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10933583/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

mlp样模型在高光谱图像(HSI)分类中显示出强大的潜力。然而,它们在所有神经元(令牌)之间的密集连接导致模型尺寸大,计算成本高,并且过度拟合的风险增加。为了解决这些问题,研究人员提出了稀疏连接策略,通过选择和混合令牌的子集来创建更紧凑的MLP模型。然而,大多数令牌选择规则忽略了图像补丁内容,通常引入与任务无关的令牌,这些令牌几乎没有有价值的类分布信息。这个问题在包含丰富空间和光谱信息的hsi中尤为严重。为了克服这个问题,我们提出了一种自适应令牌混合器(ATM)来有效地集成hsi中的空间信息。ATM根据token的内容自适应学习token的位置,使模型能够识别相关token并捕获整个空间域的全局空间信息。此外,我们引入了一个十字卷积算子(COSTCO)来增强局部空间特征提取。ATM和COSTCO的结合通过整合全球和本地的空间信息,实现了全面的代币混合。实验结果表明,该自适应MLP在决策过程中关注信息最丰富、任务相关的区域,提供可解释性,帮助用户理解其预测。此外,自适应MLP在四个公开可用数据集的HSI分类任务上实现了最先进的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Adaptive Token Mixer for Hyperspectral Image Classification
MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.30
自引率
10.90%
发文量
563
审稿时长
4.7 months
期刊介绍: The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信