Shuhan Lei;Meng Zhang;Yuhang Wang;Nan Tang;Ni Jia;Lihua Fu
{"title":"用于高光谱图像分类的自适应令牌混合器","authors":"Shuhan Lei;Meng Zhang;Yuhang Wang;Nan Tang;Ni Jia;Lihua Fu","doi":"10.1109/JSTARS.2025.3552817","DOIUrl":null,"url":null,"abstract":"MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"8882-8896"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10933583","citationCount":"0","resultStr":"{\"title\":\"Adaptive Token Mixer for Hyperspectral Image Classification\",\"authors\":\"Shuhan Lei;Meng Zhang;Yuhang Wang;Nan Tang;Ni Jia;Lihua Fu\",\"doi\":\"10.1109/JSTARS.2025.3552817\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.\",\"PeriodicalId\":13116,\"journal\":{\"name\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"volume\":\"18 \",\"pages\":\"8882-8896\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10933583\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10933583/\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10933583/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Adaptive Token Mixer for Hyperspectral Image Classification
MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.