PCET: Patch Confidence-Enhanced Transformer with efficient spectral–spatial features for hyperspectral image classification

IF 7.6 Q1 REMOTE SENSING

International journal of applied earth observation and geoinformation : ITC journal Pub Date : 2025-02-01 DOI:10.1016/j.jag.2024.104308

Li Fang , Xuanli Lan , Tianyu Li , Huifang Shen

{"title":"PCET: Patch Confidence-Enhanced Transformer with efficient spectral–spatial features for hyperspectral image classification","authors":"Li Fang , Xuanli Lan , Tianyu Li , Huifang Shen","doi":"10.1016/j.jag.2024.104308","DOIUrl":null,"url":null,"abstract":"<div><div>Hyperspectral image (HSI) classification based on deep learning has demonstrated promising performance. In general, using patch-wise samples helps to extract the spatial relationship between pixels and local contextual information. However, the presence of background or other category information in an image patch that is inconsistent with the central target category has a negative effect on classification. To solve this issue, a patch confidence-enhanced transformer (PCET) approach for HSI classification is proposed. To be specific, we design a patch quality assessment (PQA) branch model to evaluate the input patches during training process, which effectively filters out the intrusive non-central pixels. The output confidence of the branch model serves as a quantitative indicator of the contribution degree of the input patch to the overall training efficacy, which is subsequently weighted in the loss function, thereby endowing the model with the capability to dynamically adjust its learning focus based on the qualitative of the inputs. Second, a spectral–spatial multi-feature fusion (SSMF) module is devised to procure scores of representative information simultaneously and fully exploit the potential of multi-scale feature HSI data. Finally, to enhance feature discrimination, global context is efficiently modeled using the efficient additive attention transformer (<span><math><mrow><msup><mrow><mi>EA</mi></mrow><mrow><mn>2</mn></mrow></msup><mi>T</mi></mrow></math></span>) module, which streamlines the attention process and allows the model to learn efficient and robust global representations for accurate classification of the central pixel. A series of experimental results executed on real HSI datasets have substantiated that the proposed PCET can achieve outstanding performance, even when only 10 samples per category are used for training.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"136 ","pages":"Article 104308"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843224006666","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 0

Abstract

Hyperspectral image (HSI) classification based on deep learning has demonstrated promising performance. In general, using patch-wise samples helps to extract the spatial relationship between pixels and local contextual information. However, the presence of background or other category information in an image patch that is inconsistent with the central target category has a negative effect on classification. To solve this issue, a patch confidence-enhanced transformer (PCET) approach for HSI classification is proposed. To be specific, we design a patch quality assessment (PQA) branch model to evaluate the input patches during training process, which effectively filters out the intrusive non-central pixels. The output confidence of the branch model serves as a quantitative indicator of the contribution degree of the input patch to the overall training efficacy, which is subsequently weighted in the loss function, thereby endowing the model with the capability to dynamically adjust its learning focus based on the qualitative of the inputs. Second, a spectral–spatial multi-feature fusion (SSMF) module is devised to procure scores of representative information simultaneously and fully exploit the potential of multi-scale feature HSI data. Finally, to enhance feature discrimination, global context is efficiently modeled using the efficient additive attention transformer (

{EA}^{2} T

) module, which streamlines the attention process and allows the model to learn efficient and robust global representations for accurate classification of the central pixel. A series of experimental results executed on real HSI datasets have substantiated that the proposed PCET can achieve outstanding performance, even when only 10 samples per category are used for training.

查看原文本刊更多论文

ppet：具有高效光谱空间特征的高光谱图像分类补丁置信度增强变压器

基于深度学习的高光谱图像（HSI）分类已经显示出良好的性能。通常，使用逐块采样有助于提取像素和局部上下文信息之间的空间关系。然而，在图像补丁中存在与中心目标类别不一致的背景或其他类别信息会对分类产生负面影响。为了解决这一问题，提出了一种贴片置信度增强变压器（PCET）方法用于HSI分类。具体来说，我们设计了一个patch quality assessment （PQA）分支模型，在训练过程中对输入的patch进行评估，有效滤除了干扰的非中心像素。分支模型的输出置信度作为输入片段对整体训练效果贡献程度的定量指标，随后在损失函数中进行加权，从而使模型具有根据输入的定性动态调整学习重点的能力。其次，设计了光谱-空间多特征融合（SSMF）模块，同时获取大量代表性信息，充分挖掘多尺度特征HSI数据的潜力；最后，为了增强特征识别，使用高效的加性注意转换器（EA2T）模块高效地对全局上下文进行建模，该模块简化了注意过程，并允许模型学习高效和鲁棒的全局表示，以准确分类中心像素。在真实HSI数据集上执行的一系列实验结果证实，即使每个类别仅使用10个样本进行训练，所提出的PCET也可以取得出色的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International journal of applied earth observation and geoinformation : ITC journal Global and Planetary Change, Management, Monitoring, Policy and Law, Earth-Surface Processes, Computers in Earth Sciences

CiteScore

12.00

自引率

0.00%

发文量

审稿时长

77 days

期刊介绍： The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.