HyperTaFOR: Task-Adaptive Few-Shot Open-Set Recognition With Spatial-Spectral Selective Transformer for Hyperspectral Imagery

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-07-04 DOI:10.1109/TIP.2025.3555069

Bobo Xi;Wenjie Zhang;Jiaojiao Li;Rui Song;Yunsong Li

{"title":"HyperTaFOR: Task-Adaptive Few-Shot Open-Set Recognition With Spatial-Spectral Selective Transformer for Hyperspectral Imagery","authors":"Bobo Xi;Wenjie Zhang;Jiaojiao Li;Rui Song;Yunsong Li","doi":"10.1109/TIP.2025.3555069","DOIUrl":null,"url":null,"abstract":"Open-set recognition (OSR) aims to accurately classify known categories while effectively rejecting unknown negative samples. Existing methods for OSR in hyperspectral images (HSI) can be generally divided into two categories: reconstruction-based and distance-based methods. Reconstruction-based approaches focus on analyzing reconstruction errors during inference, whereas distance-based methods determine the rejection of unknown samples by measuring their distance to each prototype. However, these techniques often require a substantial amount of training data, which can be both time-consuming and expensive to gather, and they require manual threshold setting, which can be difficult for different tasks. Furthermore, effectively utilizing spectral-spatial information in HSI remains a significant challenge, particularly in open-set scenarios. To tackle these challenges, we introduce a few-shot OSR framework for HSI named HyperTaFOR, which incorporates a novel spatial-spectral selective transformer (S3Former). This framework employs a meta-learning strategy to implement a negative prototype generation module (NPGM) that generates task-adaptive rejection scores, allowing flexible categorization of samples into various known classes and anomalies for each task. Additionally, the S3Former is designed to extract spectral-spatial features, optimizing the use of central pixel information while reducing the impact of irrelevant spatial data. Comprehensive experiments conducted on three benchmark hyperspectral datasets show that our proposed method delivers competitive classification and detection performance in open-set environments when compared to state-of-the-art methods. The code is available online at <uri>https://github.com/B-Xi/TIP_2025_HyperTaFOR</uri>.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"4148-4160"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11071942/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Open-set recognition (OSR) aims to accurately classify known categories while effectively rejecting unknown negative samples. Existing methods for OSR in hyperspectral images (HSI) can be generally divided into two categories: reconstruction-based and distance-based methods. Reconstruction-based approaches focus on analyzing reconstruction errors during inference, whereas distance-based methods determine the rejection of unknown samples by measuring their distance to each prototype. However, these techniques often require a substantial amount of training data, which can be both time-consuming and expensive to gather, and they require manual threshold setting, which can be difficult for different tasks. Furthermore, effectively utilizing spectral-spatial information in HSI remains a significant challenge, particularly in open-set scenarios. To tackle these challenges, we introduce a few-shot OSR framework for HSI named HyperTaFOR, which incorporates a novel spatial-spectral selective transformer (S3Former). This framework employs a meta-learning strategy to implement a negative prototype generation module (NPGM) that generates task-adaptive rejection scores, allowing flexible categorization of samples into various known classes and anomalies for each task. Additionally, the S3Former is designed to extract spectral-spatial features, optimizing the use of central pixel information while reducing the impact of irrelevant spatial data. Comprehensive experiments conducted on three benchmark hyperspectral datasets show that our proposed method delivers competitive classification and detection performance in open-set environments when compared to state-of-the-art methods. The code is available online at https://github.com/B-Xi/TIP_2025_HyperTaFOR.

查看原文本刊更多论文

HyperTaFOR：基于空间光谱选择转换器的高光谱图像任务自适应少镜头开集识别

开放集识别（Open-set recognition， OSR）旨在对已知的类别进行准确分类，同时有效地拒绝未知的负样本。现有的高光谱图像OSR方法大致分为基于重建的方法和基于距离的方法两大类。基于重构的方法侧重于分析推理过程中的重构误差，而基于距离的方法通过测量未知样本到每个原型的距离来确定拒绝。然而，这些技术通常需要大量的训练数据，收集这些数据既耗时又昂贵，而且它们需要手动设置阈值，这对于不同的任务来说可能很困难。此外，在HSI中有效利用光谱空间信息仍然是一个重大挑战，特别是在开放场景下。为了应对这些挑战，我们为HSI引入了一种名为HyperTaFOR的少量OSR框架，该框架采用了一种新型的空间光谱选择变压器（S3Former）。该框架采用元学习策略来实现负原型生成模块（NPGM），该模块生成任务自适应拒绝分数，允许将样本灵活地分类为每个任务的各种已知类别和异常。此外，S3Former设计用于提取光谱空间特征，优化中心像素信息的使用，同时减少不相关空间数据的影响。在三个基准高光谱数据集上进行的综合实验表明，与最先进的方法相比，我们提出的方法在开放集环境中具有竞争力的分类和检测性能。该代码可在https://github.com/B-Xi/TIP_2025_HyperTaFOR上在线获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量