Hyperspectral image classification using local-to-global retention network

IF 5 2区 物理与天体物理 Q1 OPTICS
Rajat Kumar Arya, Subhojit Paul, Rajeev Srivastava
{"title":"Hyperspectral image classification using local-to-global retention network","authors":"Rajat Kumar Arya,&nbsp;Subhojit Paul,&nbsp;Rajeev Srivastava","doi":"10.1016/j.optlastec.2025.113991","DOIUrl":null,"url":null,"abstract":"<div><div>Several transformer-based methods have demonstrated competitiveness in the hyperspectral image (HSI) classification tasks. However, the transformer has an expensive and slow inference due to the computational complexity of the self-attention mechanism, which increases the risk of a bottleneck while processing all the tokens for high-resolution images. Retentive Network (RetNet) has recently been proposed for large language models to achieve training parallelism, low-cost inference, and high performance. However, RetNet also struggles to capture local information, such as conventional transformers. This study proposes local-to-global retention (LGRetention) to overcome the issues. The proposed method directly obtains the inherent features from the original spatial-spectral patch, thereby preserving the spatial distribution and intrinsic spectral-spatial correlation. Moreover, the proposed LGRetention includes a local-to-global self-retention (LGSR), which contains Local Convolutional Network (LCN) and Global Self-Retention (GSR) modules created using three-dimensional (3D) convolutional neural networks (CNNs) and a developed HSI Retention mechanism to capture local and global features and enhance the long-range dependencies and spectral-spatial feature learning ability, thereby capturing both fine and coarse-grained data. In the HSI Retention mechanism, we developed a 3D parallel retention mechanism to retain the degradation between succeeding horizontal and vertical positions with the spectral dimension of the HSI data. Experimental results on three benchmark HSI datasets demonstrate that the proposed LGRetention outperforms the state-of-the-art methods. It increases accuracy and creates finer classification maps, demonstrating robustness and generalizability in the HSI classification task.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113991"},"PeriodicalIF":5.0000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030399225015828","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Several transformer-based methods have demonstrated competitiveness in the hyperspectral image (HSI) classification tasks. However, the transformer has an expensive and slow inference due to the computational complexity of the self-attention mechanism, which increases the risk of a bottleneck while processing all the tokens for high-resolution images. Retentive Network (RetNet) has recently been proposed for large language models to achieve training parallelism, low-cost inference, and high performance. However, RetNet also struggles to capture local information, such as conventional transformers. This study proposes local-to-global retention (LGRetention) to overcome the issues. The proposed method directly obtains the inherent features from the original spatial-spectral patch, thereby preserving the spatial distribution and intrinsic spectral-spatial correlation. Moreover, the proposed LGRetention includes a local-to-global self-retention (LGSR), which contains Local Convolutional Network (LCN) and Global Self-Retention (GSR) modules created using three-dimensional (3D) convolutional neural networks (CNNs) and a developed HSI Retention mechanism to capture local and global features and enhance the long-range dependencies and spectral-spatial feature learning ability, thereby capturing both fine and coarse-grained data. In the HSI Retention mechanism, we developed a 3D parallel retention mechanism to retain the degradation between succeeding horizontal and vertical positions with the spectral dimension of the HSI data. Experimental results on three benchmark HSI datasets demonstrate that the proposed LGRetention outperforms the state-of-the-art methods. It increases accuracy and creates finer classification maps, demonstrating robustness and generalizability in the HSI classification task.
基于局部到全局保持网络的高光谱图像分类
几种基于变压器的方法在高光谱图像(HSI)分类任务中表现出了竞争力。然而,由于自关注机制的计算复杂性,变压器具有昂贵且缓慢的推理,这增加了在处理高分辨率图像的所有令牌时出现瓶颈的风险。保留网络(RetNet)最近被提出用于大型语言模型,以实现训练并行性、低成本推理和高性能。然而,RetNet也在努力获取本地信息,比如传统的变压器。本研究提出了局部到全局保留(lgreitation)来克服这些问题。该方法直接从原始的空间-光谱斑块中获取固有特征,从而保持了空间分布和固有的光谱-空间相关性。此外,所提出的lgretension包括一个局部到全局自保留(LGSR),其中包含使用三维(3D)卷积神经网络(cnn)创建的局部卷积网络(LCN)和全局自保留(GSR)模块,以及一个开发的HSI保留机制,以捕获局部和全局特征,增强远程依赖关系和频谱空间特征学习能力,从而捕获细粒度和粗粒度数据。在HSI保留机制中,我们开发了一种三维平行保留机制,以保留HSI数据的光谱维度在后续水平和垂直位置之间的退化。在三个基准HSI数据集上的实验结果表明,所提出的lgreattention优于目前最先进的方法。它提高了准确性并创建了更精细的分类图,在HSI分类任务中展示了健壮性和泛化性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.50
自引率
10.00%
发文量
1060
审稿时长
3.4 months
期刊介绍: Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication. The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas: •development in all types of lasers •developments in optoelectronic devices and photonics •developments in new photonics and optical concepts •developments in conventional optics, optical instruments and components •techniques of optical metrology, including interferometry and optical fibre sensors •LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow •applications of lasers to materials processing, optical NDT display (including holography) and optical communication •research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume) •developments in optical computing and optical information processing •developments in new optical materials •developments in new optical characterization methods and techniques •developments in quantum optics •developments in light assisted micro and nanofabrication methods and techniques •developments in nanophotonics and biophotonics •developments in imaging processing and systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信