Rajat Kumar Arya, Subhojit Paul, Rajeev Srivastava
{"title":"Hyperspectral image classification using local-to-global retention network","authors":"Rajat Kumar Arya, Subhojit Paul, Rajeev Srivastava","doi":"10.1016/j.optlastec.2025.113991","DOIUrl":null,"url":null,"abstract":"<div><div>Several transformer-based methods have demonstrated competitiveness in the hyperspectral image (HSI) classification tasks. However, the transformer has an expensive and slow inference due to the computational complexity of the self-attention mechanism, which increases the risk of a bottleneck while processing all the tokens for high-resolution images. Retentive Network (RetNet) has recently been proposed for large language models to achieve training parallelism, low-cost inference, and high performance. However, RetNet also struggles to capture local information, such as conventional transformers. This study proposes local-to-global retention (LGRetention) to overcome the issues. The proposed method directly obtains the inherent features from the original spatial-spectral patch, thereby preserving the spatial distribution and intrinsic spectral-spatial correlation. Moreover, the proposed LGRetention includes a local-to-global self-retention (LGSR), which contains Local Convolutional Network (LCN) and Global Self-Retention (GSR) modules created using three-dimensional (3D) convolutional neural networks (CNNs) and a developed HSI Retention mechanism to capture local and global features and enhance the long-range dependencies and spectral-spatial feature learning ability, thereby capturing both fine and coarse-grained data. In the HSI Retention mechanism, we developed a 3D parallel retention mechanism to retain the degradation between succeeding horizontal and vertical positions with the spectral dimension of the HSI data. Experimental results on three benchmark HSI datasets demonstrate that the proposed LGRetention outperforms the state-of-the-art methods. It increases accuracy and creates finer classification maps, demonstrating robustness and generalizability in the HSI classification task.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113991"},"PeriodicalIF":5.0000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030399225015828","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Several transformer-based methods have demonstrated competitiveness in the hyperspectral image (HSI) classification tasks. However, the transformer has an expensive and slow inference due to the computational complexity of the self-attention mechanism, which increases the risk of a bottleneck while processing all the tokens for high-resolution images. Retentive Network (RetNet) has recently been proposed for large language models to achieve training parallelism, low-cost inference, and high performance. However, RetNet also struggles to capture local information, such as conventional transformers. This study proposes local-to-global retention (LGRetention) to overcome the issues. The proposed method directly obtains the inherent features from the original spatial-spectral patch, thereby preserving the spatial distribution and intrinsic spectral-spatial correlation. Moreover, the proposed LGRetention includes a local-to-global self-retention (LGSR), which contains Local Convolutional Network (LCN) and Global Self-Retention (GSR) modules created using three-dimensional (3D) convolutional neural networks (CNNs) and a developed HSI Retention mechanism to capture local and global features and enhance the long-range dependencies and spectral-spatial feature learning ability, thereby capturing both fine and coarse-grained data. In the HSI Retention mechanism, we developed a 3D parallel retention mechanism to retain the degradation between succeeding horizontal and vertical positions with the spectral dimension of the HSI data. Experimental results on three benchmark HSI datasets demonstrate that the proposed LGRetention outperforms the state-of-the-art methods. It increases accuracy and creates finer classification maps, demonstrating robustness and generalizability in the HSI classification task.
期刊介绍:
Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication.
The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas:
•development in all types of lasers
•developments in optoelectronic devices and photonics
•developments in new photonics and optical concepts
•developments in conventional optics, optical instruments and components
•techniques of optical metrology, including interferometry and optical fibre sensors
•LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow
•applications of lasers to materials processing, optical NDT display (including holography) and optical communication
•research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume)
•developments in optical computing and optical information processing
•developments in new optical materials
•developments in new optical characterization methods and techniques
•developments in quantum optics
•developments in light assisted micro and nanofabrication methods and techniques
•developments in nanophotonics and biophotonics
•developments in imaging processing and systems