Saad Sohail;Muhammad Usama;Usman Ghous;Manuel Mazzara;Salvatore Distefano;Muhammad Ahmad
{"title":"EnergyFormer:用于高光谱图像分类的傅里叶嵌入能量关注","authors":"Saad Sohail;Muhammad Usama;Usman Ghous;Manuel Mazzara;Salvatore Distefano;Muhammad Ahmad","doi":"10.1109/LGRS.2025.3596629","DOIUrl":null,"url":null,"abstract":"Hyperspectral images (HSIs) capture detailed spectral–spatial information across hundreds of contiguous bands, enabling precise material identification in domains such as environmental monitoring, agriculture, and urban analysis. However, the high dimensionality and spectral variability inherent to HSIs present significant challenges for effective feature extraction and classification. This letter introduces EnergyFormer (EF), a transformer-based framework designed to overcome these limitations through three key innovations: 1) multihead energy attention (MHEA), which formulates an energy optimization mechanism to selectively enhance discriminative spectral–spatial features; 2) Fourier positional embedding (FoPE), which adaptively models long-range spectral and spatial dependencies; and 3) enhanced convolutional block attention module (ECBAM), which emphasizes informative wavelength bands and spatial structures for robust representation learning. Extensive experiments on the WHU-Hi-HanChuan, Salinas, and Pavia University datasets demonstrate that EF achieves superior classification performance with overall accuracies of 99.28%, 98.63%, and 98.72%, respectively, outperforming leading CNN-, transformer-, and Mamba-based models.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"EnergyFormer: Energy Attention With Fourier Embedding for Hyperspectral Image Classification\",\"authors\":\"Saad Sohail;Muhammad Usama;Usman Ghous;Manuel Mazzara;Salvatore Distefano;Muhammad Ahmad\",\"doi\":\"10.1109/LGRS.2025.3596629\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hyperspectral images (HSIs) capture detailed spectral–spatial information across hundreds of contiguous bands, enabling precise material identification in domains such as environmental monitoring, agriculture, and urban analysis. However, the high dimensionality and spectral variability inherent to HSIs present significant challenges for effective feature extraction and classification. This letter introduces EnergyFormer (EF), a transformer-based framework designed to overcome these limitations through three key innovations: 1) multihead energy attention (MHEA), which formulates an energy optimization mechanism to selectively enhance discriminative spectral–spatial features; 2) Fourier positional embedding (FoPE), which adaptively models long-range spectral and spatial dependencies; and 3) enhanced convolutional block attention module (ECBAM), which emphasizes informative wavelength bands and spatial structures for robust representation learning. Extensive experiments on the WHU-Hi-HanChuan, Salinas, and Pavia University datasets demonstrate that EF achieves superior classification performance with overall accuracies of 99.28%, 98.63%, and 98.72%, respectively, outperforming leading CNN-, transformer-, and Mamba-based models.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11119702/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11119702/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
EnergyFormer: Energy Attention With Fourier Embedding for Hyperspectral Image Classification
Hyperspectral images (HSIs) capture detailed spectral–spatial information across hundreds of contiguous bands, enabling precise material identification in domains such as environmental monitoring, agriculture, and urban analysis. However, the high dimensionality and spectral variability inherent to HSIs present significant challenges for effective feature extraction and classification. This letter introduces EnergyFormer (EF), a transformer-based framework designed to overcome these limitations through three key innovations: 1) multihead energy attention (MHEA), which formulates an energy optimization mechanism to selectively enhance discriminative spectral–spatial features; 2) Fourier positional embedding (FoPE), which adaptively models long-range spectral and spatial dependencies; and 3) enhanced convolutional block attention module (ECBAM), which emphasizes informative wavelength bands and spatial structures for robust representation learning. Extensive experiments on the WHU-Hi-HanChuan, Salinas, and Pavia University datasets demonstrate that EF achieves superior classification performance with overall accuracies of 99.28%, 98.63%, and 98.72%, respectively, outperforming leading CNN-, transformer-, and Mamba-based models.