{"title":"HCT: a hybrid CNN and transformer network for hyperspectral image super-resolution","authors":"Huapeng Wu, Chenyun Wang, Chenyang Lu, Tianming Zhan","doi":"10.1007/s00530-024-01387-9","DOIUrl":null,"url":null,"abstract":"<p>Recently, convolutional neural network (CNN) and transformer based on hyperspectral image super-resolution methods have achieved superior performance. Nevertheless, this is still an important problem how to effectively extract local and global features and improve spectral representation of hyperspectral image. In this paper, we propose a hybrid CNN and transformer network (HCT) for hyperspectral image super-resolution, which consists of a transformer module with local–global spatial attention mechanism (LSMSAformer) and a convolution module with 3D convolution (3DDWTC) to process high and low frequency information, respectively. Specifically, in the transformer branch, the introduced attention mechanism module (LSMSA) is used to extract local–global spatial features at different scales. In the convolution branch, 3DDWTC is proposed to learn local spatial information and preserve the spectral features, which can enhance the representation of the network. Extensive experimental results show that the proposed method can obtain better results than some state-of-the-art hyperspectral image super-resolution methods.</p>","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00530-024-01387-9","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, convolutional neural network (CNN) and transformer based on hyperspectral image super-resolution methods have achieved superior performance. Nevertheless, this is still an important problem how to effectively extract local and global features and improve spectral representation of hyperspectral image. In this paper, we propose a hybrid CNN and transformer network (HCT) for hyperspectral image super-resolution, which consists of a transformer module with local–global spatial attention mechanism (LSMSAformer) and a convolution module with 3D convolution (3DDWTC) to process high and low frequency information, respectively. Specifically, in the transformer branch, the introduced attention mechanism module (LSMSA) is used to extract local–global spatial features at different scales. In the convolution branch, 3DDWTC is proposed to learn local spatial information and preserve the spectral features, which can enhance the representation of the network. Extensive experimental results show that the proposed method can obtain better results than some state-of-the-art hyperspectral image super-resolution methods.