{"title":"HCT:用于高光谱图像超分辨率的混合 CNN 和变换器网络","authors":"Huapeng Wu, Chenyun Wang, Chenyang Lu, Tianming Zhan","doi":"10.1007/s00530-024-01387-9","DOIUrl":null,"url":null,"abstract":"<p>Recently, convolutional neural network (CNN) and transformer based on hyperspectral image super-resolution methods have achieved superior performance. Nevertheless, this is still an important problem how to effectively extract local and global features and improve spectral representation of hyperspectral image. In this paper, we propose a hybrid CNN and transformer network (HCT) for hyperspectral image super-resolution, which consists of a transformer module with local–global spatial attention mechanism (LSMSAformer) and a convolution module with 3D convolution (3DDWTC) to process high and low frequency information, respectively. Specifically, in the transformer branch, the introduced attention mechanism module (LSMSA) is used to extract local–global spatial features at different scales. In the convolution branch, 3DDWTC is proposed to learn local spatial information and preserve the spectral features, which can enhance the representation of the network. Extensive experimental results show that the proposed method can obtain better results than some state-of-the-art hyperspectral image super-resolution methods.</p>","PeriodicalId":51138,"journal":{"name":"Multimedia Systems","volume":"14 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HCT: a hybrid CNN and transformer network for hyperspectral image super-resolution\",\"authors\":\"Huapeng Wu, Chenyun Wang, Chenyang Lu, Tianming Zhan\",\"doi\":\"10.1007/s00530-024-01387-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Recently, convolutional neural network (CNN) and transformer based on hyperspectral image super-resolution methods have achieved superior performance. Nevertheless, this is still an important problem how to effectively extract local and global features and improve spectral representation of hyperspectral image. In this paper, we propose a hybrid CNN and transformer network (HCT) for hyperspectral image super-resolution, which consists of a transformer module with local–global spatial attention mechanism (LSMSAformer) and a convolution module with 3D convolution (3DDWTC) to process high and low frequency information, respectively. Specifically, in the transformer branch, the introduced attention mechanism module (LSMSA) is used to extract local–global spatial features at different scales. In the convolution branch, 3DDWTC is proposed to learn local spatial information and preserve the spectral features, which can enhance the representation of the network. Extensive experimental results show that the proposed method can obtain better results than some state-of-the-art hyperspectral image super-resolution methods.</p>\",\"PeriodicalId\":51138,\"journal\":{\"name\":\"Multimedia Systems\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Multimedia Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s00530-024-01387-9\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimedia Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00530-024-01387-9","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
HCT: a hybrid CNN and transformer network for hyperspectral image super-resolution
Recently, convolutional neural network (CNN) and transformer based on hyperspectral image super-resolution methods have achieved superior performance. Nevertheless, this is still an important problem how to effectively extract local and global features and improve spectral representation of hyperspectral image. In this paper, we propose a hybrid CNN and transformer network (HCT) for hyperspectral image super-resolution, which consists of a transformer module with local–global spatial attention mechanism (LSMSAformer) and a convolution module with 3D convolution (3DDWTC) to process high and low frequency information, respectively. Specifically, in the transformer branch, the introduced attention mechanism module (LSMSA) is used to extract local–global spatial features at different scales. In the convolution branch, 3DDWTC is proposed to learn local spatial information and preserve the spectral features, which can enhance the representation of the network. Extensive experimental results show that the proposed method can obtain better results than some state-of-the-art hyperspectral image super-resolution methods.
期刊介绍:
This journal details innovative research ideas, emerging technologies, state-of-the-art methods and tools in all aspects of multimedia computing, communication, storage, and applications. It features theoretical, experimental, and survey articles.