Yifeng Yang;Hengqian Zhao;Xiadan Huangfu;Zihan Li;Pan Wang
{"title":"ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images","authors":"Yifeng Yang;Hengqian Zhao;Xiadan Huangfu;Zihan Li;Pan Wang","doi":"10.1109/JSTARS.2025.3527226","DOIUrl":null,"url":null,"abstract":"The reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sensing images. Super-resolution reconstruction of remote sensing multispectral images not only enhances the spatial resolution of these images but also preserves and improves the spectral information of multispectral data, thereby providing richer ground object information and more accurate environmental monitoring data. To improve the effectiveness of feature extraction in the generator network while maintaining model efficiency, this article proposes the vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model. This model is an improvement upon the original SRGAN super-resolution image reconstruction method, incorporating lightweight network modules, channel attention modules, spatial-spectral residual attention, and the vision transformer structure. The ViT-ISRGAN model focuses on reconstructing four types of typical ground objects based on Sentinel-2 images: urban, water, farmland, and forest. Results indicate that the ViT-ISRGAN model excels in capturing texture details and color restoration, effectively extracting spectral and texture information from multispectral remote sensing images across various scenes. Compared to other super-resolution (SR) models, this approach demonstrates superior effectiveness and performance in the SR tasks of remote sensing multispectral images.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"3973-3988"},"PeriodicalIF":4.7000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10836746","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10836746/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sensing images. Super-resolution reconstruction of remote sensing multispectral images not only enhances the spatial resolution of these images but also preserves and improves the spectral information of multispectral data, thereby providing richer ground object information and more accurate environmental monitoring data. To improve the effectiveness of feature extraction in the generator network while maintaining model efficiency, this article proposes the vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model. This model is an improvement upon the original SRGAN super-resolution image reconstruction method, incorporating lightweight network modules, channel attention modules, spatial-spectral residual attention, and the vision transformer structure. The ViT-ISRGAN model focuses on reconstructing four types of typical ground objects based on Sentinel-2 images: urban, water, farmland, and forest. Results indicate that the ViT-ISRGAN model excels in capturing texture details and color restoration, effectively extracting spectral and texture information from multispectral remote sensing images across various scenes. Compared to other super-resolution (SR) models, this approach demonstrates superior effectiveness and performance in the SR tasks of remote sensing multispectral images.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.