{"title":"基于Swin Transformer融合关注网络的遥感图像超分辨率重建","authors":"Zhilin Wang, Hai-Dong Shang, Shuang Wang","doi":"10.1117/12.2653433","DOIUrl":null,"url":null,"abstract":"Image super-resolution reconstruction technology in remote sensing can improve the spatial resolution of remote sensing images with the breakthrough of physical hardware limitations. With the development of deep learning technology, more and more algorithms proposed in the field of natural images are applied to the field of remote sensing super-resolution. Due to the large difference in the size of the objects in remote sensing images and the high complexity of the image, the reconstructed image will be blurred when the algorithm in the field of natural images is directly used. To address this problem, this paper proposes a shallow feature extraction feature fusion with multiple convolutions, followed by the extraction of high-frequency information using the Swin Transformer module with a fusion attention mechanism. The edge details of the image are extracted using the gradient of the image in the final reconstruction process, and complementary fusion is performed at the end of the network, which can effectively supplement the lack of shallow features caused by the deep network. Finally, experiments show that the proposed model obtains satisfactory reconstruction results of remote sensing images.","PeriodicalId":253792,"journal":{"name":"Conference on Optics and Communication Technology","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Super-resolution reconstruction of remote sensing images based on Swin Transformer fusion attention network\",\"authors\":\"Zhilin Wang, Hai-Dong Shang, Shuang Wang\",\"doi\":\"10.1117/12.2653433\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Image super-resolution reconstruction technology in remote sensing can improve the spatial resolution of remote sensing images with the breakthrough of physical hardware limitations. With the development of deep learning technology, more and more algorithms proposed in the field of natural images are applied to the field of remote sensing super-resolution. Due to the large difference in the size of the objects in remote sensing images and the high complexity of the image, the reconstructed image will be blurred when the algorithm in the field of natural images is directly used. To address this problem, this paper proposes a shallow feature extraction feature fusion with multiple convolutions, followed by the extraction of high-frequency information using the Swin Transformer module with a fusion attention mechanism. The edge details of the image are extracted using the gradient of the image in the final reconstruction process, and complementary fusion is performed at the end of the network, which can effectively supplement the lack of shallow features caused by the deep network. Finally, experiments show that the proposed model obtains satisfactory reconstruction results of remote sensing images.\",\"PeriodicalId\":253792,\"journal\":{\"name\":\"Conference on Optics and Communication Technology\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Optics and Communication Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2653433\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Optics and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2653433","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Super-resolution reconstruction of remote sensing images based on Swin Transformer fusion attention network
Image super-resolution reconstruction technology in remote sensing can improve the spatial resolution of remote sensing images with the breakthrough of physical hardware limitations. With the development of deep learning technology, more and more algorithms proposed in the field of natural images are applied to the field of remote sensing super-resolution. Due to the large difference in the size of the objects in remote sensing images and the high complexity of the image, the reconstructed image will be blurred when the algorithm in the field of natural images is directly used. To address this problem, this paper proposes a shallow feature extraction feature fusion with multiple convolutions, followed by the extraction of high-frequency information using the Swin Transformer module with a fusion attention mechanism. The edge details of the image are extracted using the gradient of the image in the final reconstruction process, and complementary fusion is performed at the end of the network, which can effectively supplement the lack of shallow features caused by the deep network. Finally, experiments show that the proposed model obtains satisfactory reconstruction results of remote sensing images.