{"title":"Research on Image Super-Resolution Reconstruction Based on Transformer","authors":"Haiyong Wang, Kai Jiang","doi":"10.1109/AIID51893.2021.9456580","DOIUrl":null,"url":null,"abstract":"The existing super-resolution methods have the problems of blurring contour and losing details, which cannot make full use of the internal and external information of the image. On the basis of convolutional neural network, a super-resolution model embedded in Transformer module is proposed. Through the fusion of CNN and Transformer, image feature extraction and corrupted information recovery are performed. The training process is divided into an unsupervised pre-training process on the ImageNet subset and a supervised fine-tuning training process on the DIV2K data set. Contrastive loss is introduced into the loss function to achieve the consistency of super-resolution image category and texture. It is compared with the current mainstream image super-resolution methods on the standard 5 benchmark data sets. Experiments show that compared with other algorithms, this algorithm can better recover the high-frequency information of the image, and generate super-resolution results with clear details and fine structure.","PeriodicalId":412698,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIID51893.2021.9456580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The existing super-resolution methods have the problems of blurring contour and losing details, which cannot make full use of the internal and external information of the image. On the basis of convolutional neural network, a super-resolution model embedded in Transformer module is proposed. Through the fusion of CNN and Transformer, image feature extraction and corrupted information recovery are performed. The training process is divided into an unsupervised pre-training process on the ImageNet subset and a supervised fine-tuning training process on the DIV2K data set. Contrastive loss is introduced into the loss function to achieve the consistency of super-resolution image category and texture. It is compared with the current mainstream image super-resolution methods on the standard 5 benchmark data sets. Experiments show that compared with other algorithms, this algorithm can better recover the high-frequency information of the image, and generate super-resolution results with clear details and fine structure.