{"title":"RealFuVSR: Feature Enhanced Real-World Video Super-Resolution","authors":"Zhi Li , Xiong Pang , Yiyue Jiang , Yujie Wang","doi":"10.1016/j.vrih.2023.06.006","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>The recurrent recovery is one of the common methods for video super-resolution, which models the correlation between frames via hidden states. However, when we apply the structure to real-world scenarios, it leads to unsatisfactory artifacts. We found that, in the real-world video super-resolution training, the use of unknown and complex degradation can better simulate the degradation process of the real world.</p></div><div><h3>Methods</h3><p>Based on this, we propose the RealFuVSR model, which simulates the real-world degradation and mitigates the artifacts caused by the video super-resolution. Specifically, we propose a multi-scale feature extraction module(MSF) which extracts and fuses features from multiple scales, it facilitates the elimination of hidden state artifacts. In order to improve the accuracy of hidden states alignment information, RealFuVSR use advanced optical flow-guided deformable convolution. Besides, cascaded residual upsampling module is used to eliminate the noise caused by the upsampling process.</p></div><div><h3>Results</h3><p>The experiment demonstrates that our RealFuVSR model can not only recover the high-quality video but also outperform the state-of-the-art RealBasicVSR and RealESRGAN models.</p></div>","PeriodicalId":33538,"journal":{"name":"Virtual Reality Intelligent Hardware","volume":"5 6","pages":"Pages 523-537"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2096579623000396/pdf?md5=ccc376bebb752e8cce7b3633ad69bf64&pid=1-s2.0-S2096579623000396-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Reality Intelligent Hardware","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2096579623000396","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
Abstract
Background
Recurrent recovery is a common approach to video super-resolution that models the correlation between frames via hidden states. However, when this structure is applied to real-world scenarios, it produces unsatisfactory artifacts. We found that, in real-world video super-resolution training, using unknown and complex degradations better simulates the real-world degradation process.
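To make this concrete, below is a minimal sketch of the kind of randomized degradation pipeline the sentence refers to, in the spirit of Real-ESRGAN-style synthetic degradations. The specific blur, noise, resize, and JPEG parameters are illustrative assumptions, not the paper's pipeline.

```python
# Minimal sketch (assumed parameters, not the paper's exact pipeline):
# degrade an HR frame with a random blur -> downsample -> noise -> JPEG
# chain, so training sees unknown, complex degradations rather than
# clean bicubic downsampling.
import io
import random

import numpy as np
from PIL import Image, ImageFilter


def random_degrade(hr: np.ndarray, scale: int = 4) -> np.ndarray:
    """hr: uint8 RGB array (H, W, 3). Returns a randomly degraded LR array."""
    img = Image.fromarray(hr)

    # 1) Gaussian blur with a random radius.
    img = img.filter(ImageFilter.GaussianBlur(radius=random.uniform(0.2, 3.0)))

    # 2) Downsample by `scale` with a randomly chosen resampling kernel.
    w, h = img.size
    kernel = random.choice([Image.BILINEAR, Image.BICUBIC, Image.LANCZOS])
    img = img.resize((w // scale, h // scale), resample=kernel)

    # 3) Additive Gaussian noise of random strength.
    arr = np.asarray(img).astype(np.float32)
    arr += np.random.normal(0.0, random.uniform(1.0, 10.0), arr.shape)
    arr = np.clip(arr, 0.0, 255.0).astype(np.uint8)

    # 4) JPEG compression at a random quality factor.
    buf = io.BytesIO()
    Image.fromarray(arr).save(buf, format="JPEG", quality=random.randint(30, 95))
    buf.seek(0)
    return np.asarray(Image.open(buf).convert("RGB"))
```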
Methods
Based on this, we propose the RealFuVSR model, which simulates real-world degradation and mitigates the artifacts introduced by video super-resolution. Specifically, we propose a multi-scale feature extraction module (MSF) that extracts and fuses features from multiple scales, facilitating the elimination of hidden-state artifacts. To improve the accuracy of the hidden-state alignment information, RealFuVSR uses advanced optical flow-guided deformable convolution. In addition, a cascaded residual upsampling module is used to eliminate the noise introduced by the upsampling process.
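Since the abstract only names these components, the PyTorch sketch below illustrates one plausible reading of each: flow-guided deformable alignment with offsets learned as residuals on top of optical flow (in the style of BasicVSR++), a multi-scale feature extraction and fusion block, and a cascaded residual upsampler. All module names, channel sizes, and design details are our assumptions, not the paper's released implementation.

```python
# Illustrative sketches of the three named components (assumed designs).
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.ops import DeformConv2d


def flow_warp(x, flow):
    """Backward-warp features x (N,C,H,W) by optical flow (N,2,H,W), (dx,dy) order."""
    n, _, h, w = x.shape
    ys, xs = torch.meshgrid(
        torch.arange(h, device=x.device, dtype=x.dtype),
        torch.arange(w, device=x.device, dtype=x.dtype),
        indexing="ij",
    )
    gx = 2.0 * (xs.unsqueeze(0) + flow[:, 0]) / max(w - 1, 1) - 1.0
    gy = 2.0 * (ys.unsqueeze(0) + flow[:, 1]) / max(h - 1, 1) - 1.0
    return F.grid_sample(x, torch.stack((gx, gy), dim=-1), align_corners=True)


class FlowGuidedAlign(nn.Module):
    """Align the previous hidden state with flow-guided deformable convolution:
    DCN offsets are learned as residuals on top of the optical-flow prior."""

    def __init__(self, c=64, offset_groups=8):
        super().__init__()
        self.offset_conv = nn.Conv2d(c * 2 + 2, offset_groups * 2 * 9, 3, padding=1)
        self.mask_conv = nn.Conv2d(c * 2 + 2, offset_groups * 9, 3, padding=1)
        self.dcn = DeformConv2d(c, c, 3, padding=1)

    def forward(self, hidden, feat, flow):
        cond = torch.cat([flow_warp(hidden, flow), feat, flow], dim=1)
        res = self.offset_conv(cond)
        # Tile the flow (flipped to (dy,dx) to match the offset layout) per kernel tap.
        offset = res + flow.flip(1).repeat(1, res.size(1) // 2, 1, 1)
        mask = torch.sigmoid(self.mask_conv(cond))
        return self.dcn(hidden, offset, mask)


class MultiScaleFeature(nn.Module):
    """Illustrative MSF: extract features at three scales and fuse them."""

    def __init__(self, c=64):
        super().__init__()
        self.conv = nn.Conv2d(c, c, 3, padding=1)
        self.fuse = nn.Conv2d(c * 3, c, 1)

    def forward(self, x):
        up = lambda t: F.interpolate(t, size=x.shape[-2:], mode="bilinear",
                                     align_corners=False)
        s1 = F.relu(self.conv(x))
        s2 = F.relu(self.conv(F.avg_pool2d(x, 2)))
        s4 = F.relu(self.conv(F.avg_pool2d(x, 4)))
        return x + self.fuse(torch.cat([s1, up(s2), up(s4)], dim=1))


class CascadedResidualUpsample(nn.Module):
    """Two x2 pixel-shuffle stages, each followed by a residual refinement conv."""

    def __init__(self, c=64):
        super().__init__()
        self.stages = nn.ModuleList(
            [nn.Sequential(nn.Conv2d(c, c * 4, 3, padding=1), nn.PixelShuffle(2))
             for _ in range(2)]
        )
        self.refine = nn.ModuleList([nn.Conv2d(c, c, 3, padding=1) for _ in range(2)])

    def forward(self, x):
        for up, ref in zip(self.stages, self.refine):
            x = up(x)
            x = x + ref(x)  # residual refinement suppresses upsampling noise
        return x
```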
Results
Experiments demonstrate that our RealFuVSR model not only recovers high-quality video but also outperforms the state-of-the-art RealBasicVSR and RealESRGAN models.