{"title":"基于多尺度卷积的保结构视频超分辨率","authors":"Feifan Gu, Zhaohui Meng","doi":"10.1109/ISPDS56360.2022.9874251","DOIUrl":null,"url":null,"abstract":"Video super resolution technology refers to the reconstruction of low resolution video into high resolution frames. In recent years, the application of deep learning to super-resolution technology has attracted extensive attention. However, the reconstruction effect of the existing model still has some problems such as double shadow, structural loss, and the solutions of problems are relatively rare. In this paper, we propose a new idea to use gradient extraction branches to guide the reconstruction of high resolution frames in backbone networks. The loss function is improved by combining gradient loss with pixel loss to improve convergence ability. Multi-scale convolution is introduced into the alignment module to enlarge the receptive field and improve the performance of the model to extract large motion features. Experimental results show that the model has good performance on REDS4 and Vid4 data sets.","PeriodicalId":280244,"journal":{"name":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Structure-Preserving Video Super Resolution with Multi-Scale Convolution\",\"authors\":\"Feifan Gu, Zhaohui Meng\",\"doi\":\"10.1109/ISPDS56360.2022.9874251\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Video super resolution technology refers to the reconstruction of low resolution video into high resolution frames. In recent years, the application of deep learning to super-resolution technology has attracted extensive attention. However, the reconstruction effect of the existing model still has some problems such as double shadow, structural loss, and the solutions of problems are relatively rare. In this paper, we propose a new idea to use gradient extraction branches to guide the reconstruction of high resolution frames in backbone networks. The loss function is improved by combining gradient loss with pixel loss to improve convergence ability. Multi-scale convolution is introduced into the alignment module to enlarge the receptive field and improve the performance of the model to extract large motion features. Experimental results show that the model has good performance on REDS4 and Vid4 data sets.\",\"PeriodicalId\":280244,\"journal\":{\"name\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPDS56360.2022.9874251\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDS56360.2022.9874251","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Structure-Preserving Video Super Resolution with Multi-Scale Convolution
Video super resolution technology refers to the reconstruction of low resolution video into high resolution frames. In recent years, the application of deep learning to super-resolution technology has attracted extensive attention. However, the reconstruction effect of the existing model still has some problems such as double shadow, structural loss, and the solutions of problems are relatively rare. In this paper, we propose a new idea to use gradient extraction branches to guide the reconstruction of high resolution frames in backbone networks. The loss function is improved by combining gradient loss with pixel loss to improve convergence ability. Multi-scale convolution is introduced into the alignment module to enlarge the receptive field and improve the performance of the model to extract large motion features. Experimental results show that the model has good performance on REDS4 and Vid4 data sets.