A Deformable Convolutional Neural Network for Video Super-Resolution

IF 1.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Xi Chen, Qi Zhang, Kai Liu, Yong Zhang
{"title":"A Deformable Convolutional Neural Network for Video Super-Resolution","authors":"Xi Chen,&nbsp;Qi Zhang,&nbsp;Kai Liu,&nbsp;Yong Zhang","doi":"10.1111/coin.70052","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Convolutional Neural Networks used deep architectures to achieve deep feature extraction in video super-resolution. However, they suffered from challenges of rapid motion and complex scenes in video super-resolution. In this paper, we present a deformable convolutional neural network for video super-resolution (DVSRNet). DVSRNet mainly contains forward and backward feature propagation blocks (FPBs), feature enhancement blocks (FEBs), a feature fusion block (FFB), and a reconstruction block (RB). FPBs can leverage temporal sequence information to capture rich temporal dimensional information in video super-resolution. To restore detailed information, an optical flow technique guided a CNN to align the obtained structural information of different frames to reduce motion-induced blur and artifacts. To address deformable videos from motioned objects, two FEBs utilized deformable convolutions to adaptively correct misaligned objects to improve spatial continuity of videos. To improve reliability of obtained videos, an FFB is used to integrate relations of different video frames from forward and backward propagations. Finally, an RB via upsampling operations and a residual learning technique is used to construct high-quality videos. Experimental results demonstrate that our DVSRNet exhibits superior performance on multiple public datasets for video super-resolution. Its codes can be available at https://github.com/leyoukai/DVSRNet.</p>\n </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 3","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.70052","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Convolutional Neural Networks used deep architectures to achieve deep feature extraction in video super-resolution. However, they suffered from challenges of rapid motion and complex scenes in video super-resolution. In this paper, we present a deformable convolutional neural network for video super-resolution (DVSRNet). DVSRNet mainly contains forward and backward feature propagation blocks (FPBs), feature enhancement blocks (FEBs), a feature fusion block (FFB), and a reconstruction block (RB). FPBs can leverage temporal sequence information to capture rich temporal dimensional information in video super-resolution. To restore detailed information, an optical flow technique guided a CNN to align the obtained structural information of different frames to reduce motion-induced blur and artifacts. To address deformable videos from motioned objects, two FEBs utilized deformable convolutions to adaptively correct misaligned objects to improve spatial continuity of videos. To improve reliability of obtained videos, an FFB is used to integrate relations of different video frames from forward and backward propagations. Finally, an RB via upsampling operations and a residual learning technique is used to construct high-quality videos. Experimental results demonstrate that our DVSRNet exhibits superior performance on multiple public datasets for video super-resolution. Its codes can be available at https://github.com/leyoukai/DVSRNet.

用于视频超分辨率的可变形卷积神经网络
卷积神经网络采用深度架构实现视频超分辨率的深度特征提取。然而,它们在视频超分辨率中面临着快速运动和复杂场景的挑战。本文提出了一种用于视频超分辨率(DVSRNet)的可变形卷积神经网络。DVSRNet主要包含正向和反向特征传播块(FPBs)、特征增强块(FEBs)、特征融合块(FFB)和重构块(RB)。FPBs可以利用时间序列信息在视频超分辨率中捕获丰富的时间维信息。为了恢复细节信息,采用光流技术引导CNN对获取的不同帧的结构信息进行对齐,以减少运动引起的模糊和伪影。为了处理来自运动对象的可变形视频,两个feb利用可变形卷积自适应校正不对齐的对象,以提高视频的空间连续性。为了提高获得的视频的可靠性,采用FFB对不同视频帧的前向和后向传播的关系进行整合。最后,通过上采样操作和残差学习技术构建高质量视频。实验结果表明,我们的DVSRNet在多个公共数据集上表现出优异的视频超分辨率性能。其代码可在https://github.com/leyoukai/DVSRNet上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computational Intelligence
Computational Intelligence 工程技术-计算机:人工智能
CiteScore
6.90
自引率
3.60%
发文量
65
审稿时长
>12 weeks
期刊介绍: This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信