SA-CVSR: Scale-Arbitrary Compressed Video Super-Resolution

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Gang He , Chang Wu , Guancheng Quan , Xinquan Lai , Yunsong Li
{"title":"SA-CVSR: Scale-Arbitrary Compressed Video Super-Resolution","authors":"Gang He ,&nbsp;Chang Wu ,&nbsp;Guancheng Quan ,&nbsp;Xinquan Lai ,&nbsp;Yunsong Li","doi":"10.1016/j.patcog.2025.111745","DOIUrl":null,"url":null,"abstract":"<div><div>To mitigate transmission and storage expenses, the existing compressed video super-resolution (CVSR) approaches typically downsample high-resolution (HR) videos before encoding and then restore decoded videos to their original resolution leveraging deep neural networks (DNNs). However, they employ fixed integer scale factors for diverse video types and compression ratios, potentially resulting in suboptimal performance. In this paper, we propose a Scale-Arbitrary Compressed Video Super-Resolution (SA-CVSR) approach to achieve the optimal trade-off between bit-rate and quality. In our approach, we first apply a Support Vector Machine (SVM) based scale predictor to determine the optimal scale factors for an individual video across various compression ratios. Then, we design a novel Priors-Guided Restoration–Reconstruction Network (PGRRN), which is constructed by stacking multiple Priors-Guided Processing Blocks (PGPBs), to process low-resolution (LR) compressed videos in two stages. Specifically, in the restoration stage, PGPBs perform precise motion compensation between two temporally adjacent frames and incorporate the coding prior, which enables PGRRN to effectively eliminate the compression damage content-adaptively. In the subsequent reconstruction stage, PGPBs incorporate the scale prior to achieve high-quality scale-arbitrary super-resolution. Extensive experimental results provide evidence of the effectiveness of SA-CVSR, as it demonstrates a substantial improvement in bit-rate reduction when compared to other CVSR approaches on multiple datasets.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"167 ","pages":"Article 111745"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325004054","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

To mitigate transmission and storage expenses, the existing compressed video super-resolution (CVSR) approaches typically downsample high-resolution (HR) videos before encoding and then restore decoded videos to their original resolution leveraging deep neural networks (DNNs). However, they employ fixed integer scale factors for diverse video types and compression ratios, potentially resulting in suboptimal performance. In this paper, we propose a Scale-Arbitrary Compressed Video Super-Resolution (SA-CVSR) approach to achieve the optimal trade-off between bit-rate and quality. In our approach, we first apply a Support Vector Machine (SVM) based scale predictor to determine the optimal scale factors for an individual video across various compression ratios. Then, we design a novel Priors-Guided Restoration–Reconstruction Network (PGRRN), which is constructed by stacking multiple Priors-Guided Processing Blocks (PGPBs), to process low-resolution (LR) compressed videos in two stages. Specifically, in the restoration stage, PGPBs perform precise motion compensation between two temporally adjacent frames and incorporate the coding prior, which enables PGRRN to effectively eliminate the compression damage content-adaptively. In the subsequent reconstruction stage, PGPBs incorporate the scale prior to achieve high-quality scale-arbitrary super-resolution. Extensive experimental results provide evidence of the effectiveness of SA-CVSR, as it demonstrates a substantial improvement in bit-rate reduction when compared to other CVSR approaches on multiple datasets.
SA-CVSR:任意尺度压缩视频超分辨率
为了降低传输和存储费用,现有的压缩视频超分辨率(CVSR)方法通常在编码前对高分辨率(HR)视频进行采样,然后利用深度神经网络(dnn)将解码后的视频恢复到原始分辨率。然而,对于不同的视频类型和压缩比,它们采用固定的整数比例因子,这可能导致性能不理想。在本文中,我们提出了一种任意尺度压缩视频超分辨率(SA-CVSR)方法来实现比特率和质量之间的最佳权衡。在我们的方法中,我们首先应用基于支持向量机(SVM)的比例预测器来确定不同压缩比下单个视频的最佳比例因子。然后,我们设计了一种新的先验引导恢复重建网络(PGRRN),该网络由多个先验引导处理块(PGPBs)堆叠而成,分两个阶段处理低分辨率(LR)压缩视频。具体而言,在恢复阶段,PGRRN在两个时间相邻帧之间进行精确的运动补偿,并结合编码先验,使PGRRN能够有效地自适应消除压缩损伤内容。在随后的重建阶段,PGPBs将先前的比例尺合并,以实现高质量的任意比例尺超分辨率。大量的实验结果证明了SA-CVSR的有效性,因为与其他CVSR方法相比,它在多个数据集上显示了比特率降低的实质性改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信