Gang He , Chang Wu , Guancheng Quan , Xinquan Lai , Yunsong Li
{"title":"SA-CVSR:任意尺度压缩视频超分辨率","authors":"Gang He , Chang Wu , Guancheng Quan , Xinquan Lai , Yunsong Li","doi":"10.1016/j.patcog.2025.111745","DOIUrl":null,"url":null,"abstract":"<div><div>To mitigate transmission and storage expenses, the existing compressed video super-resolution (CVSR) approaches typically downsample high-resolution (HR) videos before encoding and then restore decoded videos to their original resolution leveraging deep neural networks (DNNs). However, they employ fixed integer scale factors for diverse video types and compression ratios, potentially resulting in suboptimal performance. In this paper, we propose a Scale-Arbitrary Compressed Video Super-Resolution (SA-CVSR) approach to achieve the optimal trade-off between bit-rate and quality. In our approach, we first apply a Support Vector Machine (SVM) based scale predictor to determine the optimal scale factors for an individual video across various compression ratios. Then, we design a novel Priors-Guided Restoration–Reconstruction Network (PGRRN), which is constructed by stacking multiple Priors-Guided Processing Blocks (PGPBs), to process low-resolution (LR) compressed videos in two stages. Specifically, in the restoration stage, PGPBs perform precise motion compensation between two temporally adjacent frames and incorporate the coding prior, which enables PGRRN to effectively eliminate the compression damage content-adaptively. In the subsequent reconstruction stage, PGPBs incorporate the scale prior to achieve high-quality scale-arbitrary super-resolution. Extensive experimental results provide evidence of the effectiveness of SA-CVSR, as it demonstrates a substantial improvement in bit-rate reduction when compared to other CVSR approaches on multiple datasets.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"167 ","pages":"Article 111745"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SA-CVSR: Scale-Arbitrary Compressed Video Super-Resolution\",\"authors\":\"Gang He , Chang Wu , Guancheng Quan , Xinquan Lai , Yunsong Li\",\"doi\":\"10.1016/j.patcog.2025.111745\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>To mitigate transmission and storage expenses, the existing compressed video super-resolution (CVSR) approaches typically downsample high-resolution (HR) videos before encoding and then restore decoded videos to their original resolution leveraging deep neural networks (DNNs). However, they employ fixed integer scale factors for diverse video types and compression ratios, potentially resulting in suboptimal performance. In this paper, we propose a Scale-Arbitrary Compressed Video Super-Resolution (SA-CVSR) approach to achieve the optimal trade-off between bit-rate and quality. In our approach, we first apply a Support Vector Machine (SVM) based scale predictor to determine the optimal scale factors for an individual video across various compression ratios. Then, we design a novel Priors-Guided Restoration–Reconstruction Network (PGRRN), which is constructed by stacking multiple Priors-Guided Processing Blocks (PGPBs), to process low-resolution (LR) compressed videos in two stages. Specifically, in the restoration stage, PGPBs perform precise motion compensation between two temporally adjacent frames and incorporate the coding prior, which enables PGRRN to effectively eliminate the compression damage content-adaptively. In the subsequent reconstruction stage, PGPBs incorporate the scale prior to achieve high-quality scale-arbitrary super-resolution. Extensive experimental results provide evidence of the effectiveness of SA-CVSR, as it demonstrates a substantial improvement in bit-rate reduction when compared to other CVSR approaches on multiple datasets.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"167 \",\"pages\":\"Article 111745\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325004054\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325004054","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
SA-CVSR: Scale-Arbitrary Compressed Video Super-Resolution
To mitigate transmission and storage expenses, the existing compressed video super-resolution (CVSR) approaches typically downsample high-resolution (HR) videos before encoding and then restore decoded videos to their original resolution leveraging deep neural networks (DNNs). However, they employ fixed integer scale factors for diverse video types and compression ratios, potentially resulting in suboptimal performance. In this paper, we propose a Scale-Arbitrary Compressed Video Super-Resolution (SA-CVSR) approach to achieve the optimal trade-off between bit-rate and quality. In our approach, we first apply a Support Vector Machine (SVM) based scale predictor to determine the optimal scale factors for an individual video across various compression ratios. Then, we design a novel Priors-Guided Restoration–Reconstruction Network (PGRRN), which is constructed by stacking multiple Priors-Guided Processing Blocks (PGPBs), to process low-resolution (LR) compressed videos in two stages. Specifically, in the restoration stage, PGPBs perform precise motion compensation between two temporally adjacent frames and incorporate the coding prior, which enables PGRRN to effectively eliminate the compression damage content-adaptively. In the subsequent reconstruction stage, PGPBs incorporate the scale prior to achieve high-quality scale-arbitrary super-resolution. Extensive experimental results provide evidence of the effectiveness of SA-CVSR, as it demonstrates a substantial improvement in bit-rate reduction when compared to other CVSR approaches on multiple datasets.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.