基于残基超分辨卷积神经网络的H.266 VVC内编码复杂度降低

IF 4.1 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
A Dhanalakshmi , L Balaji , C Raja , Jayant Giri , Mubarak Alrashoud
{"title":"基于残基超分辨卷积神经网络的H.266 VVC内编码复杂度降低","authors":"A Dhanalakshmi ,&nbsp;L Balaji ,&nbsp;C Raja ,&nbsp;Jayant Giri ,&nbsp;Mubarak Alrashoud","doi":"10.1016/j.icte.2025.04.010","DOIUrl":null,"url":null,"abstract":"<div><div>Versatile Video Coding (VVC) promised to provide the same video quality as HEVC with 50 % bitrate reduction, which was introduced in 2020. Our suggested method for VVC Intra-coding is residue super-resolution convolutional neural network (RSR-CNN) utilizing downsampling and upsampling procedures. We present an effective complexity reduced VVC intra-coding scheme based on residue SR-CNN. Reducing an original video's resolution in both the vertical and horizontal directions is all that is required to execute down sampling. Increasing the video dimensions for improved visual quality, convolutional neural networks are utilized in the upsampling process to create residue super-resolution. Specifically, for every block, we train a CNN model to perform residue SR after downsampling and compressing the residue at low resolution, and then we carry out motion estimation (ME) and motion compensation (MC) to extract the residue. Using the MC prediction signal, a new residue SR-CNN is designed. Additionally, this work comprehensively examines the complexity and performance of VVC intra-coding tools and integrates them with the residue SR-CNN method. The experiments demonstrate a substantial time savings of 40 % in encoding with BDBR coding gains of 4.2 %, and 2.9 % in AI and RA configurations respectively.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"11 3","pages":"Pages 460-466"},"PeriodicalIF":4.1000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Residue super-resolution convolutional neural network based complexity reduction for H.266 VVC intra-coding\",\"authors\":\"A Dhanalakshmi ,&nbsp;L Balaji ,&nbsp;C Raja ,&nbsp;Jayant Giri ,&nbsp;Mubarak Alrashoud\",\"doi\":\"10.1016/j.icte.2025.04.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Versatile Video Coding (VVC) promised to provide the same video quality as HEVC with 50 % bitrate reduction, which was introduced in 2020. Our suggested method for VVC Intra-coding is residue super-resolution convolutional neural network (RSR-CNN) utilizing downsampling and upsampling procedures. We present an effective complexity reduced VVC intra-coding scheme based on residue SR-CNN. Reducing an original video's resolution in both the vertical and horizontal directions is all that is required to execute down sampling. Increasing the video dimensions for improved visual quality, convolutional neural networks are utilized in the upsampling process to create residue super-resolution. Specifically, for every block, we train a CNN model to perform residue SR after downsampling and compressing the residue at low resolution, and then we carry out motion estimation (ME) and motion compensation (MC) to extract the residue. Using the MC prediction signal, a new residue SR-CNN is designed. Additionally, this work comprehensively examines the complexity and performance of VVC intra-coding tools and integrates them with the residue SR-CNN method. The experiments demonstrate a substantial time savings of 40 % in encoding with BDBR coding gains of 4.2 %, and 2.9 % in AI and RA configurations respectively.</div></div>\",\"PeriodicalId\":48526,\"journal\":{\"name\":\"ICT Express\",\"volume\":\"11 3\",\"pages\":\"Pages 460-466\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ICT Express\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2405959525000554\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICT Express","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405959525000554","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

多功能视频编码(VVC)承诺提供与HEVC相同的视频质量,比特率降低50%,这是在2020年推出的。我们建议的VVC内编码方法是残基超分辨率卷积神经网络(RSR-CNN),利用下采样和上采样过程。提出了一种有效的基于残差SR-CNN的降低复杂度的VVC内编码方案。在垂直和水平方向上降低原始视频的分辨率是执行向下采样所需要的。为了提高视频的视觉质量,在上采样过程中使用卷积神经网络来产生残差超分辨率。具体来说,对于每个块,我们训练一个CNN模型,在低分辨率下对残差进行下采样和压缩后进行残差SR,然后进行运动估计(ME)和运动补偿(MC)来提取残差。利用MC预测信号,设计了一种新的残差SR-CNN。此外,本工作全面考察了VVC内编码工具的复杂性和性能,并将其与残差SR-CNN方法相结合。实验表明,在BDBR编码增益为4.2%的情况下,编码时间节省了40%,在AI和RA配置下,编码增益分别为2.9%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Residue super-resolution convolutional neural network based complexity reduction for H.266 VVC intra-coding
Versatile Video Coding (VVC) promised to provide the same video quality as HEVC with 50 % bitrate reduction, which was introduced in 2020. Our suggested method for VVC Intra-coding is residue super-resolution convolutional neural network (RSR-CNN) utilizing downsampling and upsampling procedures. We present an effective complexity reduced VVC intra-coding scheme based on residue SR-CNN. Reducing an original video's resolution in both the vertical and horizontal directions is all that is required to execute down sampling. Increasing the video dimensions for improved visual quality, convolutional neural networks are utilized in the upsampling process to create residue super-resolution. Specifically, for every block, we train a CNN model to perform residue SR after downsampling and compressing the residue at low resolution, and then we carry out motion estimation (ME) and motion compensation (MC) to extract the residue. Using the MC prediction signal, a new residue SR-CNN is designed. Additionally, this work comprehensively examines the complexity and performance of VVC intra-coding tools and integrates them with the residue SR-CNN method. The experiments demonstrate a substantial time savings of 40 % in encoding with BDBR coding gains of 4.2 %, and 2.9 % in AI and RA configurations respectively.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ICT Express
ICT Express Multiple-
CiteScore
10.20
自引率
1.90%
发文量
167
审稿时长
35 weeks
期刊介绍: The ICT Express journal published by the Korean Institute of Communications and Information Sciences (KICS) is an international, peer-reviewed research publication covering all aspects of information and communication technology. The journal aims to publish research that helps advance the theoretical and practical understanding of ICT convergence, platform technologies, communication networks, and device technologies. The technology advancement in information and communication technology (ICT) sector enables portable devices to be always connected while supporting high data rate, resulting in the recent popularity of smartphones that have a considerable impact in economic and social development.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信