Towards Next Generation Video Coding: from Neural Network Based Predictive Coding to In-Loop Filtering

2023 IEEE International Symposium on Circuits and Systems (ISCAS) Pub Date : 2023-05-21 DOI:10.1109/ISCAS46773.2023.10181462

Yanchen Zhao, Suhong Wang, Kai Lin, Meng Lei, Chuanmin Jia, Shanshe Wang, Siwei Ma

{"title":"Towards Next Generation Video Coding: from Neural Network Based Predictive Coding to In-Loop Filtering","authors":"Yanchen Zhao, Suhong Wang, Kai Lin, Meng Lei, Chuanmin Jia, Shanshe Wang, Siwei Ma","doi":"10.1109/ISCAS46773.2023.10181462","DOIUrl":null,"url":null,"abstract":"Audio Video Coding Standard (AVS) Intelligent Coding Group mainly studies video coding tools based on neural network technology and its potential benefit for next generation video coding. Extensive efforts have been dedicated to the research on neural network (NN) based coding tools. In this paper, we present a novel NN based video coding framework by leveraging the supervised trained NN models for multiple modules in the hybrid coding framework, from the predictive coding to the in-loop filtering. Specifically, NN based intra prediction models the non-linear mapping from contextual pixels to the predictions. The inter prediction efficiency is enhanced by introducing a virtual reference frame (VRF) network. The convolutional neural network based loop filtering (CNNLF) with discriminative model selection exploits the texture adaptivity. The experimental results show that the CNNLF, NN Intra, and VRF models can bring 8.60%, 1.02%, and 2.26% luma BD-rate reduction under random access (RA) configuration compared with AVS reference software HPM13.0. Additional experiments with the combined three NN coding tools reveal that around 13% YUV BD-rate reduction could be obtained. The proposed framework opens novel sights for next generation video coding from the intelligent coding perspective.","PeriodicalId":177320,"journal":{"name":"2023 IEEE International Symposium on Circuits and Systems (ISCAS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Symposium on Circuits and Systems (ISCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAS46773.2023.10181462","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Audio Video Coding Standard (AVS) Intelligent Coding Group mainly studies video coding tools based on neural network technology and its potential benefit for next generation video coding. Extensive efforts have been dedicated to the research on neural network (NN) based coding tools. In this paper, we present a novel NN based video coding framework by leveraging the supervised trained NN models for multiple modules in the hybrid coding framework, from the predictive coding to the in-loop filtering. Specifically, NN based intra prediction models the non-linear mapping from contextual pixels to the predictions. The inter prediction efficiency is enhanced by introducing a virtual reference frame (VRF) network. The convolutional neural network based loop filtering (CNNLF) with discriminative model selection exploits the texture adaptivity. The experimental results show that the CNNLF, NN Intra, and VRF models can bring 8.60%, 1.02%, and 2.26% luma BD-rate reduction under random access (RA) configuration compared with AVS reference software HPM13.0. Additional experiments with the combined three NN coding tools reveal that around 13% YUV BD-rate reduction could be obtained. The proposed framework opens novel sights for next generation video coding from the intelligent coding perspective.

查看原文本刊更多论文

迈向新一代视频编码:从基于神经网络的预测编码到环内滤波

音频视频编码标准(AVS)智能编码组主要研究基于神经网络技术的视频编码工具及其在下一代视频编码中的潜在优势。基于神经网络的编码工具已经得到了广泛的研究。在本文中，我们提出了一种新的基于神经网络的视频编码框架，利用混合编码框架中多个模块的监督训练神经网络模型，从预测编码到环内滤波。具体来说，基于神经网络的内预测对上下文像素到预测的非线性映射进行建模。通过引入虚拟参考帧(VRF)网络，提高了相互预测的效率。基于判别模型选择的卷积神经网络环路滤波(CNNLF)利用了纹理的自适应性。实验结果表明，与AVS参考软件HPM13.0相比，在随机接入(RA)配置下，CNNLF、NN Intra和VRF模型的亮度亮度降低率分别为8.60%、1.02%和2.26%。结合三种神经网络编码工具的额外实验表明，可以获得约13%的YUV bd率降低。该框架从智能编码的角度为下一代视频编码开辟了新的视野。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Symposium on Circuits and Systems (ISCAS)

自引率

0.00%

发文量