Neural Network based Inter bi-prediction Blending

2021 International Conference on Visual Communications and Image Processing (VCIP) Pub Date : 2021-12-05 DOI:10.1109/VCIP53242.2021.9675422

Franck Galpin, P. Bordes, Thierry Dumas, Pavel Nikitin, F. L. Léannec

引用次数: 0

Abstract

This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, the motion compensation of blocks from already decoded reference pictures stands out as the principal tool used to predict the current frame. Especially, the bi-prediction, in which a block is obtained by averaging two different motion-compensated prediction blocks, significantly improves the final temporal prediction accuracy. In this context, we introduce a simple neural network that further improves the blending operation. A complexity balance, both in terms of network size and encoder mode selection, is carried out. Extensive tests on top of the recently standardized VVC codec are performed and show a BD-rate improvement of −1.4% in random access configuration for a network size of fewer than 10k parameters. We also propose a simple CPU-based implementation and direct network quantization to assess the complexity/gains tradeoff in a conventional codec framework.

查看原文本刊更多论文

基于神经网络的双向预测混合

提出了一种基于学习的改进视频编码双预测的方法。在传统的视频编码解决方案中，已经解码的参考图像块的运动补偿是用于预测当前帧的主要工具。特别是通过对两个不同的运动补偿预测块进行平均得到一个块的双预测，显著提高了最终的时间预测精度。在这种情况下，我们引入了一个简单的神经网络，进一步改进了混合操作。在网络大小和编码器模式选择方面进行了复杂性平衡。在最近标准化的VVC编解码器之上进行了广泛的测试，并显示在随机访问配置中，对于小于10k参数的网络大小，bd速率提高了- 1.4%。我们还提出了一个简单的基于cpu的实现和直接的网络量化来评估传统编解码器框架中的复杂性/增益权衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Visual Communications and Image Processing (VCIP)

自引率

0.00%

发文量