FID: Frame Interpolation and DCT-based Video Compression

2020 IEEE International Symposium on Multimedia (ISM) Pub Date : 2020-12-01 DOI:10.1109/ISM.2020.00045

Yeganeh Jalalpour, Li-Yun Wang, W. Feng, Feng Liu

引用次数: 2

Abstract

In this paper, we present a hybrid video compression technique that combines the advantages of residual coding techniques found in traditional DCT-based video compression and learning-based video frame interpolation to reduce the amount of residual data that needs to be compressed. Learning-based frame interpolation techniques use machine learning algorithms to predict frames but have difficulty with uncovered areas and non-linear motion. This approach uses DCT-based residual coding only on areas that are difficult for video interpolation and provides tunable compression for such areas through an adaptive selection of data to be encoded. Experimental data for both PSNR and the newer video multi-method assessment fusion (VMAF) metrics are provided. Our results show that we can reduce the amount of data required to represent a video stream compared with traditional video coding while outperforming video frame interpolation techniques in quality.

查看原文本刊更多论文

FID:帧插值和基于dct的视频压缩

在本文中，我们提出了一种混合视频压缩技术，该技术结合了传统基于dct的视频压缩和基于学习的视频帧插值中残差编码技术的优点，以减少需要压缩的残差数据量。基于学习的帧插值技术使用机器学习算法来预测帧，但在未覆盖区域和非线性运动方面存在困难。该方法仅对难以进行视频插值的区域使用基于dct的残差编码，并通过自适应选择要编码的数据为这些区域提供可调压缩。给出了PSNR和较新的视频多方法评估融合(VMAF)指标的实验数据。我们的结果表明，与传统的视频编码相比，我们可以减少表示视频流所需的数据量，同时在质量上优于视频帧插值技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE International Symposium on Multimedia (ISM)

自引率

0.00%

发文量