Deep Frame Interpolation for Video Compression

2019 Data Compression Conference (DCC) Pub Date : 2019-03-01 DOI:10.1109/DCC.2019.00068

Jean Bégaint, Franck Galpin, P. Guillotel, C. Guillemot

引用次数: 12

Abstract

Deep neural networks have been recently proposed to solve video interpolation tasks. Given a past and future frame, such networks can be trained to successfully predict the intermediate frame(s). In the context of video compression, these architectures could be useful as an additional inter-prediction mode. Current inter-prediction methods rely on block-matching techniques to estimate the motion between consecutive frames. This approach has severe limitations for handling complex non-translational motions, and is still limited to block-based motion vectors. This paper presents a deep frame interpolation network for video compression aiming at solving the previous limitations, i.e. able to cope with all types of geometrical deformations by providing a dense motion compensation. Experiments with the classical bi-directional hierarchical video coding structure demonstrate the efficiency of the proposed approach over the traditional tools of the HEVC codec.

查看原文本刊更多论文

视频压缩的深帧插值

深度神经网络最近被提出用于解决视频插值任务。给定一个过去和未来的框架，这样的网络可以被训练来成功地预测中间框架。在视频压缩的背景下，这些架构可以作为额外的相互预测模式。目前的相互预测方法依赖于块匹配技术来估计连续帧之间的运动。这种方法在处理复杂的非平移运动方面有严重的局限性，并且仍然局限于基于块的运动向量。本文提出了一种用于视频压缩的深度帧插值网络，旨在解决以往的局限性，即通过提供密集的运动补偿来处理各种类型的几何变形。用经典的双向分层视频编码结构进行的实验表明，该方法比传统的HEVC编解码器工具更有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 Data Compression Conference (DCC)

自引率

0.00%

发文量