Reinforcement learning for video encoder control in HEVC

2017 International Conference on Systems, Signals and Image Processing (IWSSIP) Pub Date : 2017-05-22 DOI:10.1109/IWSSIP.2017.7965586

Philipp Helle, H. Schwarz, T. Wiegand, K. Müller

引用次数: 15

Abstract

In todays video compression systems, the encoder typically follows an optimization procedure to find a compressed representation of the video signal. While primary optimization criteria are bit rate and image distortion, low complexity of this procedure may also be of importance in some applications, making complexity a third objective. We approach this problem by treating the encoding procedure as a decision process in time and make it amenable to reinforcement learning. Our learning algorithm computes a strategy in a compact functional representation, which is then employed in the video encoder to control its search. By including measured execution time into the reinforcement signal with a lagrangian weight, we realize a trade-off between RD-performance and computational complexity controlled by a single parameter. Using the reference software test model (HM) of the HEVC video coding standard, we show that over half the encoding time can be saved at the same RD-performance.

查看原文本刊更多论文

HEVC中视频编码器控制的强化学习

在今天的视频压缩系统中，编码器通常遵循一个优化过程来找到视频信号的压缩表示。虽然主要的优化标准是比特率和图像失真，但在某些应用中，该过程的低复杂性也可能很重要，使复杂性成为第三个目标。我们通过将编码过程视为一个及时的决策过程来解决这个问题，并使其易于强化学习。我们的学习算法在一个紧凑的函数表示中计算一个策略，然后在视频编码器中使用该策略来控制其搜索。通过将测量的执行时间包含在具有拉格朗日权值的增强信号中，我们实现了在单参数控制的rd性能和计算复杂度之间的权衡。使用HEVC视频编码标准的参考软件测试模型(HM)，我们证明在相同的rd性能下可以节省一半以上的编码时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on Systems, Signals and Image Processing (IWSSIP)

自引率

0.00%

发文量