A Data-Trained, Affine-Linear Intra-Picture Prediction in the Frequency Domain

2019 Picture Coding Symposium (PCS) Pub Date : 2019-11-01 DOI:10.1109/PCS48520.2019.8954559

Michael Schäfer, Björn Stallenberger, Jonathan Pfaff, Philipp Helle, H. Schwarz, D. Marpe, T. Wiegand

{"title":"A Data-Trained, Affine-Linear Intra-Picture Prediction in the Frequency Domain","authors":"Michael Schäfer, Björn Stallenberger, Jonathan Pfaff, Philipp Helle, H. Schwarz, D. Marpe, T. Wiegand","doi":"10.1109/PCS48520.2019.8954559","DOIUrl":null,"url":null,"abstract":"This paper presents a data-driven training of affine- linear predictors which perform intra-picture prediction for video coding. The trained predictors use a single line of reconstructed boundary samples as input like the conventional intra prediction modes. For large blocks, the presented predictors initially transform the input samples via Discrete Cosine Transform. This allows to omit high frequency coefficients and consequently reduce the input dimension. The output is the result of a single matrix-vector multiplication and offset addition. Here, the predictors only construct certain coefficients in the frequency domain. The final prediction signal is then obtained by inverse transform. The coefficients of the prediction modes need to be stored in advance, requiring 0.273 MB of memory. The training employs a recursive block partitioning, where the loss function targets to approximate the bit-rate of the DCT-transformed block residuals. The obtained predictors are incorporated into the Versatile Video Coding Test Model 4. The authors report All- Intra bit-rate savings ranging from 0.7% to 2.0% across different resolutions in terms of the Bjøntegaard-Delta bit rate (BD-rate).","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Picture Coding Symposium (PCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCS48520.2019.8954559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

This paper presents a data-driven training of affine- linear predictors which perform intra-picture prediction for video coding. The trained predictors use a single line of reconstructed boundary samples as input like the conventional intra prediction modes. For large blocks, the presented predictors initially transform the input samples via Discrete Cosine Transform. This allows to omit high frequency coefficients and consequently reduce the input dimension. The output is the result of a single matrix-vector multiplication and offset addition. Here, the predictors only construct certain coefficients in the frequency domain. The final prediction signal is then obtained by inverse transform. The coefficients of the prediction modes need to be stored in advance, requiring 0.273 MB of memory. The training employs a recursive block partitioning, where the loss function targets to approximate the bit-rate of the DCT-transformed block residuals. The obtained predictors are incorporated into the Versatile Video Coding Test Model 4. The authors report All- Intra bit-rate savings ranging from 0.7% to 2.0% across different resolutions in terms of the Bjøntegaard-Delta bit rate (BD-rate).

查看原文本刊更多论文

频域数据训练的仿射线性图像内预测

本文提出了一种数据驱动的仿射线性预测器训练方法，用于视频编码的图像内预测。训练后的预测器使用单线重构边界样本作为输入，就像传统的内部预测模式一样。对于较大的块，所提出的预测器最初通过离散余弦变换变换输入样本。这允许省略高频系数，从而降低输入维数。输出是单个矩阵向量乘法和偏移量加法的结果。在这里，预测器只在频域中构造某些系数。然后通过逆变换得到最终的预测信号。预测模式的系数需要提前存储，需要0.273 MB的内存。训练采用递归块划分，其中损失函数的目标是近似dct变换后的块残差的比特率。得到的预测因子被纳入多功能视频编码测试模型4。作者报告说，在不同分辨率下，就Bjøntegaard-Delta比特率(BD-rate)而言，All- Intra比特率可节省0.7%至2.0%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 Picture Coding Symposium (PCS)

自引率

0.00%

发文量