Optimal intra coding of HEVC by structured set prediction mode with discriminative learning

Wenrui Dai, H. Xiong
{"title":"Optimal intra coding of HEVC by structured set prediction mode with discriminative learning","authors":"Wenrui Dai, H. Xiong","doi":"10.1109/VCIP.2012.6410818","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel model on intra-coding for high efficiency video coding (HEVC), which can simultaneously make the set of prediction for block of pixels in an optimal rate-distortion sense. It not only utilizes the spatial statistical correlation for the optimal prediction based on 2-D contexts, but also formulates the data-driven structural interdependencies to make the prediction error coherent with the probability distribution which is favorable for subsequent transform and coding. The so-called structured set prediction model incorporates max-margin Markov network to regulate and reason the multiple prediction in the blocks. The model parameters are learned by discriminating the actual pixel value from the other possible estimates to the maximal margin. Distinguished from the existing methods concerning the minimal prediction error, the Markov network is adaptively derived to maintain the coherence of set of prediction. To be concrete, the proposed model seeks the concurrent optimization of the set of prediction by relating the loss function to the probability distribution of subsequent DCT coefficients. The prediction error is demonstrated to be asymptotically upper bounded by the training error under the decomposable loss function. For validation, we integrate the proposed model into HEVC intra coding and experimental results show obvious improvement of coding performance in terms of BD-rate.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Visual Communications and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP.2012.6410818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper proposes a novel model on intra-coding for high efficiency video coding (HEVC), which can simultaneously make the set of prediction for block of pixels in an optimal rate-distortion sense. It not only utilizes the spatial statistical correlation for the optimal prediction based on 2-D contexts, but also formulates the data-driven structural interdependencies to make the prediction error coherent with the probability distribution which is favorable for subsequent transform and coding. The so-called structured set prediction model incorporates max-margin Markov network to regulate and reason the multiple prediction in the blocks. The model parameters are learned by discriminating the actual pixel value from the other possible estimates to the maximal margin. Distinguished from the existing methods concerning the minimal prediction error, the Markov network is adaptively derived to maintain the coherence of set of prediction. To be concrete, the proposed model seeks the concurrent optimization of the set of prediction by relating the loss function to the probability distribution of subsequent DCT coefficients. The prediction error is demonstrated to be asymptotically upper bounded by the training error under the decomposable loss function. For validation, we integrate the proposed model into HEVC intra coding and experimental results show obvious improvement of coding performance in terms of BD-rate.
基于判别学习的结构化集预测模式的HEVC优化内部编码
本文提出了一种高效视频编码(HEVC)的内编码模型,该模型可以同时在最优的率失真意义下对像素块进行预测集。该方法不仅利用空间统计相关性进行基于二维上下文的最优预测,而且建立了数据驱动的结构依存关系,使预测误差与概率分布一致,有利于后续的变换和编码。所谓的结构化集预测模型采用最大边际马尔可夫网络对块中的多个预测进行调节和推理。通过将实际像素值与其他可能的估计值区分到最大边缘来学习模型参数。与现有预测误差最小的方法不同,该方法自适应地推导了马尔可夫网络,以保持预测集的一致性。具体而言,该模型通过将损失函数与后续DCT系数的概率分布联系起来,寻求预测集的并发优化。在可分解损失函数下,证明了预测误差是训练误差的渐近上界。为了验证,我们将该模型集成到HEVC帧内编码中,实验结果表明,从BD-rate的角度来看,该模型的编码性能有了明显的提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信