A. Tanizawa, Shin-ichiro Koto, Takeshi Chujoh
{"title":"Fast rate‐distortion optimized coding mode decision for H.264","authors":"A. Tanizawa, Shin-ichiro Koto, Takeshi Chujoh","doi":"10.1002/ECJC.20342","DOIUrl":null,"url":null,"abstract":"In H.264/MPEG-4 AVC a wide range of different prediction block forms and prediction signal generation methods are available. By selecting an optimal combination of coding modes from the multiple possible combinations of such modes, improvements in coding efficiency can be achieved. However, when using the rate-distortion optimized coding mode decision method based on the Lagrange multipliers, at the same time as seeing a significant improvement in coding efficiency, we are faced with the problem that the mode decision procedure becomes extremely demanding computationally. The H.264/MPEG-4 AVC high profile introduces adaptive block size transformations thereby making the number of combinations of coding mode that can be selected even larger than under the main profile. In this paper we investigate a method for hierarchically and adaptively reducing the number of mode combinations. Specifically we propose a method for quickly deciding the coding mode while limiting the reductions in coding efficiency by the correlation information between two mode decision cost functions in accordance with a quantization parameter. The results of experiments confirm that by using the proposed method, the encoding time excluding the motion search can be reduced by up to 4 times for the main profile and by up to 7 times for the high profile as compared to rate-distortion optimized coding mode decision. © 2007 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 90(9): 41– 55, 2007; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjc.20342","PeriodicalId":100407,"journal":{"name":"Electronics and Communications in Japan (Part III: Fundamental Electronic Science)","volume":"26 1","pages":"41-55"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronics and Communications in Japan (Part III: Fundamental Electronic Science)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/ECJC.20342","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
H.264的快速率失真优化编码模式决策
在H.264/MPEG-4 AVC中,有各种不同的预测块形式和预测信号生成方法。通过从多种可能的编码模式组合中选择一种最优的编码模式组合,可以实现编码效率的提高。然而,在使用基于拉格朗日乘子的率失真优化编码模式决策方法时,在显著提高编码效率的同时,也面临着模式决策过程对计算量要求极高的问题。H.264/MPEG-4 AVC高调引入了自适应块大小转换,从而使得可以选择的编码模式组合的数量甚至比在主要配置文件下更大。本文研究了一种分层自适应减少模态组合数的方法。具体来说,我们提出了一种根据量化参数利用两种模式决策代价函数之间的相关信息来快速确定编码模式,同时限制编码效率降低的方法。实验结果表明,与基于率失真优化的编码模式决策相比,采用该方法剔除运动搜索后,主轮廓的编码时间最多减少4倍,高轮廓的编码时间最多减少7倍。©2007 Wiley期刊公司电子工程学报,2009,29 (3):393 - 398;在线发表于Wiley InterScience (www.interscience.wiley.com)。DOI 10.1002 / ecjc.20342
本文章由计算机程序翻译,如有差异,请以英文原文为准。