运动估计的最优拉格朗日参数:一种提高视频编码性能的低成本有效方法

J. Molinero, Amaya Jiménez, Eduardo Martínez-Enríquez, F. Díaz-de-María
{"title":"运动估计的最优拉格朗日参数:一种提高视频编码性能的低成本有效方法","authors":"J. Molinero, Amaya Jiménez, Eduardo Martínez-Enríquez, F. Díaz-de-María","doi":"10.1109/CONATEL.2011.5958668","DOIUrl":null,"url":null,"abstract":"The most recent video coding standards are usually based on a rate-distortion optimization (RDO) process that has been formulated in terms of an unconstrained Lagrangian optimization. The RDO provides outstanding results in exchange for a high computational cost, especially for the Inter frames, which require a computationally heavy motion estimation (ME) process. In particular, for H.264/AVC, this RDO process allows selecting both the MB partition size and the motion vector. However, as the optimum procedure is not feasible for computational reasons, the ME process uses a simplified rate-distortion (RD) cost function. Therefore, two RDO processes are involved, one for selecting the MB partition size and one for ME. Both RDO processes rely on an Lagrangian formulation and, for practical purposes, the corresponding Lagrangian parameters are related by a simple, experimentally obtained relationship. In this paper, some evidences of the weaknesses of such a relationship between the two Lagrangian parameters are given and a simple effective procedure to improve the R-D encoding performance is proposed according to such weaknesses. The proposed method has been comparatively evaluated with respect to one recently published method, showing significant average performance improvements, above 0.4 dB in terms of PSNR.","PeriodicalId":197632,"journal":{"name":"CONATEL 2011","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2011-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the optimal Lagrangian parameter for motion estimation: A low-cost and effective method for improving video coding performance\",\"authors\":\"J. Molinero, Amaya Jiménez, Eduardo Martínez-Enríquez, F. Díaz-de-María\",\"doi\":\"10.1109/CONATEL.2011.5958668\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The most recent video coding standards are usually based on a rate-distortion optimization (RDO) process that has been formulated in terms of an unconstrained Lagrangian optimization. The RDO provides outstanding results in exchange for a high computational cost, especially for the Inter frames, which require a computationally heavy motion estimation (ME) process. In particular, for H.264/AVC, this RDO process allows selecting both the MB partition size and the motion vector. However, as the optimum procedure is not feasible for computational reasons, the ME process uses a simplified rate-distortion (RD) cost function. Therefore, two RDO processes are involved, one for selecting the MB partition size and one for ME. Both RDO processes rely on an Lagrangian formulation and, for practical purposes, the corresponding Lagrangian parameters are related by a simple, experimentally obtained relationship. In this paper, some evidences of the weaknesses of such a relationship between the two Lagrangian parameters are given and a simple effective procedure to improve the R-D encoding performance is proposed according to such weaknesses. The proposed method has been comparatively evaluated with respect to one recently published method, showing significant average performance improvements, above 0.4 dB in terms of PSNR.\",\"PeriodicalId\":197632,\"journal\":{\"name\":\"CONATEL 2011\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CONATEL 2011\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONATEL.2011.5958668\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CONATEL 2011","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONATEL.2011.5958668","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

最新的视频编码标准通常是基于基于无约束拉格朗日优化而制定的率失真优化(RDO)过程。RDO提供了出色的结果,但代价是高昂的计算成本,特别是对于需要大量计算的帧间运动估计(ME)过程。特别是,对于H.264/AVC,这个RDO过程允许选择MB分区大小和运动向量。然而,由于计算原因,最优程序不可行,ME过程使用简化的率失真(RD)成本函数。因此,涉及到两个RDO进程,一个用于选择MB分区大小,另一个用于选择ME。两个RDO过程都依赖于拉格朗日公式,并且为了实际目的,相应的拉格朗日参数通过一个简单的、实验得到的关系来关联。本文给出了两个拉格朗日参数之间这种关系的一些弱点的证据,并根据这些弱点提出了一种简单有效的改进R-D编码性能的方法。该方法与最近发表的一种方法进行了比较评估,显示出显着的平均性能改进,在PSNR方面超过0.4 dB。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On the optimal Lagrangian parameter for motion estimation: A low-cost and effective method for improving video coding performance
The most recent video coding standards are usually based on a rate-distortion optimization (RDO) process that has been formulated in terms of an unconstrained Lagrangian optimization. The RDO provides outstanding results in exchange for a high computational cost, especially for the Inter frames, which require a computationally heavy motion estimation (ME) process. In particular, for H.264/AVC, this RDO process allows selecting both the MB partition size and the motion vector. However, as the optimum procedure is not feasible for computational reasons, the ME process uses a simplified rate-distortion (RD) cost function. Therefore, two RDO processes are involved, one for selecting the MB partition size and one for ME. Both RDO processes rely on an Lagrangian formulation and, for practical purposes, the corresponding Lagrangian parameters are related by a simple, experimentally obtained relationship. In this paper, some evidences of the weaknesses of such a relationship between the two Lagrangian parameters are given and a simple effective procedure to improve the R-D encoding performance is proposed according to such weaknesses. The proposed method has been comparatively evaluated with respect to one recently published method, showing significant average performance improvements, above 0.4 dB in terms of PSNR.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信