The disparity between optimal and practical Lagrangian multiplier estimation in video encoders

IF 1.3 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
D. Ringis, Vibhoothi, François Pitié, A. Kokaram
{"title":"The disparity between optimal and practical Lagrangian multiplier estimation in video encoders","authors":"D. Ringis, Vibhoothi, François Pitié, A. Kokaram","doi":"10.3389/frsip.2023.1205104","DOIUrl":null,"url":null,"abstract":"With video streaming making up 80% of the global internet bandwidth, the need to deliver high-quality video at low bitrate, combined with the high complexity of modern codecs, has led to the idea of a per-clip optimisation approach in transcoding. In this paper, we revisit the Lagrangian multiplier parameter, which is at the core of rate-distortion optimisation. Currently, video encoders use prediction models to set this parameter but these models are agnostic to the video at hand. We explore the gains that could be achieved using a per-clip direct-search optimisation of the Lagrangian multiplier parameter. We evaluate this optimisation framework on a much larger corpus of videos than that has been attempted by previous research. Our results show that per-clip optimisation of the Lagrangian multiplier leads to BD-Rate average improvements of 1.87% for x265 across a 10 k clip corpus of modern videos, and up to 25% in a single clip. Average improvements of 0.69% are reported for libaom-av1 on a subset of 100 clips. However, we show that a per-clip, per-frame-type optimisation of λ for libaom-av1 can increase these average gains to 2.5% and up to 14.9% on a single clip. Our optimisation scheme requires about 50–250 additional encodes per-clip but we show that significant speed-up can be made using proxy videos in the optimisation. These computational gains (of up to ×200) incur a slight loss to BD-Rate improvement because the optimisation is conducted at lower resolutions. Overall, this paper highlights the value of re-examining the estimation of the Lagrangian multiplier in modern codecs as there are significant gains still available without changing the tools used in the standards.","PeriodicalId":93557,"journal":{"name":"Frontiers in signal processing","volume":"25 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in signal processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frsip.2023.1205104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

With video streaming making up 80% of the global internet bandwidth, the need to deliver high-quality video at low bitrate, combined with the high complexity of modern codecs, has led to the idea of a per-clip optimisation approach in transcoding. In this paper, we revisit the Lagrangian multiplier parameter, which is at the core of rate-distortion optimisation. Currently, video encoders use prediction models to set this parameter but these models are agnostic to the video at hand. We explore the gains that could be achieved using a per-clip direct-search optimisation of the Lagrangian multiplier parameter. We evaluate this optimisation framework on a much larger corpus of videos than that has been attempted by previous research. Our results show that per-clip optimisation of the Lagrangian multiplier leads to BD-Rate average improvements of 1.87% for x265 across a 10 k clip corpus of modern videos, and up to 25% in a single clip. Average improvements of 0.69% are reported for libaom-av1 on a subset of 100 clips. However, we show that a per-clip, per-frame-type optimisation of λ for libaom-av1 can increase these average gains to 2.5% and up to 14.9% on a single clip. Our optimisation scheme requires about 50–250 additional encodes per-clip but we show that significant speed-up can be made using proxy videos in the optimisation. These computational gains (of up to ×200) incur a slight loss to BD-Rate improvement because the optimisation is conducted at lower resolutions. Overall, this paper highlights the value of re-examining the estimation of the Lagrangian multiplier in modern codecs as there are significant gains still available without changing the tools used in the standards.
视频编码器中最优拉格朗日乘子估计与实际拉格朗日乘子估计的差异
视频流占全球互联网带宽的80%,需要以低比特率传输高质量的视频,再加上现代编解码器的高度复杂性,导致了转码中每个片段优化方法的想法。在本文中,我们重新审视拉格朗日乘子参数,这是在率失真优化的核心。目前,视频编码器使用预测模型来设置该参数,但这些模型对手头的视频是不可知的。我们探索了使用拉格朗日乘子参数的每个片段直接搜索优化可以实现的增益。我们在一个比以前的研究尝试过的更大的视频语料库上评估这个优化框架。我们的结果表明,拉格朗日乘子的每个片段优化导致x265在10 k现代视频剪辑语料库中的BD-Rate平均提高1.87%,在单个剪辑中高达25%。在100个片段的子集上,libaom-av1的平均改进率为0.69%。然而,我们表明,针对libaom-av1的每个片段,每个帧类型的λ优化可以在单个片段上将这些平均增益提高到2.5%和高达14.9%。我们的优化方案需要每个剪辑大约50-250个额外的编码,但我们表明,在优化中使用代理视频可以实现显著的加速。由于优化是在较低的分辨率下进行的,因此这些计算增益(高达×200)会导致BD-Rate改进的轻微损失。总的来说,本文强调了在现代编解码器中重新检查拉格朗日乘子估计的价值,因为在不改变标准中使用的工具的情况下仍然可以获得显着的收益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信