Low encoding overhead ultra-low latency streaming via HESP through sparse initialization streams

Pieter-Jan Speelmans
{"title":"Low encoding overhead ultra-low latency streaming via HESP through sparse initialization streams","authors":"Pieter-Jan Speelmans","doi":"10.1145/3510450.3517294","DOIUrl":null,"url":null,"abstract":"HESP, the High Efficiency Streaming Protocol [4], realizes ultra-low latencies and ultra-short start-up times by combining two feeds, the keyframe-only Initialization Stream and the ultra-low latency CMAF-CTE Continuation Stream. HESP uses a keyframe from the Initialization Stream to start playback (via keyframe injection) of the Continuation Stream extremely close to the live edge. In previous research [5], the impact of the HESP keyframe injection on the video quality has been proven to be very low or even negligible. In contrast to the trivial double encoding for each quality in the bitrate ladder, in this paper we show that the overhead of the generation of the keyframe-only Initialization Streams can be reduced. We designed an approach in which the frequency of keyframes in the Initialization Streams is defined by a trade-off between the encoding overhead and two metrics in the viewing QoE: start-up time and time that it takes to switch to the highest feasible video quality of the ABR ladder. More specifically, for each quality Qi, fi is defined such that (i) switching to Qi, either for start-up or for switching to Qi as a higher quality, takes [EQUATION] additional delay, and (ii) there always is a Qi, lower than Qcurrent (unless Qcurrent is the lowest quality) to which the player can switch down instantly, which is needed in case of network problems. The resulting impact on the viewer QoE is characterizedby occasional (whenever an ABR switch to a higher quality is needed) short intervals [EQUATION] during which playback potentially is done at a lower than feasible video quality. Based on measurements, the proposed approach results in an overhead when encoding Initialization Streams of only 15 to 20%. Compared to \"standard\" HESP, the viewer QoE reduction is hardly noticeable.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st Mile-High Video Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510450.3517294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

HESP, the High Efficiency Streaming Protocol [4], realizes ultra-low latencies and ultra-short start-up times by combining two feeds, the keyframe-only Initialization Stream and the ultra-low latency CMAF-CTE Continuation Stream. HESP uses a keyframe from the Initialization Stream to start playback (via keyframe injection) of the Continuation Stream extremely close to the live edge. In previous research [5], the impact of the HESP keyframe injection on the video quality has been proven to be very low or even negligible. In contrast to the trivial double encoding for each quality in the bitrate ladder, in this paper we show that the overhead of the generation of the keyframe-only Initialization Streams can be reduced. We designed an approach in which the frequency of keyframes in the Initialization Streams is defined by a trade-off between the encoding overhead and two metrics in the viewing QoE: start-up time and time that it takes to switch to the highest feasible video quality of the ABR ladder. More specifically, for each quality Qi, fi is defined such that (i) switching to Qi, either for start-up or for switching to Qi as a higher quality, takes [EQUATION] additional delay, and (ii) there always is a Qi, lower than Qcurrent (unless Qcurrent is the lowest quality) to which the player can switch down instantly, which is needed in case of network problems. The resulting impact on the viewer QoE is characterizedby occasional (whenever an ABR switch to a higher quality is needed) short intervals [EQUATION] during which playback potentially is done at a lower than feasible video quality. Based on measurements, the proposed approach results in an overhead when encoding Initialization Streams of only 15 to 20%. Compared to "standard" HESP, the viewer QoE reduction is hardly noticeable.
低编码开销超低延迟流通过HESP通过稀疏初始化流
HESP,即高效流协议[4],通过结合两个源,即仅关键帧初始化流和超低延迟CMAF-CTE延续流,实现了超低延迟和超短启动时间。HESP使用来自初始化流的关键帧来开始播放(通过关键帧注入)非常接近活动边缘的延续流。在以往的研究[5]中,HESP关键帧注入对视频质量的影响已经被证明是非常低甚至可以忽略不计的。与在比特率阶梯中对每个质量进行琐碎的双重编码相比,在本文中,我们证明了生成仅关键帧初始化流的开销可以减少。我们设计了一种方法,其中初始化流中关键帧的频率由编码开销和观看QoE中的两个指标之间的权衡来定义:启动时间和切换到ABR阶梯的最高可行视频质量所需的时间。更具体地说,对于每一个Qi质量,fi的定义是(i)切换到Qi,无论是启动还是切换到更高质量的Qi,都需要额外的延迟,(ii)总是有一个Qi低于Qcurrent(除非Qcurrent是最低质量的),玩家可以立即切换到这个Qi,这是在网络问题的情况下所需要的。由此产生的对观看者QoE的影响表现为偶尔(每当ABR切换到更高的质量时)短间隔[等式],在此期间播放可能以低于可行的视频质量完成。根据测量结果,当对初始化流进行编码时,所建议的方法只会产生15%到20%的开销。与“标准”HESP相比,观看者QoE的减少几乎不明显。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信