{"title":"Improving streaming quality and bitrate efficiency with dynamic resolution selection","authors":"X. Ducloux, Patrick Gendron, T. Fautier","doi":"10.1145/3510450.3517304","DOIUrl":"https://doi.org/10.1145/3510450.3517304","url":null,"abstract":"Dynamic Resolution Selection is a technology that has been deployed by Netflix with its per-scene encoding mechanism applied to VOD assets. The technology is based on a posteriori analysis of all the encoded resolutions to determine the best resolution for a given scene, in terms of quality and bandwidth used, based on VMAF analysis. It cannot be applied to live content, as it would require too much processing power and can't be used in real time. The method proposed in this paper is based on a machine learning (ML) mechanism that learns how to pick the best resolution to be encoded in a supervised learning environment. At run time, using the already existing pre-processing stage, the live encoder can decide on the best resolution to encode, without adding any processing complexity or delay. This results in higher quality of experience (QoE) or lower bitrate, as well as lower CPU footprint vs. a classical fixed ladder approach. This paper will present the results obtained for live HD or 4K content delivery across different networks, including classical TS (DVB), native IP (ATSC 3.0) and ABR (DASH/HLS). In addition, the paper will report on the interoperability results of tested devices.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"215 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121112820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On multiple media representations and CDN performance","authors":"Y. Reznik, Thiago Teixeira, Robert Peck","doi":"10.1145/3510450.3517320","DOIUrl":"https://doi.org/10.1145/3510450.3517320","url":null,"abstract":"This paper proposes a mathematical model describing the effects of using multiple media representations on CDN performance in HTTP-based streaming systems. Specifically, we look at cases of using multiple versions of the same content packaged differently and derive an asymptotic formula for CDN cache-miss probability considering parameters of the content's distribution and the distribution of formats used for packaging and delivery. We then study the validity of this proposed formula by considering statistics collected for several streaming deployments using mixed HLS and DASH packaging and show that it predicts the experimentally observed data reasonably well. We further discuss several possible extensions and applications of this proposed model.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121358162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluation of MPEG-5 part 2 (LCEVC) for live gaming video streaming applications","authors":"Nabajeet Barman, Steven Schmidt, Saman Zadtootaghaj, M. Martini","doi":"10.1145/3510450.3517279","DOIUrl":"https://doi.org/10.1145/3510450.3517279","url":null,"abstract":"This paper presents an evaluation of the latest MPEG-5 Part 2 Low Complexity Enhancement Video Coding (LCEVC) for live gaming video streaming applications. The results are presented in terms of both objective and subjective quality measures. Our results indicate that LCEVC outperforms both x264 and x265 codecs in terms of bitrate savings using VMAF. Using subjective results, it is found that LCEVC outperforms the respective base codecs, especially for low bitrates. This effect is much more dominant for x264 as compared to x265, with marginal absolute improvement of quality scores for x265.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121695411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deploying the ITU-T P.1203 QoE model in the wild and retraining for new codecs","authors":"W. Robitza, Rakesh Rao Ramachandra Rao, Steve Göring, A. Dethof, A. Raake","doi":"10.1145/3510450.3517310","DOIUrl":"https://doi.org/10.1145/3510450.3517310","url":null,"abstract":"This paper presents two challenges associated with using the ITU-T P.1203 standard for video quality monitoring in practice. We discuss the issue of unavailable data on certain browsers/platforms and the lack of information within newly developed data formats like Common Media Client Data. We also re-trained the coefficients of the P.1203.1 video model for newer codecs, and published a completely new model derived from the P.1204.3 bitstream model.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"364 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116365591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using CMAF to deliver high resolution immersive video with ultra-low end to end latency for live streaming","authors":"Andrew Zhang, XiaoMing Chen, Ying Luo, Anna Qingfeng Li, William Cheung","doi":"10.1145/3510450.3517292","DOIUrl":"https://doi.org/10.1145/3510450.3517292","url":null,"abstract":"Immersive video with 8K or higher resolution utilizes viewport-dependent tile-based video with multi-resolutions (i.e. low-resolution background video with high-resolution video). OMAF defines how to deliver tiled immersive video through MPEG DASH. But End-to-End latency is a consistent problem for the MPEG DASH solution. Solutions using short segment with 1 sec duration will reduce latency, but even in those cases, without CDNs, the end-to-end latency is still 5 secs or more. And in most cases, massive segment files generated every second harden CDN, leading to much longer latencies, such as 20 secs or more. In this paper, we introduce a solution using Common Media Application Format (CMAF) to deliver tile-based immersive video to reduce the end-to-end latency to sub-3 secs. Based on CMAF: We enabled long duration CMAF segment with shorter End-to-End Latency by using long duration CMAF segmentation reduce CDN pressure since it reduces the amount segment files generated. In addition, we re-fetch relative CMAF chunks of high-resolution segments via our own adaptive viewport prediction algorithm. We use a decoder catching-up mechanism for prediction-missed tiles to reduce the M2HQ (Motion-To-High-Quality) latency while viewport changed within chunks. As we will show, this leads to an overall sub-3 seconds End-to-End latency with ~1 second Packager-Display Latency and average 300ms M2HQ latency can be reached with 5 seconds segmentation in non-CDN environment.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"355 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123548247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Overview of the DASH-HLS interoperability specification: 2021 edition","authors":"Zachary A. Cava","doi":"10.1145/3510450.3517281","DOIUrl":"https://doi.org/10.1145/3510450.3517281","url":null,"abstract":"While CMAF has provided the foundation for the interoperable packaging of streaming media, today it is still common practice to produce media specific to the delivery formats utilized by a service provider. As DASH and HLS are the delivery formats the industry has converged towards, a survey of deployments for DASH and HLS revealed two leading reasons for divergent packaging: media packaging requirements that were misaligned across formats and a non-trivial amount of tribal knowledge required to address media for common deployment use-cases in each format. To address the divergence of CMAF packaged media in DASH and HLS, the CTA WAVE project created a working group, the DASH-HLS Interoperability group, responsible for researching and transcribing the additional packaging and delivery format requirements necessary to achieve interoperability. Using industry guidance, the group defined a set of common streaming use-cases and published the interoperability details for the first four usecases in the 2021 Edition of the DASH-HLS Interoperability Specification (CTA-5005) [1]. The use-cases in this edition are: Basic On-Demand and Live Streaming, Low Latency Live Streaming, Encrypted Media Presentations, and Presentation Splicing. This talk will provide an overview of the specification outputs for these initial use-cases including the defined packaging and addressing requirements and any identified missing interoperability points that represent opportunities for further research. Beyond the current specification, this talk will highlight the new use-cases and work currently being prioritized for the next edition and how interested entities can get involved with the development.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128845515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video streaming using light-weight transcoding and in-network intelligence","authors":"A. Erfanian, Hadi Amirpour, F. Tashtarian, C. Timmerer, H. Hellwagner","doi":"10.1145/3510450.3517284","DOIUrl":"https://doi.org/10.1145/3510450.3517284","url":null,"abstract":"In this paper, we introduce a novel approach, LwTE, which reduces streaming costs in HTTP Adaptive Streaming (HAS) by enabling light-weight transcoding at the edge. In LwTE, during encoding of a video segment in the origin server, a metadata is generated which stores the optimal encoding decisions. LwTE enables us to store only the highest bitrate plus corresponding metadata (of very small size) for unpopular video segments/bitrates. Since metadata is of very small size, replacing unpopular video segments/bitrates with their metadata results in considerable saving in the storage costs. The metadata is reused at the edge servers to reduce the required time and computational resources for on-the-fly transcoding.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123791043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Low encoding overhead ultra-low latency streaming via HESP through sparse initialization streams","authors":"Pieter-Jan Speelmans","doi":"10.1145/3510450.3517294","DOIUrl":"https://doi.org/10.1145/3510450.3517294","url":null,"abstract":"HESP, the High Efficiency Streaming Protocol [4], realizes ultra-low latencies and ultra-short start-up times by combining two feeds, the keyframe-only Initialization Stream and the ultra-low latency CMAF-CTE Continuation Stream. HESP uses a keyframe from the Initialization Stream to start playback (via keyframe injection) of the Continuation Stream extremely close to the live edge. In previous research [5], the impact of the HESP keyframe injection on the video quality has been proven to be very low or even negligible. In contrast to the trivial double encoding for each quality in the bitrate ladder, in this paper we show that the overhead of the generation of the keyframe-only Initialization Streams can be reduced. We designed an approach in which the frequency of keyframes in the Initialization Streams is defined by a trade-off between the encoding overhead and two metrics in the viewing QoE: start-up time and time that it takes to switch to the highest feasible video quality of the ABR ladder. More specifically, for each quality Qi, fi is defined such that (i) switching to Qi, either for start-up or for switching to Qi as a higher quality, takes [EQUATION] additional delay, and (ii) there always is a Qi, lower than Qcurrent (unless Qcurrent is the lowest quality) to which the player can switch down instantly, which is needed in case of network problems. The resulting impact on the viewer QoE is characterizedby occasional (whenever an ABR switch to a higher quality is needed) short intervals [EQUATION] during which playback potentially is done at a lower than feasible video quality. Based on measurements, the proposed approach results in an overhead when encoding Initialization Streams of only 15 to 20%. Compared to \"standard\" HESP, the viewer QoE reduction is hardly noticeable.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133116819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"CAdViSE or how to find the sweet spots of ABR systems","authors":"Babak Taraghi, A. Bentaleb, C. Timmerer, Roger Zimmermann, H. Hellwagner","doi":"10.1145/3510450.3517274","DOIUrl":"https://doi.org/10.1145/3510450.3517274","url":null,"abstract":"With the recent surge in Internet multimedia traffic, the enhancement and improvement of media players, specifically Dynamic Adaptive Streaming over HTTP (DASH) media players happened at an incredible rate. DASH Media players take advantage of adapting a media stream to the network fluctuations by continuously monitoring the network and making decisions in near real-time. The performance of algorithms that are in charge of making such decisions was often difficult to be evaluated and objectively assessed from an End-to-end or holistic perspective [1]. CAdViSE provides a Cloud-based Adaptive Video Streaming Evaluation framework for the automated testing of adaptive media players [4]. We will introduce the CAdViSE framework, its application, and propose the benefits and advantages that it can bring to every web-based media player development pipeline. To demonstrate the power of CAdViSE in evaluating Adaptive Bitrate (ABR) algorithms we will exhibit its capabilities when combined with objective Quality of Experience (QoE) models. Our team at Bitmovin Inc. and ATHENA laboratory has selected the ITU-T P.1203 (mode 1) quality evaluation model in order to assess the experiments and calculate the Mean Opinion Score (MOS), and better understand the behavior of a set of well-known ABR algorithms in a real-life setting [2]. We will display how we tested and deployed our framework using a modular architecture into a cloud infrastructure. This method yields a massive growth to the number of concurrent experiments and the number of media players that can be evaluated and compared at the same time, thus enabling maximum potential scalability. In our team's most recent experiments, we used Amazon Web Services (AWS) for demonstration purposes. Another awesome feature of CAdViSE that will be discussed here is the ability to shape the test network with endless network profiles. To do so, we used a fluctuation network profile and a real LTE network trace based on the recorded internet usage of a bicycle commuter in Belgium. CAdViSE produces comprehensive logs for each experimental session. These logs can then be applied against different goals, such as objective evaluation or to stitch back media segments and conduct subjective evaluations. In addition, startup delays, stall events, and other media streaming defects can be imitated exactly as they happened during the experimental streaming sessions [3].","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131238162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Latest advances in the development of the open-source player dash.js","authors":"D. Silhavy, S. Pham, S. Arbanowski, S. Steglich, Björn Harrer","doi":"10.1145/3510450.3517311","DOIUrl":"https://doi.org/10.1145/3510450.3517311","url":null,"abstract":"The trend to consume high-quality videos over the internet lead to a high demand for sophisticated and robust video player implementations. dash.js is a prominent option for implementing production grade DASH-based applications and products, and is also widely used for academic research purposes. In this paper, we introduce the latest additions and improvements to dash.js. We focus on various features and use cases such as player performance and robustness, low latency streaming, metric reporting and digital rights management. The features and improvements introduced in this paper provide great benefits not only for media streaming clients, but also for the server-side components involved in the media stream process.","PeriodicalId":122386,"journal":{"name":"Proceedings of the 1st Mile-High Video Conference","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132790514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}