Adaptive Multi-View Live Video Streaming for Teledriving Using a Single Hardware Encoder

2020 IEEE International Symposium on Multimedia (ISM) Pub Date : 2020-12-01 DOI:10.1109/ISM.2020.00008

M. Hofbauer, Christopher B. Kuhn, G. Petrovic, E. Steinbach

{"title":"Adaptive Multi-View Live Video Streaming for Teledriving Using a Single Hardware Encoder","authors":"M. Hofbauer, Christopher B. Kuhn, G. Petrovic, E. Steinbach","doi":"10.1109/ISM.2020.00008","DOIUrl":null,"url":null,"abstract":"Teleoperated driving (TOD) is a possible solution to cope with failures of autonomous vehicles. In TOD, the human operator perceives the traffic situation via video streams of multiple cameras from a remote location. Adaptation mechanisms are needed in order to match the available transmission resources and provide the operator with the best possible situation awareness. This includes the adjustment of individual camera video streams according to the current traffic situation. The limited video encoding hardware in vehicles requires the combination of individual camera frames into a larger superframe video. While this enables the encoding of multiple camera views with a single encoder, it does not allow for rate/quality adaptation of the individual views. To this end, we propose a novel concept that uses preprocessing filters to enable individual rate/quality adaptations in the superframe video. The proposed preprocessing filters allow for the usage of existing multidimensional adaptation models in the same way as for individual video streams using multiple encoders. Our experiments confirm that the proposed concept is able to control the spatial, temporal and quality resolution of individual segments in the superframe video. Additionally, we demonstrate the usability of the proposed method by applying it in a multi-view teledriving scenario. We compare our approach to individually encoded video streams and a multiplexing solution without preprocessing. The results show that the proposed approach produces bitrates for the individual video streams which are comparable to the bitrates achieved with separate encoders. While achieving a similar bitrate for the most important views, our approach requires a total bitrate that is 40% smaller compared to the multiplexing approach without preprocessing.","PeriodicalId":120972,"journal":{"name":"2020 IEEE International Symposium on Multimedia (ISM)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Symposium on Multimedia (ISM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISM.2020.00008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Teleoperated driving (TOD) is a possible solution to cope with failures of autonomous vehicles. In TOD, the human operator perceives the traffic situation via video streams of multiple cameras from a remote location. Adaptation mechanisms are needed in order to match the available transmission resources and provide the operator with the best possible situation awareness. This includes the adjustment of individual camera video streams according to the current traffic situation. The limited video encoding hardware in vehicles requires the combination of individual camera frames into a larger superframe video. While this enables the encoding of multiple camera views with a single encoder, it does not allow for rate/quality adaptation of the individual views. To this end, we propose a novel concept that uses preprocessing filters to enable individual rate/quality adaptations in the superframe video. The proposed preprocessing filters allow for the usage of existing multidimensional adaptation models in the same way as for individual video streams using multiple encoders. Our experiments confirm that the proposed concept is able to control the spatial, temporal and quality resolution of individual segments in the superframe video. Additionally, we demonstrate the usability of the proposed method by applying it in a multi-view teledriving scenario. We compare our approach to individually encoded video streams and a multiplexing solution without preprocessing. The results show that the proposed approach produces bitrates for the individual video streams which are comparable to the bitrates achieved with separate encoders. While achieving a similar bitrate for the most important views, our approach requires a total bitrate that is 40% smaller compared to the multiplexing approach without preprocessing.

查看原文本刊更多论文

自适应多视图实时视频流电视驾驶使用单一硬件编码器

远程操作驾驶(TOD)是应对自动驾驶汽车故障的一种可能的解决方案。在TOD中，人类操作员通过来自远程位置的多个摄像头的视频流来感知交通状况。为了匹配可用的传输资源并为运营商提供最佳的态势感知，需要自适应机制。这包括根据当前交通状况调整单个摄像机视频流。车辆中有限的视频编码硬件需要将单个摄像机帧组合成更大的超帧视频。虽然这样可以用一个编码器对多个摄像机视图进行编码，但它不允许对单个视图进行速率/质量调整。为此，我们提出了一种新的概念，即使用预处理滤波器在超帧视频中实现个人速率/质量调整。所提出的预处理过滤器允许以与使用多个编码器的单个视频流相同的方式使用现有的多维自适应模型。我们的实验证实了所提出的概念能够控制超帧视频中单个片段的空间、时间和质量分辨率。此外，我们通过将其应用于多视图电视驾驶场景来证明所提出方法的可用性。我们将我们的方法与单独编码的视频流和没有预处理的多路复用解决方案进行比较。结果表明，该方法产生的单个视频流的比特率与使用单独编码器获得的比特率相当。虽然对于最重要的视图实现了类似的比特率，但我们的方法需要的总比特率比没有预处理的多路复用方法小40%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE International Symposium on Multimedia (ISM)

自引率

0.00%

发文量