Optimizing storage and delivery of Omnidirectional Videos in Viewport-dependent streaming

Kashyap Kammachi Sreedhar, M. Hannuksela, Emre B. Aksu, Lauri Ilola, Lukasz Condrad
{"title":"Optimizing storage and delivery of Omnidirectional Videos in Viewport-dependent streaming","authors":"Kashyap Kammachi Sreedhar, M. Hannuksela, Emre B. Aksu, Lauri Ilola, Lukasz Condrad","doi":"10.1109/ISM55400.2022.00039","DOIUrl":null,"url":null,"abstract":"The OMAF standard makes use of a framework called the viewport-dependent-delivery for the streaming of 360-degree videos. OMAF uses ISOBMFF for storage and MPEG-DASH as one of the delivery mechanisms. In viewport-dependent-streaming videos are spatially divided and encoded into multiple tracks and each track is further segmented for DASH delivery. Segmentation requires additional metadata which adds to bitrate overhead. The main contributor to this overhead is the track fragment run in a box with the four-character code, ‘trun’. The TRUN records the following information of each sample in a track: the size, duration, flags, and time offsets and uses a fixed byte size to record this information. To minimize the bitrate overhead of TRUN, four different representation algorithms have been explored. This paper briefly describes the four TRUN representations and discusses the benefits and drawbacks of each algorithm. For evaluation, the algorithms were implemented in the MP4BOX module of the GPAC suite. The results were evaluated for different segment durations (500ms, 1s, 2s, 4s), different tiling grids (8x4, 9x6), two videos (bip-bop, countertiles) with different packaging techniques (no encryption, encryption of Keyframes, encryption of all frames) The algorithms reduced the bitrate overhead by 59% on average as compared to the original TRUN representation.","PeriodicalId":112060,"journal":{"name":"2022 IEEE International Symposium on Multimedia (ISM)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Multimedia (ISM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISM55400.2022.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The OMAF standard makes use of a framework called the viewport-dependent-delivery for the streaming of 360-degree videos. OMAF uses ISOBMFF for storage and MPEG-DASH as one of the delivery mechanisms. In viewport-dependent-streaming videos are spatially divided and encoded into multiple tracks and each track is further segmented for DASH delivery. Segmentation requires additional metadata which adds to bitrate overhead. The main contributor to this overhead is the track fragment run in a box with the four-character code, ‘trun’. The TRUN records the following information of each sample in a track: the size, duration, flags, and time offsets and uses a fixed byte size to record this information. To minimize the bitrate overhead of TRUN, four different representation algorithms have been explored. This paper briefly describes the four TRUN representations and discusses the benefits and drawbacks of each algorithm. For evaluation, the algorithms were implemented in the MP4BOX module of the GPAC suite. The results were evaluated for different segment durations (500ms, 1s, 2s, 4s), different tiling grids (8x4, 9x6), two videos (bip-bop, countertiles) with different packaging techniques (no encryption, encryption of Keyframes, encryption of all frames) The algorithms reduced the bitrate overhead by 59% on average as compared to the original TRUN representation.
在依赖视口的流媒体中优化全向视频的存储和交付
OMAF标准使用了一种称为视口相关交付的框架,用于360度视频流。OMAF使用ISOBMFF进行存储,并使用MPEG-DASH作为交付机制之一。在视口相关的流媒体视频被空间分割并编码成多个轨道,每个轨道被进一步分割为DASH传输。分段需要额外的元数据,这增加了比特率开销。这一开销的主要贡献者是在带有四个字符代码“run”的盒子中运行的轨道片段。TRUN记录轨道中每个样本的以下信息:大小、持续时间、标志和时间偏移,并使用固定的字节大小来记录这些信息。为了最小化TRUN的比特率开销,研究了四种不同的表示算法。本文简要介绍了四种TRUN表示,并讨论了每种算法的优缺点。为了进行评估,算法在GPAC套件的MP4BOX模块中实现。结果对不同的片段持续时间(500ms, 15, 2s, 4s),不同的平铺网格(8x4, 9x6),两个视频(bip-bop, countertiles)使用不同的封装技术(无加密,关键帧加密,所有帧加密)进行了评估。与原始TRUN表示相比,算法平均减少了59%的比特率开销。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信