Spherical Distortion Temporal Propagation and Spatial Mapping Model for Efficient Panoramic Video Coding

IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Xu Yang;Minfeng Huang;Hongwei Guo;Shengxi Li;Lei Luo;Ce Zhu
{"title":"Spherical Distortion Temporal Propagation and Spatial Mapping Model for Efficient Panoramic Video Coding","authors":"Xu Yang;Minfeng Huang;Hongwei Guo;Shengxi Li;Lei Luo;Ce Zhu","doi":"10.1109/TBC.2024.3358749","DOIUrl":null,"url":null,"abstract":"Panoramic video undergoes projection onto a two-dimensional plane for compression and subsequent back-projection onto a sphere for display. This process introduces inconsistency between compression distortion and perceived spherical distortion, which causes a serious loss in coding efficiency. Meanwhile, the existing independent rate-distortion optimization (RDO) model for spherical distortion solely accounts for the current coding frame and neglects its influence on subsequent frames, which leads to sub-optimal coding performance. To this end, we propose a spherical distortion temporal propagation and spatial mapping model for efficient panoramic video coding. First, a zero-delay spherical distortion backward propagation chain is established in the temporal domain, and distortion impact factors are computed. Then, an accurate spatial mapping relationship between spherical distortion and coding distortion is constructed, along with the calculation of spatial mapping weights. Finally, these components are integrated into spherical RDO. The experimental results demonstrated the effectiveness of the proposed algorithm. Compared to the versatile video coding test model (VTM-14.0) with a 360Lib extension under low-delay P frame and B frame configurations, the proposed algorithm achieves bitrate savings of 9.4% (up to 19.4%) and 8.5% (up to 19.0%) by using WSPSNR as the distortion evaluation index, respectively. Additionally, the coding time was reduced by 14.53% and 15.65%, respectively.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 2","pages":"654-666"},"PeriodicalIF":3.2000,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Broadcasting","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10439250/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Panoramic video undergoes projection onto a two-dimensional plane for compression and subsequent back-projection onto a sphere for display. This process introduces inconsistency between compression distortion and perceived spherical distortion, which causes a serious loss in coding efficiency. Meanwhile, the existing independent rate-distortion optimization (RDO) model for spherical distortion solely accounts for the current coding frame and neglects its influence on subsequent frames, which leads to sub-optimal coding performance. To this end, we propose a spherical distortion temporal propagation and spatial mapping model for efficient panoramic video coding. First, a zero-delay spherical distortion backward propagation chain is established in the temporal domain, and distortion impact factors are computed. Then, an accurate spatial mapping relationship between spherical distortion and coding distortion is constructed, along with the calculation of spatial mapping weights. Finally, these components are integrated into spherical RDO. The experimental results demonstrated the effectiveness of the proposed algorithm. Compared to the versatile video coding test model (VTM-14.0) with a 360Lib extension under low-delay P frame and B frame configurations, the proposed algorithm achieves bitrate savings of 9.4% (up to 19.4%) and 8.5% (up to 19.0%) by using WSPSNR as the distortion evaluation index, respectively. Additionally, the coding time was reduced by 14.53% and 15.65%, respectively.
用于高效全景视频编码的球形畸变时空传播和空间映射模型
全景视频先投影到二维平面上进行压缩,然后再反投影到球面上进行显示。这一过程会导致压缩失真与感知球面失真不一致,从而严重降低编码效率。同时,现有的球形失真独立速率-失真优化(RDO)模型只考虑当前编码帧,忽略了其对后续帧的影响,导致编码性能未达到最佳。为此,我们提出了一种球形失真时间传播和空间映射模型,用于高效的全景视频编码。首先,在时域建立零延迟球形失真后向传播链,并计算失真影响因子。然后,构建球形失真与编码失真之间的精确空间映射关系,并计算空间映射权重。最后,将这些组件集成到球形 RDO 中。实验结果证明了所提算法的有效性。在低延迟 P 帧和 B 帧配置下,与带有 360Lib 扩展的通用视频编码测试模型(VTM-14.0)相比,以 WSPSNR 作为失真评估指标,所提算法分别节省了 9.4% (最高 19.4%)和 8.5%(最高 19.0%)的比特率。此外,编码时间也分别缩短了 14.53% 和 15.65%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Broadcasting
IEEE Transactions on Broadcasting 工程技术-电信学
CiteScore
9.40
自引率
31.10%
发文量
79
审稿时长
6-12 weeks
期刊介绍: The Society’s Field of Interest is “Devices, equipment, techniques and systems related to broadcast technology, including the production, distribution, transmission, and propagation aspects.” In addition to this formal FOI statement, which is used to provide guidance to the Publications Committee in the selection of content, the AdCom has further resolved that “broadcast systems includes all aspects of transmission, propagation, and reception.”
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信