基于能量分布优化的全景视频编码感知驱动速率控制

IF 3.4 2区 工程技术 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Linyun Liu , Tiansong Li , Huijuan Zhao , Shuangjiang He , Jiabao Zhu , Jinhao Kuang , Li Yu
{"title":"基于能量分布优化的全景视频编码感知驱动速率控制","authors":"Linyun Liu ,&nbsp;Tiansong Li ,&nbsp;Huijuan Zhao ,&nbsp;Shuangjiang He ,&nbsp;Jiabao Zhu ,&nbsp;Jinhao Kuang ,&nbsp;Li Yu","doi":"10.1016/j.displa.2025.103198","DOIUrl":null,"url":null,"abstract":"<div><div>In immersive panoramic video (PV) encoding scenarios, PV exhibits larger flat regions compared to traditional videos. The existing intra-rate control methods in Versatile Video Coding (VVC) allocate the bitrate based on the construction of weights according to the energy distribution characteristics of different encoding blocks. However, in reality, the human visual system (HVS) is not sensitive to blocks with many flat regions in perception, which leads to excessive allocation of bitrate in insensitive regions. On the contrary, insufficient bitrate allocation in sensitive regions leads to the inability to achieve better reconstruction quality. To address this challenge, we propose a Perception-Driven Rate Control (PD-RC) strategy for panoramic video encoding based on energy distribution optimization, which makes the intra-rate control closer to the perception habits of HVS. Firstly, we propose a low-complexity filtering method guided by rate–distortion performance to optimize the energy distribution of I-frame features. Subsequently, leveraging the optimized perception features of the energy distribution, a perception-driven intra-mode coding-tree-unit-level rate control strategy is proposed to improve the coding performance for PV. Extensive evaluations show the performance of PD-RC over the state-of-the-art rate control methods of VVC. Specifically, in all-intra encoding mode, the average bitrate savings of PD-RC is −5.002%, while the average gain in weighted-spherically quality is 0.239 dB, with a rate exceeding the upper limit as low as 2%, and a reduction in encoding complexity gain of −0.163%. PD-RC effectively improves the rate control performance of PV intra-frame coding while saving computational overhead. It is significant for optimizing data transmission efficiency, enhancing video quality, and reducing storage costs. The source code will be available at <span><span>https://github.com/liulinyun324/PD-RC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103198"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PD-RC: Perception-driven rate control for panoramic video coding based on energy distribution optimization\",\"authors\":\"Linyun Liu ,&nbsp;Tiansong Li ,&nbsp;Huijuan Zhao ,&nbsp;Shuangjiang He ,&nbsp;Jiabao Zhu ,&nbsp;Jinhao Kuang ,&nbsp;Li Yu\",\"doi\":\"10.1016/j.displa.2025.103198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In immersive panoramic video (PV) encoding scenarios, PV exhibits larger flat regions compared to traditional videos. The existing intra-rate control methods in Versatile Video Coding (VVC) allocate the bitrate based on the construction of weights according to the energy distribution characteristics of different encoding blocks. However, in reality, the human visual system (HVS) is not sensitive to blocks with many flat regions in perception, which leads to excessive allocation of bitrate in insensitive regions. On the contrary, insufficient bitrate allocation in sensitive regions leads to the inability to achieve better reconstruction quality. To address this challenge, we propose a Perception-Driven Rate Control (PD-RC) strategy for panoramic video encoding based on energy distribution optimization, which makes the intra-rate control closer to the perception habits of HVS. Firstly, we propose a low-complexity filtering method guided by rate–distortion performance to optimize the energy distribution of I-frame features. Subsequently, leveraging the optimized perception features of the energy distribution, a perception-driven intra-mode coding-tree-unit-level rate control strategy is proposed to improve the coding performance for PV. Extensive evaluations show the performance of PD-RC over the state-of-the-art rate control methods of VVC. Specifically, in all-intra encoding mode, the average bitrate savings of PD-RC is −5.002%, while the average gain in weighted-spherically quality is 0.239 dB, with a rate exceeding the upper limit as low as 2%, and a reduction in encoding complexity gain of −0.163%. PD-RC effectively improves the rate control performance of PV intra-frame coding while saving computational overhead. It is significant for optimizing data transmission efficiency, enhancing video quality, and reducing storage costs. The source code will be available at <span><span>https://github.com/liulinyun324/PD-RC</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50570,\"journal\":{\"name\":\"Displays\",\"volume\":\"91 \",\"pages\":\"Article 103198\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Displays\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141938225002355\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225002355","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

在沉浸式全景视频(PV)编码场景中,与传统视频相比,PV显示出更大的平面区域。通用视频编码(VVC)中现有的码率内控制方法是根据不同编码块的能量分布特点,通过构建权值来分配码率。然而,在现实中,人类视觉系统(HVS)对具有许多平坦区域的块不敏感,这导致在不敏感区域分配过多的比特率。相反,敏感区域比特率分配不足导致无法获得更好的重建质量。为了解决这一挑战,我们提出了一种基于能量分布优化的全景视频编码感知驱动速率控制(PD-RC)策略,该策略使速率内控制更接近HVS的感知习惯。首先,我们提出了一种基于率失真性能的低复杂度滤波方法来优化i帧特征的能量分布。随后,利用优化后的能量分布感知特征,提出了一种感知驱动的模式内编码树单元级速率控制策略,以提高光伏的编码性能。广泛的评估表明PD-RC的性能优于最先进的VVC速率控制方法。具体而言,在全帧内编码模式下,PD-RC的平均比特率节省为- 5.002%,而加权球质量的平均增益为0.239 dB,超过上限的速率低至2%,编码复杂度增益降低为- 0.163%。PD-RC有效地提高了帧内编码的速率控制性能,同时节省了计算开销。这对优化数据传输效率、提高视频质量、降低存储成本具有重要意义。源代码可从https://github.com/liulinyun324/PD-RC获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
PD-RC: Perception-driven rate control for panoramic video coding based on energy distribution optimization
In immersive panoramic video (PV) encoding scenarios, PV exhibits larger flat regions compared to traditional videos. The existing intra-rate control methods in Versatile Video Coding (VVC) allocate the bitrate based on the construction of weights according to the energy distribution characteristics of different encoding blocks. However, in reality, the human visual system (HVS) is not sensitive to blocks with many flat regions in perception, which leads to excessive allocation of bitrate in insensitive regions. On the contrary, insufficient bitrate allocation in sensitive regions leads to the inability to achieve better reconstruction quality. To address this challenge, we propose a Perception-Driven Rate Control (PD-RC) strategy for panoramic video encoding based on energy distribution optimization, which makes the intra-rate control closer to the perception habits of HVS. Firstly, we propose a low-complexity filtering method guided by rate–distortion performance to optimize the energy distribution of I-frame features. Subsequently, leveraging the optimized perception features of the energy distribution, a perception-driven intra-mode coding-tree-unit-level rate control strategy is proposed to improve the coding performance for PV. Extensive evaluations show the performance of PD-RC over the state-of-the-art rate control methods of VVC. Specifically, in all-intra encoding mode, the average bitrate savings of PD-RC is −5.002%, while the average gain in weighted-spherically quality is 0.239 dB, with a rate exceeding the upper limit as low as 2%, and a reduction in encoding complexity gain of −0.163%. PD-RC effectively improves the rate control performance of PV intra-frame coding while saving computational overhead. It is significant for optimizing data transmission efficiency, enhancing video quality, and reducing storage costs. The source code will be available at https://github.com/liulinyun324/PD-RC.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Displays
Displays 工程技术-工程:电子与电气
CiteScore
4.60
自引率
25.60%
发文量
138
审稿时长
92 days
期刊介绍: Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface. Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信