Linyun Liu , Tiansong Li , Huijuan Zhao , Shuangjiang He , Jiabao Zhu , Jinhao Kuang , Li Yu
{"title":"PD-RC: Perception-driven rate control for panoramic video coding based on energy distribution optimization","authors":"Linyun Liu , Tiansong Li , Huijuan Zhao , Shuangjiang He , Jiabao Zhu , Jinhao Kuang , Li Yu","doi":"10.1016/j.displa.2025.103198","DOIUrl":null,"url":null,"abstract":"<div><div>In immersive panoramic video (PV) encoding scenarios, PV exhibits larger flat regions compared to traditional videos. The existing intra-rate control methods in Versatile Video Coding (VVC) allocate the bitrate based on the construction of weights according to the energy distribution characteristics of different encoding blocks. However, in reality, the human visual system (HVS) is not sensitive to blocks with many flat regions in perception, which leads to excessive allocation of bitrate in insensitive regions. On the contrary, insufficient bitrate allocation in sensitive regions leads to the inability to achieve better reconstruction quality. To address this challenge, we propose a Perception-Driven Rate Control (PD-RC) strategy for panoramic video encoding based on energy distribution optimization, which makes the intra-rate control closer to the perception habits of HVS. Firstly, we propose a low-complexity filtering method guided by rate–distortion performance to optimize the energy distribution of I-frame features. Subsequently, leveraging the optimized perception features of the energy distribution, a perception-driven intra-mode coding-tree-unit-level rate control strategy is proposed to improve the coding performance for PV. Extensive evaluations show the performance of PD-RC over the state-of-the-art rate control methods of VVC. Specifically, in all-intra encoding mode, the average bitrate savings of PD-RC is −5.002%, while the average gain in weighted-spherically quality is 0.239 dB, with a rate exceeding the upper limit as low as 2%, and a reduction in encoding complexity gain of −0.163%. PD-RC effectively improves the rate control performance of PV intra-frame coding while saving computational overhead. It is significant for optimizing data transmission efficiency, enhancing video quality, and reducing storage costs. The source code will be available at <span><span>https://github.com/liulinyun324/PD-RC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103198"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225002355","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
In immersive panoramic video (PV) encoding scenarios, PV exhibits larger flat regions compared to traditional videos. The existing intra-rate control methods in Versatile Video Coding (VVC) allocate the bitrate based on the construction of weights according to the energy distribution characteristics of different encoding blocks. However, in reality, the human visual system (HVS) is not sensitive to blocks with many flat regions in perception, which leads to excessive allocation of bitrate in insensitive regions. On the contrary, insufficient bitrate allocation in sensitive regions leads to the inability to achieve better reconstruction quality. To address this challenge, we propose a Perception-Driven Rate Control (PD-RC) strategy for panoramic video encoding based on energy distribution optimization, which makes the intra-rate control closer to the perception habits of HVS. Firstly, we propose a low-complexity filtering method guided by rate–distortion performance to optimize the energy distribution of I-frame features. Subsequently, leveraging the optimized perception features of the energy distribution, a perception-driven intra-mode coding-tree-unit-level rate control strategy is proposed to improve the coding performance for PV. Extensive evaluations show the performance of PD-RC over the state-of-the-art rate control methods of VVC. Specifically, in all-intra encoding mode, the average bitrate savings of PD-RC is −5.002%, while the average gain in weighted-spherically quality is 0.239 dB, with a rate exceeding the upper limit as low as 2%, and a reduction in encoding complexity gain of −0.163%. PD-RC effectively improves the rate control performance of PV intra-frame coding while saving computational overhead. It is significant for optimizing data transmission efficiency, enhancing video quality, and reducing storage costs. The source code will be available at https://github.com/liulinyun324/PD-RC.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.