Thuc Nguyen Huu;Vinh Van Duong;Jonghoon Yim;Byeungwoo Jeon
{"title":"Adaptive Motion Vector Resolutions in Raw Plenoptic Video Coding","authors":"Thuc Nguyen Huu;Vinh Van Duong;Jonghoon Yim;Byeungwoo Jeon","doi":"10.1109/OJSP.2025.3572840","DOIUrl":null,"url":null,"abstract":"This paper addresses the unique challenges of compressing raw plenoptic video, in which the inherent hexagonal micro-image layout and sparse distribution of motion vectors (MVs) often diminish the coding efficiency of conventional block-based motion compensation. To mitigate excessive overhead from motion vector difference (MVD) signaling, we use three specialized MV resolutions: a hexagonal-lattice (HL) alignment that matches the micro-image structure, an integer-pel resolution, and a quarter-pel resolution. We then develop a rate-distortion (RD)-optimized scheme that adaptively selects the most suitable MV resolution at the coding unit level. By integrating our approach into the Versatile Video Coding (VVC) framework, the proposed method reduces MVD bits significantly while preserving high prediction accuracy. Experiments using two comprehensive plenoptic camera datasets — lenslet 1.0 and lenslet 2.0 — demonstrate substantial gains over the VVC anchor, achieving average Bjontegaard–Delta rate savings of 5.90% and 1.80%, respectively. These results confirm that combining HL and conventional resolutions in an RD-optimized manner substantially enhances motion prediction efficiency for raw plenoptic video.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"917-925"},"PeriodicalIF":2.7000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11010129","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE open journal of signal processing","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11010129/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
This paper addresses the unique challenges of compressing raw plenoptic video, in which the inherent hexagonal micro-image layout and sparse distribution of motion vectors (MVs) often diminish the coding efficiency of conventional block-based motion compensation. To mitigate excessive overhead from motion vector difference (MVD) signaling, we use three specialized MV resolutions: a hexagonal-lattice (HL) alignment that matches the micro-image structure, an integer-pel resolution, and a quarter-pel resolution. We then develop a rate-distortion (RD)-optimized scheme that adaptively selects the most suitable MV resolution at the coding unit level. By integrating our approach into the Versatile Video Coding (VVC) framework, the proposed method reduces MVD bits significantly while preserving high prediction accuracy. Experiments using two comprehensive plenoptic camera datasets — lenslet 1.0 and lenslet 2.0 — demonstrate substantial gains over the VVC anchor, achieving average Bjontegaard–Delta rate savings of 5.90% and 1.80%, respectively. These results confirm that combining HL and conventional resolutions in an RD-optimized manner substantially enhances motion prediction efficiency for raw plenoptic video.