{"title":"3DoF+ video compression algorithm based on distortion minimization interpolation with adaptive reference view selection","authors":"Mingliu Sun, Shengyang Zhao, Xiaoming Chen","doi":"10.1145/3381271.3381279","DOIUrl":null,"url":null,"abstract":"The emerging 3DoF+ video offers more interactivity and deeper immersion. However, the bulky size of the 3DoF+ video imposes new challenges in its storage and processing. Thus, compressing 3DoF+ video efficiently is vitally important. Due to the geometric relationship between viewpoints, overlaps exist in views and this redundancy is available for further compression. However, existing compression algorithms either utilize partial information of the scene or still retain a lot of redundancy, which results in low quality reconstructed views. To address these problems, our study proposes to select reference views adaptively, which utilizes as few bits as possible to contain the necessary visual information of the scene. Besides, in order to reduce the residues between the reconstructed and original views, the distortion minimization interpolation (DMI) method is then proposed for further refining the synthesized views. The experimental results show that our method achieves on average 52.1% BD-rate reduction than direct coding with High Efficiency Video Coding (HEVC).","PeriodicalId":124651,"journal":{"name":"Proceedings of the 5th International Conference on Multimedia and Image Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Multimedia and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3381271.3381279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The emerging 3DoF+ video offers more interactivity and deeper immersion. However, the bulky size of the 3DoF+ video imposes new challenges in its storage and processing. Thus, compressing 3DoF+ video efficiently is vitally important. Due to the geometric relationship between viewpoints, overlaps exist in views and this redundancy is available for further compression. However, existing compression algorithms either utilize partial information of the scene or still retain a lot of redundancy, which results in low quality reconstructed views. To address these problems, our study proposes to select reference views adaptively, which utilizes as few bits as possible to contain the necessary visual information of the scene. Besides, in order to reduce the residues between the reconstructed and original views, the distortion minimization interpolation (DMI) method is then proposed for further refining the synthesized views. The experimental results show that our method achieves on average 52.1% BD-rate reduction than direct coding with High Efficiency Video Coding (HEVC).
新兴的3DoF+视频提供了更多的交互性和更深入的沉浸感。然而,3DoF+视频的庞大尺寸给其存储和处理带来了新的挑战。因此,有效压缩3DoF+视频是至关重要的。由于视点之间的几何关系,视点之间存在重叠,这种冗余可用于进一步压缩。然而,现有的压缩算法要么利用了场景的部分信息,要么保留了大量的冗余,导致重构视图的质量不高。为了解决这些问题,我们的研究建议自适应地选择参考视图,它使用尽可能少的比特来包含场景的必要视觉信息。此外,为了减少重建视图与原始视图之间的残差,提出了失真最小化插值(DMI)方法对合成视图进行进一步细化。实验结果表明,采用高效视频编码(High Efficiency Video coding, HEVC)比直接编码平均降低52.1%的bd率。