Geometry Coding for Dynamic Voxelized Point Clouds Using Octrees and Multiple Contexts.

IF 10.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Image Processing Pub Date : 2019-08-01 DOI:10.1109/TIP.2019.2931466

Diogo C Garcia, Tiago A Fonseca, Renan U Ferreira, Ricardo L de Queiroz

{"title":"Geometry Coding for Dynamic Voxelized Point Clouds Using Octrees and Multiple Contexts.","authors":"Diogo C Garcia, Tiago A Fonseca, Renan U Ferreira, Ricardo L de Queiroz","doi":"10.1109/TIP.2019.2931466","DOIUrl":null,"url":null,"abstract":"<p><p>We present a method to compress geometry information of point clouds that explores redundancies across consecutive frames of a sequence. It uses octrees and works by progressively increasing resolution of the octree. At each branch of the tree, we generate an approximation of the child nodes by a number of methods which are used as contexts to drive an arithmetic coder. The best approximation, i.e. the context that yields the least amount of encoding bits, is selected and the chosen method is indicated as side information for replication at the decoder. The core of our method is a context-based arithmetic coder in which a reference octree is used as reference to encode the current octree, thus providing 255 contexts for each output octet. The 255×255 frequency histogram is viewed as a discrete 3D surface and is conveyed to the decoder using another octree. We present two methods to generate the predictions (contexts) which use adjacent frames in the sequence (inter-frame) and one method that works purely intra-frame. The encoder continuously switches the best mode among the three and conveys such information to the decoder. Since an intra-frame prediction is present, our coder can also work in purely intra-frame mode, as well. Extensive results are presented to show the method's potential against many compression alternatives for the geometry information in dynamic voxelized point clouds.</p>","PeriodicalId":13217,"journal":{"name":"IEEE Transactions on Image Processing","volume":null,"pages":null},"PeriodicalIF":10.8000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Image Processing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/TIP.2019.2931466","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

We present a method to compress geometry information of point clouds that explores redundancies across consecutive frames of a sequence. It uses octrees and works by progressively increasing resolution of the octree. At each branch of the tree, we generate an approximation of the child nodes by a number of methods which are used as contexts to drive an arithmetic coder. The best approximation, i.e. the context that yields the least amount of encoding bits, is selected and the chosen method is indicated as side information for replication at the decoder. The core of our method is a context-based arithmetic coder in which a reference octree is used as reference to encode the current octree, thus providing 255 contexts for each output octet. The 255×255 frequency histogram is viewed as a discrete 3D surface and is conveyed to the decoder using another octree. We present two methods to generate the predictions (contexts) which use adjacent frames in the sequence (inter-frame) and one method that works purely intra-frame. The encoder continuously switches the best mode among the three and conveys such information to the decoder. Since an intra-frame prediction is present, our coder can also work in purely intra-frame mode, as well. Extensive results are presented to show the method's potential against many compression alternatives for the geometry information in dynamic voxelized point clouds.

查看原文本刊更多论文

使用八叉树和多语境为动态体素化点云进行几何编码

我们提出了一种压缩点云几何信息的方法，这种方法可以探索序列中连续帧的冗余。该方法使用八叉树，并通过逐步提高八叉树的分辨率来实现。在八叉树的每个分支上，我们通过多种方法生成子节点的近似值，并将其作为上下文来驱动算术编码器。我们会选择最佳近似值，即产生最少编码比特的上下文，并将所选方法作为侧信息在解码器中进行复制。我们方法的核心是基于上下文的算术编码器，其中参考八进制被用作当前八进制的编码参考，从而为每个输出八进制提供 255 个上下文。255×255 频率直方图被视为一个离散的三维表面，并通过另一个八叉树传达给解码器。我们提出了两种利用序列中相邻帧（帧间）生成预测（上下文）的方法，以及一种纯粹在帧内工作的方法。编码器不断切换这三种方法中的最佳模式，并将这些信息传递给解码器。由于存在帧内预测，我们的编码器也可以在纯帧内模式下工作。本文展示了大量结果，显示了该方法在动态体素化点云几何信息压缩方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Image Processing 工程技术-工程：电子与电气

CiteScore

20.90

自引率

6.60%

发文量

774

审稿时长

7.6 months

期刊介绍： The IEEE Transactions on Image Processing delves into groundbreaking theories, algorithms, and structures concerning the generation, acquisition, manipulation, transmission, scrutiny, and presentation of images, video, and multidimensional signals across diverse applications. Topics span mathematical, statistical, and perceptual aspects, encompassing modeling, representation, formation, coding, filtering, enhancement, restoration, rendering, halftoning, search, and analysis of images, video, and multidimensional signals. Pertinent applications range from image and video communications to electronic imaging, biomedical imaging, image and video systems, and remote sensing.