PlanarRecon: Realtime 3D Plane Detection and Reconstruction from Posed Monocular Videos

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2022-06-01 DOI:10.1109/CVPR52688.2022.00612

Yiming Xie, Matheus Gadelha, Fengting Yang, Xiaowei Zhou, Huaizu Jiang

引用次数: 9

Abstract

We present PlanarRecon - a novel framework for globally coherent detection and reconstruction of 3D planes from a posed monocular video. Unlike previous works that detect planes in 2D from a single image, PlanarRecon incrementally detects planes in 3D for each video fragment, which consists of a set of key frames, from a volumetric representation of the scene using neural networks. A learning-based tracking and fusion module is designed to merge planes from previous fragments to form a coherent global plane reconstruction. Such design allows Planar-Recon to integrate observations from multiple views within each fragment and temporal information across different ones, resulting in an accurate and coherent reconstruction of the scene abstraction with low-polygonal geometry. Experiments show that the proposed approach achieves state-of-the-art performances on the ScanNet dataset while being real-time. Code is available at the project page: https://neu-vi.github.io/planarrecon/.

查看原文本刊更多论文

PlanarRecon:实时三维平面检测和重建从摆单目视频

我们提出了PlanarRecon -一个新的框架，用于从单目视频中进行全局相干检测和重建3D平面。与以前的工作不同，PlanarRecon从单个图像中检测2D平面，它使用神经网络从场景的体积表示中增量地检测每个视频片段(由一组关键帧组成)的3D平面。设计了基于学习的跟踪和融合模块，将先前碎片中的平面合并，形成连贯的全局平面重建。这样的设计允许Planar-Recon在每个片段中整合来自多个视图的观察结果和不同视图的时间信息，从而以低多边形几何形状精确连贯地重建场景抽象。实验表明，该方法在ScanNet数据集上达到了最先进的性能，同时具有实时性。代码可从项目页面获得:https://neu-vi.github.io/planarrecon/。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量