Unbounded-GS: Extending 3D Gaussian Splatting With Hybrid Representation for Unbounded Large-Scale Scene Reconstruction

IF 4.6 2区计算机科学 Q2 ROBOTICS

IEEE Robotics and Automation Letters Pub Date : 2024-11-08 DOI:10.1109/LRA.2024.3494652

Wanzhang Li;Fukun Yin;Wen Liu;Yiying Yang;Xin Chen;Biao Jiang;Gang Yu;Jiayuan Fan

{"title":"Unbounded-GS: Extending 3D Gaussian Splatting With Hybrid Representation for Unbounded Large-Scale Scene Reconstruction","authors":"Wanzhang Li;Fukun Yin;Wen Liu;Yiying Yang;Xin Chen;Biao Jiang;Gang Yu;Jiayuan Fan","doi":"10.1109/LRA.2024.3494652","DOIUrl":null,"url":null,"abstract":"Modeling large-scale scenes from multi-view images is challenging due to the trade-off dilemma between visual quality and computational cost. Existing NeRF-based methods have made advancements in neural implicit representation through volumetric ray-marching, but still struggle to deal with cubically growing sampling space in large-scale scenes. Fortunately, the rendering approach based on 3D Gaussian splatting (3DGS) has shown promising results, inspiring further exploration in the splatting setting. However, 3DGS has the limitation of inadequate Gaussian points for modeling distant backgrounds, leading to “splotchy” artifacts. To address this problem, we introduce a novel hybrid neural representation called Unbounded 3D Gaussian. For foreground area, we employs an explicit 3D Gaussian representation to efficiently model the geometry and appearance through splatting weighted Gaussians. For far-away background, we additionally introduce an implicit module comprising Multi-layer Perceptions (MLPs) to directly predict far-away background colors from positional encodings of view positions and ray directions. Furthermore, we design a seamless blending mechanism between the color predictions of the explicit splatting and implicit branches to reconstruct holistic scenes. Extensive experiments demonstrate that our proposed Unbounded-GS inherits the advantages of both faster convergence and high-fidelity rendering quality.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11529-11536"},"PeriodicalIF":4.6000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Robotics and Automation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10747249/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Modeling large-scale scenes from multi-view images is challenging due to the trade-off dilemma between visual quality and computational cost. Existing NeRF-based methods have made advancements in neural implicit representation through volumetric ray-marching, but still struggle to deal with cubically growing sampling space in large-scale scenes. Fortunately, the rendering approach based on 3D Gaussian splatting (3DGS) has shown promising results, inspiring further exploration in the splatting setting. However, 3DGS has the limitation of inadequate Gaussian points for modeling distant backgrounds, leading to “splotchy” artifacts. To address this problem, we introduce a novel hybrid neural representation called Unbounded 3D Gaussian. For foreground area, we employs an explicit 3D Gaussian representation to efficiently model the geometry and appearance through splatting weighted Gaussians. For far-away background, we additionally introduce an implicit module comprising Multi-layer Perceptions (MLPs) to directly predict far-away background colors from positional encodings of view positions and ray directions. Furthermore, we design a seamless blending mechanism between the color predictions of the explicit splatting and implicit branches to reconstruct holistic scenes. Extensive experiments demonstrate that our proposed Unbounded-GS inherits the advantages of both faster convergence and high-fidelity rendering quality.

查看原文本刊更多论文

无界-GS：利用混合表示法扩展三维高斯拼接，实现无界大规模场景重建

由于视觉质量和计算成本之间的权衡难题，通过多视角图像对大规模场景进行建模具有挑战性。现有的基于 NeRF 的方法通过体积光线行进在神经隐式表示方面取得了进步，但在处理大规模场景中立方体增长的采样空间时仍有困难。幸运的是，基于三维高斯拼接（3DGS）的渲染方法取得了可喜的成果，激发了对拼接设置的进一步探索。然而，3DGS 有其局限性，即高斯点不足以对远处的背景进行建模，从而导致 "斑点 "现象。为了解决这个问题，我们引入了一种名为无界三维高斯的新型混合神经表示法。对于前景区域，我们采用明确的三维高斯表示法，通过拼接加权高斯有效地建立几何和外观模型。对于远处的背景，我们还引入了由多层感知（MLP）组成的隐式模块，通过视图位置和光线方向的位置编码直接预测远处背景的颜色。此外，我们还在显式拼接和隐式分支的颜色预测之间设计了一种无缝混合机制，以重建整体场景。广泛的实验证明，我们提出的无边界-GS 继承了更快收敛和高保真渲染质量的优点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Robotics and Automation Letters Computer Science-Computer Science Applications

CiteScore

9.60

自引率

15.40%

发文量

1428

期刊介绍： The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.