Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond

arXiv - CS - Graphics Pub Date : 2024-08-08 DOI:arxiv-2408.04586

Ravi Ramamoorthi

{"title":"Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond","authors":"Ravi Ramamoorthi","doi":"arxiv-2408.04586","DOIUrl":null,"url":null,"abstract":"Capturing and rendering novel views of complex real-world scenes is a\nlong-standing problem in computer graphics and vision, with applications in\naugmented and virtual reality, immersive experiences and 3D photography. The\nadvent of deep learning has enabled revolutionary advances in this area,\nclassically known as image-based rendering. However, previous approaches\nrequire intractably dense view sampling or provide little or no guidance for\nhow users should sample views of a scene to reliably render high-quality novel\nviews. Local light field fusion proposes an algorithm for practical view\nsynthesis from an irregular grid of sampled views that first expands each\nsampled view into a local light field via a multiplane image scene\nrepresentation, then renders novel views by blending adjacent local light\nfields. Crucially, we extend traditional plenoptic sampling theory to derive a\nbound that specifies precisely how densely users should sample views of a given\nscene when using our algorithm. We achieve the perceptual quality of Nyquist\nrate view sampling while using up to 4000x fewer views. Subsequent developments\nhave led to new scene representations for deep learning with view synthesis,\nnotably neural radiance fields, but the problem of sparse view synthesis from a\nsmall number of images has only grown in importance. We reprise some of the\nrecent results on sparse and even single image view synthesis, while posing the\nquestion of whether prescriptive sampling guidelines are feasible for the new\ngeneration of image-based rendering algorithms.","PeriodicalId":501174,"journal":{"name":"arXiv - CS - Graphics","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.04586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Capturing and rendering novel views of complex real-world scenes is a long-standing problem in computer graphics and vision, with applications in augmented and virtual reality, immersive experiences and 3D photography. The advent of deep learning has enabled revolutionary advances in this area, classically known as image-based rendering. However, previous approaches require intractably dense view sampling or provide little or no guidance for how users should sample views of a scene to reliably render high-quality novel views. Local light field fusion proposes an algorithm for practical view synthesis from an irregular grid of sampled views that first expands each sampled view into a local light field via a multiplane image scene representation, then renders novel views by blending adjacent local light fields. Crucially, we extend traditional plenoptic sampling theory to derive a bound that specifies precisely how densely users should sample views of a given scene when using our algorithm. We achieve the perceptual quality of Nyquist rate view sampling while using up to 4000x fewer views. Subsequent developments have led to new scene representations for deep learning with view synthesis, notably neural radiance fields, but the problem of sparse view synthesis from a small number of images has only grown in importance. We reprise some of the recent results on sparse and even single image view synthesis, while posing the question of whether prescriptive sampling guidelines are feasible for the new generation of image-based rendering algorithms.

查看原文本刊更多论文

视图合成采样：从局部光场融合到神经辐射场及其他

捕捉和渲染复杂现实世界场景的新颖视图是计算机图形学和视觉领域的长期难题，在增强现实和虚拟现实、沉浸式体验和三维摄影中都有应用。深度学习的出现使这一领域取得了革命性的进展，即经典的基于图像的渲染。然而，以前的方法需要高密度的视图采样，或者对于用户如何采样场景视图以可靠地渲染高质量的新视图几乎没有提供任何指导。局部光场融合提出了一种从不规则网格采样视图中合成实用视图的算法，该算法首先通过多平面图像场景呈现将每个采样视图扩展为一个局部光场，然后通过混合相邻的局部光场来渲染新视图。最重要的是，我们扩展了传统的全光景采样理论，从而得出了一套精确的方法，规定了用户在使用我们的算法时，对给定场景的视图进行采样的密度。我们实现了 Nyquistrate 视图采样的感知质量，而使用的视图数量却减少了 4000 倍。随后的发展为深度学习的视图合成带来了新的场景表示法，特别是神经辐射场，但从少量图像中进行稀疏视图合成的问题却越来越重要。我们重述了近期在稀疏甚至单张图像视图合成方面取得的一些成果，同时提出了一个问题：对于新一代基于图像的渲染算法来说，规范性的采样准则是否可行？

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Graphics

自引率

0.00%

发文量