CamP: Camera Preconditioning for Neural Radiance Fields

Keunhong Park, P. Henzler, B. Mildenhall, J. Barron, Ricardo Martin-Brualla
{"title":"CamP: Camera Preconditioning for Neural Radiance Fields","authors":"Keunhong Park, P. Henzler, B. Mildenhall, J. Barron, Ricardo Martin-Brualla","doi":"10.1145/3618321","DOIUrl":null,"url":null,"abstract":"Neural Radiance Fields (NeRF) can be optimized to obtain high-fidelity 3D scene reconstructions of objects and large-scale scenes. However, NeRFs require accurate camera parameters as input --- inaccurate camera parameters result in blurry renderings. Extrinsic and intrinsic camera parameters are usually estimated using Structure-from-Motion (SfM) methods as a pre-processing step to NeRF, but these techniques rarely yield perfect estimates. Thus, prior works have proposed jointly optimizing camera parameters alongside a NeRF, but these methods are prone to local minima in challenging settings. In this work, we analyze how different camera parameterizations affect this joint optimization problem, and observe that standard parameterizations exhibit large differences in magnitude with respect to small perturbations, which can lead to an ill-conditioned optimization problem. We propose using a proxy problem to compute a whitening transform that eliminates the correlation between camera parameters and normalizes their effects, and we propose to use this transform as a preconditioner for the camera parameters during joint optimization. Our preconditioned camera optimization significantly improves reconstruction quality on scenes from the Mip-NeRF 360 dataset: we reduce error rates (RMSE) by 67% compared to state-of-the-art NeRF approaches that do not optimize for cameras like Zip-NeRF, and by 29% relative to state-of-the-art joint optimization approaches using the camera parameterization of SCNeRF. Our approach is easy to implement, does not significantly increase runtime, can be applied to a wide variety of camera parameterizations, and can straightforwardly be incorporated into other NeRF-like models.","PeriodicalId":7077,"journal":{"name":"ACM Transactions on Graphics (TOG)","volume":"26 1","pages":"1 - 11"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Graphics (TOG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3618321","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Neural Radiance Fields (NeRF) can be optimized to obtain high-fidelity 3D scene reconstructions of objects and large-scale scenes. However, NeRFs require accurate camera parameters as input --- inaccurate camera parameters result in blurry renderings. Extrinsic and intrinsic camera parameters are usually estimated using Structure-from-Motion (SfM) methods as a pre-processing step to NeRF, but these techniques rarely yield perfect estimates. Thus, prior works have proposed jointly optimizing camera parameters alongside a NeRF, but these methods are prone to local minima in challenging settings. In this work, we analyze how different camera parameterizations affect this joint optimization problem, and observe that standard parameterizations exhibit large differences in magnitude with respect to small perturbations, which can lead to an ill-conditioned optimization problem. We propose using a proxy problem to compute a whitening transform that eliminates the correlation between camera parameters and normalizes their effects, and we propose to use this transform as a preconditioner for the camera parameters during joint optimization. Our preconditioned camera optimization significantly improves reconstruction quality on scenes from the Mip-NeRF 360 dataset: we reduce error rates (RMSE) by 67% compared to state-of-the-art NeRF approaches that do not optimize for cameras like Zip-NeRF, and by 29% relative to state-of-the-art joint optimization approaches using the camera parameterization of SCNeRF. Our approach is easy to implement, does not significantly increase runtime, can be applied to a wide variety of camera parameterizations, and can straightforwardly be incorporated into other NeRF-like models.
CamP:神经辐射场的相机预处理
神经辐射场(NeRF)可以通过优化获得物体和大型场景的高保真三维场景重建。然而,神经辐射场需要精确的相机参数作为输入--相机参数不准确会导致渲染效果模糊。作为 NeRF 的预处理步骤,通常使用运动结构(SfM)方法估算外在和内在相机参数,但这些技术很少能获得完美的估算结果。因此,之前的研究提出了在 NeRF 的同时联合优化摄像机参数的方法,但这些方法在具有挑战性的环境中容易出现局部最小值。在这项工作中,我们分析了不同的相机参数设置如何影响联合优化问题,并观察到标准参数设置在小扰动方面表现出巨大的差异,这会导致优化问题条件不佳。我们建议使用代理问题来计算白化变换,以消除摄像机参数之间的相关性,并将其影响归一化,我们还建议在联合优化过程中将此变换用作摄像机参数的前提条件。在 Mip-NeRF 360 数据集的场景上,我们的相机优化前提条件显著提高了重建质量:与 Zip-NeRF 等不对相机进行优化的最先进 NeRF 方法相比,我们将错误率(RMSE)降低了 67%,与使用 SCNeRF 相机参数化的最先进联合优化方法相比,我们将错误率(RMSE)降低了 29%。我们的方法易于实现,不会显著增加运行时间,可应用于多种相机参数化,并可直接集成到其他类似 NeRF 的模型中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信