ThermoNeRF: A multimodal Neural Radiance Field for joint RGB-thermal novel view synthesis of building facades

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Mariam Hassan , Florent Forest , Olga Fink , Malcolm Mielle
{"title":"ThermoNeRF: A multimodal Neural Radiance Field for joint RGB-thermal novel view synthesis of building facades","authors":"Mariam Hassan ,&nbsp;Florent Forest ,&nbsp;Olga Fink ,&nbsp;Malcolm Mielle","doi":"10.1016/j.aei.2025.103345","DOIUrl":null,"url":null,"abstract":"<div><div>Thermal scene reconstruction holds great potential for various applications, such as building energy analysis and non-destructive infrastructure testing. However, existing methods rely on dense scene measurements and use RGB images for 3D reconstruction, incorporating thermal data only through a post-hoc projection. Due to the lower resolution of thermal cameras and the challenges of RGB/Thermal camera calibration, this post-hoc projection often results in spatial discrepancies between temperatures projected onto the 3D model and real temperatures at the surface. We propose ThermoNeRF, a novel multimodal Neural Radiance Fields (NerF) that renders new RGB and thermal views of a scene with joint optimization of the geometry and thermal information while preventing cross-modal interference. To compensate for the lack of texture in thermal images, ThermoNeRF leverages paired RGB and thermal images to learn scene geometry while maintaining separate networks for reconstructing RGB color and temperature values, ensuring accurate and modality-specific representations. We also introduce ThermoScenes, a dataset of paired RGB+thermal images comprising 8 scenes of building facades and 8 scenes of everyday objects enabling evaluation in diverse scenarios. On ThermoScenes, ThermoNeRF achieves an average mean absolute error of 1.13 °C for buildings and 0.41 °C for other scenes when predicting temperatures of previously unobserved views. This improves accuracy by over 50% compared to concatenated RGB+thermal input in standard NeRF. While ThermoNeRF performs well on aligned RGB-thermal images, future work could address misaligned or unpaired data for better generalization. <span><span>Code</span><svg><path></path></svg></span> and <span><span>dataset</span><svg><path></path></svg></span> are available online.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103345"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002381","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Thermal scene reconstruction holds great potential for various applications, such as building energy analysis and non-destructive infrastructure testing. However, existing methods rely on dense scene measurements and use RGB images for 3D reconstruction, incorporating thermal data only through a post-hoc projection. Due to the lower resolution of thermal cameras and the challenges of RGB/Thermal camera calibration, this post-hoc projection often results in spatial discrepancies between temperatures projected onto the 3D model and real temperatures at the surface. We propose ThermoNeRF, a novel multimodal Neural Radiance Fields (NerF) that renders new RGB and thermal views of a scene with joint optimization of the geometry and thermal information while preventing cross-modal interference. To compensate for the lack of texture in thermal images, ThermoNeRF leverages paired RGB and thermal images to learn scene geometry while maintaining separate networks for reconstructing RGB color and temperature values, ensuring accurate and modality-specific representations. We also introduce ThermoScenes, a dataset of paired RGB+thermal images comprising 8 scenes of building facades and 8 scenes of everyday objects enabling evaluation in diverse scenarios. On ThermoScenes, ThermoNeRF achieves an average mean absolute error of 1.13 °C for buildings and 0.41 °C for other scenes when predicting temperatures of previously unobserved views. This improves accuracy by over 50% compared to concatenated RGB+thermal input in standard NeRF. While ThermoNeRF performs well on aligned RGB-thermal images, future work could address misaligned or unpaired data for better generalization. Code and dataset are available online.

Abstract Image

ThermoNeRF:用于建筑立面联合rgb -热新视图合成的多模态神经辐射场
热场景重建在建筑能耗分析、基础设施无损检测等领域具有广阔的应用前景。然而,现有的方法依赖于密集的场景测量,并使用RGB图像进行3D重建,仅通过事后投影结合热数据。由于热像仪的分辨率较低以及RGB/热像仪校准的挑战,这种后期投影通常会导致投影到3D模型上的温度与地表实际温度之间的空间差异。我们提出了ThermoNeRF,一种新型的多模态神经辐射场(NerF),通过联合优化几何和热信息来呈现场景的新RGB和热视图,同时防止跨模态干扰。为了弥补热图像中缺乏纹理,ThermoNeRF利用配对的RGB和热图像来学习场景几何,同时保持单独的网络来重建RGB颜色和温度值,确保准确和模态特定的表示。我们还介绍了ThermoScenes,这是一个配对的RGB+热图像数据集,包括8个建筑立面场景和8个日常物品场景,可以在不同场景中进行评估。在ThermoScenes上,ThermoNeRF在预测以前未观察到的视图的温度时,对建筑物的平均绝对误差为1.13°C,对其他场景的平均绝对误差为0.41°C。与标准NeRF中的串联RGB+热输入相比,这提高了50%以上的精度。虽然ThermoNeRF在对齐的rgb热图像上表现良好,但未来的工作可以解决不对齐或未配对的数据,以更好地推广。代码和数据集可在线获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信