Yuzhen Wu , Lingxue Wang , Lian Zhang , Mingkun Chen , Wenqu Zhao , Dezhi Zheng , Yi Cai
{"title":"Monocular thermal SLAM with neural radiance fields for 3D scene reconstruction","authors":"Yuzhen Wu , Lingxue Wang , Lian Zhang , Mingkun Chen , Wenqu Zhao , Dezhi Zheng , Yi Cai","doi":"10.1016/j.neucom.2024.129041","DOIUrl":null,"url":null,"abstract":"<div><div>Visual simultaneous localization and mapping (SLAM) faces significant challenges in environments with variable lighting and smoke. Excelling in such visually degraded settings, thermal imaging captures scene radiance effectively. To address the limitations of traditional thermal SLAM in 3D scene reconstruction, we propose ThermalSLAM-NeRF, a novel integration of thermal SLAM with neural radiance fields (NeRF). This method significantly enhances the quality of high dynamic range thermal images by improving their signal-to-noise ratio, contrast, and detail. It also employs online photometric calibration to ensure grayscale consistency between consecutive frames. We utilize a sparse direct method for pose estimation, selecting keyframes based on photometric error and tracking quality. The NeRF map is reconstructed using a multi-view keyframe sequence. Our evaluations on datasets containing over 15,000 thermal images show that ThermalSLAM-NeRF achieves an average improvement of 59.30% in trajectory accuracy over existing state-of-the-art SLAM methods. This approach uniquely tracks all sequences and constructs comprehensive NeRF maps, enabling robust and precise pose estimation without the need for extensive pre-training.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"617 ","pages":"Article 129041"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224018125","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Visual simultaneous localization and mapping (SLAM) faces significant challenges in environments with variable lighting and smoke. Excelling in such visually degraded settings, thermal imaging captures scene radiance effectively. To address the limitations of traditional thermal SLAM in 3D scene reconstruction, we propose ThermalSLAM-NeRF, a novel integration of thermal SLAM with neural radiance fields (NeRF). This method significantly enhances the quality of high dynamic range thermal images by improving their signal-to-noise ratio, contrast, and detail. It also employs online photometric calibration to ensure grayscale consistency between consecutive frames. We utilize a sparse direct method for pose estimation, selecting keyframes based on photometric error and tracking quality. The NeRF map is reconstructed using a multi-view keyframe sequence. Our evaluations on datasets containing over 15,000 thermal images show that ThermalSLAM-NeRF achieves an average improvement of 59.30% in trajectory accuracy over existing state-of-the-art SLAM methods. This approach uniquely tracks all sequences and constructs comprehensive NeRF maps, enabling robust and precise pose estimation without the need for extensive pre-training.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.