Yuang Chen, Xiaoyu Chen, Canhui Zhou, Jing Han, Lianfa Bai
{"title":"Infrared NeRF reconstruction based on perceptual pose and high-frequency-invariant attention","authors":"Yuang Chen, Xiaoyu Chen, Canhui Zhou, Jing Han, Lianfa Bai","doi":"10.1016/j.sigpro.2025.110012","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared images present significant challenges in novel view synthesis (NVS) due to low resolution and limited texture features. Additionally, the adaptive gain mechanism in infrared cameras leads to variations in the illumination across different viewpoints. These all can result in potential failures when utilizing infrared images for reconstructing Neural Radiance Fields (NeRF). To address these issues, we propose an end-to-end framework for pose estimation and rendering optimization. Specifically, perceptual pose optimization is used to estimate more accurate camera pose. To enhance the matching accuracy of multi scene corresponding points, we retain high-confidence camera poses while jointly optimizing both the scene and low-confidence poses. This allows for high-quality 3D scenes with accurate pose estimation for infrared images. The high-frequency-invariant attention module is designed to focus on the high-frequency features not easily captured and invariant edge information in infrared images by densely sampling, which can use high-frequency region to compensate for the low-frequency region differences caused by the adaptive gain mechanism. We evaluated our approach on datasets consisting of near-infrared, mid-wave infrared, and long-wave infrared images. Our method successfully reconstructs NeRF using infrared images and outperforms the state-of-the-art methods in terms of performance. The dataset is available at: <span><span>https://github.com/YuangChen111/IR-NeRF</span><svg><path></path></svg></span></div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"235 ","pages":"Article 110012"},"PeriodicalIF":3.4000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425001264","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Infrared images present significant challenges in novel view synthesis (NVS) due to low resolution and limited texture features. Additionally, the adaptive gain mechanism in infrared cameras leads to variations in the illumination across different viewpoints. These all can result in potential failures when utilizing infrared images for reconstructing Neural Radiance Fields (NeRF). To address these issues, we propose an end-to-end framework for pose estimation and rendering optimization. Specifically, perceptual pose optimization is used to estimate more accurate camera pose. To enhance the matching accuracy of multi scene corresponding points, we retain high-confidence camera poses while jointly optimizing both the scene and low-confidence poses. This allows for high-quality 3D scenes with accurate pose estimation for infrared images. The high-frequency-invariant attention module is designed to focus on the high-frequency features not easily captured and invariant edge information in infrared images by densely sampling, which can use high-frequency region to compensate for the low-frequency region differences caused by the adaptive gain mechanism. We evaluated our approach on datasets consisting of near-infrared, mid-wave infrared, and long-wave infrared images. Our method successfully reconstructs NeRF using infrared images and outperforms the state-of-the-art methods in terms of performance. The dataset is available at: https://github.com/YuangChen111/IR-NeRF
期刊介绍:
Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing.
Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.