NER-Net+: Seeing Motion at Nighttime With an Event Camera

IF 18.6
Haoyue Liu;Jinghan Xu;Shihan Peng;Yi Chang;Hanyu Zhou;Yuxing Duan;Lin Zhu;Yonghong Tian;Luxin Yan
{"title":"NER-Net+: Seeing Motion at Nighttime With an Event Camera","authors":"Haoyue Liu;Jinghan Xu;Shihan Peng;Yi Chang;Hanyu Zhou;Yuxing Duan;Lin Zhu;Yonghong Tian;Luxin Yan","doi":"10.1109/TPAMI.2025.3545936","DOIUrl":null,"url":null,"abstract":"We focus on a very challenging task: imaging at nighttime dynamic scenes. Conventional RGB cameras struggle with the trade-off between long exposure for low-light imaging and short exposure for capturing dynamic scenes. Event cameras react to dynamic changes, with their high temporal resolution (microsecond) and dynamic range (120 dB), and thus offer a promising alternative. However, existing methods are mostly based on simulated datasets due to the lack of paired event-clean image data for nighttime conditions, where the domain gap leads to performance limitations in real-world scenarios. Moreover, most existing event reconstruction methods are tailored for daytime data, overlooking issues unique to low-light events at night, such as strong noise, temporal trailing, and spatial non-uniformity, resulting in unsatisfactory reconstruction results. To address these challenges, we construct the first real paired low-light event dataset (RLED) through a co-axial imaging system, comprising 80,400 spatially and temporally aligned image GTs and low-light events, which provides a unified training and evaluation dataset for existing methods. We further conduct a comprehensive analysis of the causes and characteristics of strong noise, temporal trailing, and spatial non-uniformity in nighttime events, and propose a nighttime event reconstruction network (NER-Net+). It includes a learnable event timestamps calibration module (LETC) to correct the temporal trailing events and a non-stationary spatio-temporal information enhancement module (NSIE) to suppress sensor noise and spatial non-uniformity. Extensive experiments demonstrate that the proposed method outperforms state-of-the-art methods in visual quality and generalization on real-world nighttime datasets.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 6","pages":"4768-4786"},"PeriodicalIF":18.6000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10904298/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We focus on a very challenging task: imaging at nighttime dynamic scenes. Conventional RGB cameras struggle with the trade-off between long exposure for low-light imaging and short exposure for capturing dynamic scenes. Event cameras react to dynamic changes, with their high temporal resolution (microsecond) and dynamic range (120 dB), and thus offer a promising alternative. However, existing methods are mostly based on simulated datasets due to the lack of paired event-clean image data for nighttime conditions, where the domain gap leads to performance limitations in real-world scenarios. Moreover, most existing event reconstruction methods are tailored for daytime data, overlooking issues unique to low-light events at night, such as strong noise, temporal trailing, and spatial non-uniformity, resulting in unsatisfactory reconstruction results. To address these challenges, we construct the first real paired low-light event dataset (RLED) through a co-axial imaging system, comprising 80,400 spatially and temporally aligned image GTs and low-light events, which provides a unified training and evaluation dataset for existing methods. We further conduct a comprehensive analysis of the causes and characteristics of strong noise, temporal trailing, and spatial non-uniformity in nighttime events, and propose a nighttime event reconstruction network (NER-Net+). It includes a learnable event timestamps calibration module (LETC) to correct the temporal trailing events and a non-stationary spatio-temporal information enhancement module (NSIE) to suppress sensor noise and spatial non-uniformity. Extensive experiments demonstrate that the proposed method outperforms state-of-the-art methods in visual quality and generalization on real-world nighttime datasets.
NER-Net+:用事件相机在夜间观察运动
我们专注于一项非常具有挑战性的任务:夜间动态场景成像。传统的RGB相机很难在低光成像的长曝光和捕捉动态场景的短曝光之间进行权衡。事件相机对动态变化作出反应,具有高时间分辨率(微秒)和动态范围(120 dB),因此提供了一个有前途的替代方案。然而,由于缺乏夜间条件下的成对事件干净图像数据,现有方法大多基于模拟数据集,而夜间条件下的域间隙导致现实场景中的性能限制。此外,现有的事件重建方法大多针对白天数据,忽略了夜间弱光事件特有的强噪声、时间拖尾、空间不均匀性等问题,导致重建效果不理想。为了解决这些挑战,我们通过同轴成像系统构建了第一个真正的配对低光事件数据集(RLED),该数据集包括80,400个空间和时间对齐的图像gt和低光事件,为现有方法提供了统一的训练和评估数据集。我们进一步全面分析了夜间事件中强噪声、时间拖尾和空间非均匀性的原因和特征,并提出了夜间事件重建网络(NER-Net+)。它包括一个可学习的事件时间戳校准模块(LETC)来校正时间滞后事件和一个非平稳时空信息增强模块(NSIE)来抑制传感器噪声和空间非均匀性。大量的实验表明,该方法在真实世界夜间数据集的视觉质量和泛化方面优于最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信