Burst Vision Using Single-Photon Cameras

2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Pub Date : 2023-01-01 DOI:10.1109/WACV56688.2023.00534

Sizhuo Ma, Paul Mos, E. Charbon, Mohit Gupta

{"title":"Burst Vision Using Single-Photon Cameras","authors":"Sizhuo Ma, Paul Mos, E. Charbon, Mohit Gupta","doi":"10.1109/WACV56688.2023.00534","DOIUrl":null,"url":null,"abstract":"Single-photon avalanche diodes (SPADs) are novel image sensors that record the arrival of individual photons at extremely high temporal resolution. In the past, they were only available as single pixels or small-format arrays, for various active imaging applications such as LiDAR and microscopy. Recently, high-resolution SPAD arrays up to 3.2 megapixel have been realized, which for the first time may be able to capture sufficient spatial details for general computer vision tasks, purely as a passive sensor. However, existing vision algorithms are not directly applicable on the binary data captured by SPADs. In this paper, we propose developing quanta vision algorithms based on burst processing for extracting scene information from SPAD photon streams. With extensive real-world data, we demonstrate that current SPAD arrays, along with burst processing as an example plug-and-play algorithm, are capable of a wide range of downstream vision tasks in extremely challenging imaging conditions including fast motion, low light (< 5 lux) and high dynamic range. To our knowledge, this is the first attempt to demonstrate the capabilities of SPAD sensors for a wide gamut of real-world computer vision tasks including object detection, pose estimation, SLAM, and text recognition. We hope this work will inspire future research into developing computer vision algorithms in extreme scenarios using single-photon cameras.","PeriodicalId":270631,"journal":{"name":"2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WACV56688.2023.00534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Single-photon avalanche diodes (SPADs) are novel image sensors that record the arrival of individual photons at extremely high temporal resolution. In the past, they were only available as single pixels or small-format arrays, for various active imaging applications such as LiDAR and microscopy. Recently, high-resolution SPAD arrays up to 3.2 megapixel have been realized, which for the first time may be able to capture sufficient spatial details for general computer vision tasks, purely as a passive sensor. However, existing vision algorithms are not directly applicable on the binary data captured by SPADs. In this paper, we propose developing quanta vision algorithms based on burst processing for extracting scene information from SPAD photon streams. With extensive real-world data, we demonstrate that current SPAD arrays, along with burst processing as an example plug-and-play algorithm, are capable of a wide range of downstream vision tasks in extremely challenging imaging conditions including fast motion, low light (< 5 lux) and high dynamic range. To our knowledge, this is the first attempt to demonstrate the capabilities of SPAD sensors for a wide gamut of real-world computer vision tasks including object detection, pose estimation, SLAM, and text recognition. We hope this work will inspire future research into developing computer vision algorithms in extreme scenarios using single-photon cameras.

查看原文本刊更多论文

使用单光子相机的突发视觉

单光子雪崩二极管(spad)是一种新型的图像传感器，可以以极高的时间分辨率记录单个光子的到达。在过去，它们只能以单像素或小格式阵列的形式提供，用于各种主动成像应用，如激光雷达和显微镜。最近，高达320万像素的高分辨率SPAD阵列已经实现，这是第一次能够捕捉到足够的空间细节，用于一般的计算机视觉任务，纯粹作为被动传感器。然而，现有的视觉算法不能直接应用于SPADs捕获的二进制数据。本文提出了一种基于突发处理的量子视觉算法，用于从SPAD光子流中提取场景信息。通过大量的实际数据，我们证明了当前的SPAD阵列，以及突发处理作为即插即用算法的例子，能够在极具挑战性的成像条件下完成广泛的下游视觉任务，包括快速运动、低光(< 5勒克斯)和高动态范围。据我们所知，这是第一次尝试展示SPAD传感器在现实世界中广泛的计算机视觉任务中的能力，包括物体检测、姿态估计、SLAM和文本识别。我们希望这项工作将激发未来使用单光子相机在极端情况下开发计算机视觉算法的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

自引率

0.00%

发文量