FastFusion: Deep stereo-LiDAR fusion for real-time high-precision dense depth sensing

IF 4.2 2区计算机科学 Q2 ROBOTICS

Journal of Field Robotics Pub Date : 2023-06-08 DOI:10.1002/rob.22216

Haitao Meng, Changcai Li, Chonghao Zhong, Jianfeng Gu, Gang Chen, Alois Knoll

{"title":"FastFusion: Deep stereo-LiDAR fusion for real-time high-precision dense depth sensing","authors":"Haitao Meng, Changcai Li, Chonghao Zhong, Jianfeng Gu, Gang Chen, Alois Knoll","doi":"10.1002/rob.22216","DOIUrl":null,"url":null,"abstract":"<p>Light detection and ranging (LiDAR) and stereo cameras are two generally used solutions for perceiving 3D information. The complementary properties of these two sensor modalities motivate a fusion to derive practicable depth sensing toward real-world applications. Promoted by deep neural network (DNN) techniques, recent works achieve superior performance on accuracy. However, the complex architecture and the sheer number of DNN parameters often lead to poor generalization capacity and non-real-time computing. In this paper, we present FastFusion, a three-stage stereo-LiDAR deep fusion scheme, which integrates the LiDAR priors into each step of classical stereo-matching taxonomy, gaining high-precision dense depth sensing in a real-time manner. We integrate stereo-LiDAR information by taking advantage of a compact binary neural network and utilize the proposed cross-based LiDAR trust aggregation to further fuse the sparse LiDAR measurements in the back-end of stereo matching. To align the photometrical of the input image and the depth of the estimation, we introduce a refinement network to guarantee consistency. More importantly, we present a graphic processing unit-based acceleration framework for providing a low-latency implementation of FastFusion, gaining both accuracy improvement and real-time responsiveness. In the experiments, we demonstrate the effectiveness and practicability of FastFusion, which obtains a significant speedup over state-of-the-art baselines while achieving comparable accuracy on depth sensing. The video demo for real-time depth estimation of FastFusion on the real-world driving scenario is available at https://youtu.be/nP7cls2BA8s.</p>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"40 7","pages":"1804-1816"},"PeriodicalIF":4.2000,"publicationDate":"2023-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22216","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Light detection and ranging (LiDAR) and stereo cameras are two generally used solutions for perceiving 3D information. The complementary properties of these two sensor modalities motivate a fusion to derive practicable depth sensing toward real-world applications. Promoted by deep neural network (DNN) techniques, recent works achieve superior performance on accuracy. However, the complex architecture and the sheer number of DNN parameters often lead to poor generalization capacity and non-real-time computing. In this paper, we present FastFusion, a three-stage stereo-LiDAR deep fusion scheme, which integrates the LiDAR priors into each step of classical stereo-matching taxonomy, gaining high-precision dense depth sensing in a real-time manner. We integrate stereo-LiDAR information by taking advantage of a compact binary neural network and utilize the proposed cross-based LiDAR trust aggregation to further fuse the sparse LiDAR measurements in the back-end of stereo matching. To align the photometrical of the input image and the depth of the estimation, we introduce a refinement network to guarantee consistency. More importantly, we present a graphic processing unit-based acceleration framework for providing a low-latency implementation of FastFusion, gaining both accuracy improvement and real-time responsiveness. In the experiments, we demonstrate the effectiveness and practicability of FastFusion, which obtains a significant speedup over state-of-the-art baselines while achieving comparable accuracy on depth sensing. The video demo for real-time depth estimation of FastFusion on the real-world driving scenario is available at https://youtu.be/nP7cls2BA8s.

查看原文本刊更多论文

FastFusion:用于实时高精度密集深度传感的深度立体激光雷达融合

光探测和测距(LiDAR)和立体相机是两种常用的3D信息感知解决方案。这两种传感器模式的互补特性激发了一种融合，以获得对现实世界应用的实际深度传感。在深度神经网络(DNN)技术的推动下，近年来的研究在精度方面取得了优异的成绩。然而，复杂的结构和大量的深度神经网络参数往往导致较差的泛化能力和非实时计算。在本文中，我们提出了一种三阶段立体-激光雷达深度融合方案FastFusion，该方案将激光雷达先验信息整合到经典立体匹配分类的每一步中，实时获得高精度的密集深度感知。我们利用紧凑的二元神经网络整合立体激光雷达信息，并利用所提出的基于交叉的激光雷达信任聚合进一步融合立体匹配后端的稀疏激光雷达测量值。为了使输入图像的光度和估计的深度保持一致，我们引入了一个改进网络来保证一致性。更重要的是，我们提出了一个基于图形处理单元的加速框架，以提供低延迟的FastFusion实现，同时提高了精度和实时响应能力。在实验中，我们证明了FastFusion的有效性和实用性，它在最先进的基线上获得了显着的加速，同时在深度传感上达到了相当的精度。FastFusion在真实驾驶场景下的实时深度估计视频演示可在https://youtu.be/nP7cls2BA8s上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Field Robotics 工程技术-机器人学

CiteScore

15.00

自引率

3.60%

发文量

审稿时长

6 months

期刊介绍： The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments. The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.