Sparse LiDAR and Stereo Fusion (SLS-Fusion) for Depth Estimation and 3D Object Detection

Nguyen Anh Minh Mai, Pierre Duthon, L. Khoudour, Alain Crouzil, S. Velastín
{"title":"Sparse LiDAR and Stereo Fusion (SLS-Fusion) for Depth Estimation and 3D Object Detection","authors":"Nguyen Anh Minh Mai, Pierre Duthon, L. Khoudour, Alain Crouzil, S. Velastín","doi":"10.1049/icp.2021.1442","DOIUrl":null,"url":null,"abstract":"The ability to accurately detect and localize objects is recognized as being the most important for the perception of selfdriving cars. From 2D to 3D object detection, the most difficult is to determine the distance from the ego-vehicle to objects. Expensive technology like LiDAR can provide a precise and accurate depth information, so most studies have tended to focus on this sensor showing a performance gap between LiDAR-based methods and camera-based methods. Although many authors have investigated how to fuse LiDAR with RGB cameras, as far as we know there are no studies to fuse LiDAR and stereo in a deep neural network for the 3D object detection task. This paper presents SLS-Fusion, a new approach to fuse data from 4-beam LiDAR and a stereo camera via a neural network for depth estimation to achieve better dense depth maps and thereby improves 3D object detection performance. Since 4-beam LiDAR is cheaper than the well-known 64-beam LiDAR, this approach is also classified as a low-cost sensorsbased method. Through evaluation on the KITTI benchmark, it is shown that the proposed method significantly improves depth estimation performance compared to a baseline method. 
Also when applying it to 3D object detection, a new state of the art on low-cost sensor based method is achieved.","PeriodicalId":431144,"journal":{"name":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/icp.2021.1442","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

The ability to accurately detect and localize objects is recognized as the most important capability for the perception systems of self-driving cars. Moving from 2D to 3D object detection, the most difficult task is determining the distance from the ego-vehicle to objects. Expensive technology such as LiDAR can provide precise and accurate depth information, so most studies have tended to focus on this sensor, showing a performance gap between LiDAR-based and camera-based methods. Although many authors have investigated how to fuse LiDAR with RGB cameras, as far as we know there are no studies that fuse LiDAR and stereo images in a deep neural network for the 3D object detection task. This paper presents SLS-Fusion, a new approach that fuses data from a 4-beam LiDAR and a stereo camera via a neural network for depth estimation, yielding denser and more accurate depth maps and thereby improving 3D object detection performance. Since a 4-beam LiDAR is cheaper than the well-known 64-beam LiDAR, this approach is also classified as a low-cost-sensor-based method. Evaluation on the KITTI benchmark shows that the proposed method significantly improves depth estimation performance compared to a baseline method. When applied to 3D object detection, it also achieves a new state of the art among low-cost-sensor-based methods.
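The abstract does not describe the fusion network itself, so the following is only an illustrative sketch (not the authors' architecture) of the underlying idea: a stereo camera gives a dense but noisy depth map, while a low-beam LiDAR gives sparse but accurate returns, and the two can be combined so that LiDAR measurements take precedence wherever they exist. All array names and values below are hypothetical toy data.

```python
import numpy as np

def fuse_depth(stereo_depth, lidar_depth, lidar_mask):
    """Naive late fusion: trust the sparse LiDAR measurement wherever one
    exists, fall back to the dense stereo estimate elsewhere.
    (SLS-Fusion instead learns the fusion inside a neural network.)"""
    return np.where(lidar_mask, lidar_depth, stereo_depth)

# Toy 4x6 depth maps in metres: stereo is dense but biased by +0.5 m,
# LiDAR covers only one "beam" row, sampled every other pixel.
stereo = np.full((4, 6), 10.5)
lidar = np.zeros((4, 6))
mask = np.zeros((4, 6), dtype=bool)
lidar[1, ::2] = 10.0      # sparse, accurate returns along one scan line
mask[1, ::2] = True

fused = fuse_depth(stereo, lidar, mask)
print(fused[1])           # LiDAR values replace stereo on covered pixels
```

A learned fusion can do much better than this hard override, since the network can propagate the accurate sparse measurements into neighbouring pixels rather than correcting only the pixels a beam happens to hit.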