VI-eye: semantic-based 3D point cloud registration for infrastructure-assisted autonomous driving

Yuze He, Li Ma, Zhehao Jiang, Yi Tang, G. Xing
{"title":"VI-eye: semantic-based 3D point cloud registration for infrastructure-assisted autonomous driving","authors":"Yuze He, Li Ma, Zhehao Jiang, Yi Tang, G. Xing","doi":"10.1145/3447993.3483276","DOIUrl":null,"url":null,"abstract":"Infrastructure-assisted autonomous driving is an emerging paradigm that aims to make affordable autonomous vehicles a reality. A key technology for realizing this vision is real-time point cloud registration which allows a vehicle to fuse the 3D point clouds generated by its own LiDAR and those on roadside infrastructures such as smart lampposts, which can deliver increased sensing range, more robust object detection, and centimeter-level navigation. Unfortunately, the existing methods for point cloud registration assume two clouds to share a similar perspective and large overlap, which result in significant delay and inaccuracy in real-world infrastructure-assisted driving settings. This paper proposes VI-Eye - the first system that can align vehicle-infrastructure point clouds at centimeter accuracy in real-time. Our key idea is to exploit traffic domain knowledge by detecting a set of key semantic objects including road, lane lines, curbs, and traffic signs. Based on the inherent regular geometries of such semantic objects, VI-Eye extracts a small number of saliency points and leverage them to achieve real-time registration of two point clouds. By allowing vehicles and infrastructures to extract the semantic information in parallel, VI-Eye leads to a highly scalable architecture for infrastructure-assisted autonomous driving. To evaluate the performance of VI-Eye, we collect two new multiview LiDAR point cloud datasets on an indoor autonomous driving testbed and a campus smart lamppost testbed, respectively. They contain total 915 point cloud pairs and cover three roads of 1.12km. Experiment results show that VI-Eye achieves centimeter-level accuracy within around 0.2s, and delivers a 5X improvement in accuracy and 2X speedup over state-of-the-art baselines.","PeriodicalId":177431,"journal":{"name":"Proceedings of the 27th Annual International Conference on Mobile Computing and Networking","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 27th Annual International Conference on Mobile Computing and Networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3447993.3483276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24

Abstract

Infrastructure-assisted autonomous driving is an emerging paradigm that aims to make affordable autonomous vehicles a reality. A key technology for realizing this vision is real-time point cloud registration, which allows a vehicle to fuse the 3D point clouds generated by its own LiDAR with those from roadside infrastructure such as smart lampposts, delivering increased sensing range, more robust object detection, and centimeter-level navigation. Unfortunately, existing point cloud registration methods assume that the two clouds share a similar perspective and a large overlap, which results in significant delay and inaccuracy in real-world infrastructure-assisted driving settings. This paper proposes VI-Eye, the first system that can align vehicle-infrastructure point clouds at centimeter accuracy in real time. Our key idea is to exploit traffic domain knowledge by detecting a set of key semantic objects, including roads, lane lines, curbs, and traffic signs. Based on the inherently regular geometries of such semantic objects, VI-Eye extracts a small number of saliency points and leverages them to achieve real-time registration of the two point clouds. By allowing vehicles and infrastructure to extract semantic information in parallel, VI-Eye leads to a highly scalable architecture for infrastructure-assisted autonomous driving. To evaluate the performance of VI-Eye, we collect two new multi-view LiDAR point cloud datasets on an indoor autonomous driving testbed and a campus smart lamppost testbed, respectively. They contain a total of 915 point cloud pairs and cover three roads totaling 1.12 km. Experiment results show that VI-Eye achieves centimeter-level accuracy within around 0.2 s, and delivers a 5X improvement in accuracy and a 2X speedup over state-of-the-art baselines.
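At the core of any registration pipeline built on a small set of matched points is recovering the rigid transform between the two clouds. The sketch below illustrates that generic step, not VI-Eye's actual algorithm: it assumes saliency points have already been extracted and matched across the vehicle cloud P and the infrastructure cloud Q, and recovers the least-squares rotation and translation using the classic Kabsch/SVD solution. The function name rigid_align and the synthetic test points are illustrative only.

    import numpy as np

    def rigid_align(P: np.ndarray, Q: np.ndarray):
        """Return (R, t) minimizing sum ||R @ p + t - q||^2 over matched rows of P, Q."""
        p_mean, q_mean = P.mean(axis=0), Q.mean(axis=0)
        Pc, Qc = P - p_mean, Q - q_mean            # center both point sets
        H = Pc.T @ Qc                              # 3x3 cross-covariance matrix
        U, _, Vt = np.linalg.svd(H)
        d = np.sign(np.linalg.det(Vt.T @ U.T))     # guard against reflections
        R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
        t = q_mean - R @ p_mean
        return R, t

    # Toy usage: recover a known transform from four synthetic saliency points.
    rng = np.random.default_rng(0)
    P = rng.normal(size=(4, 3))
    theta = 0.3
    R_true = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                       [np.sin(theta),  np.cos(theta), 0.0],
                       [0.0,            0.0,           1.0]])
    Q = P @ R_true.T + np.array([1.0, -2.0, 0.5])
    R, t = rigid_align(P, Q)
    assert np.allclose(P @ R.T + t, Q, atol=1e-8)

Because this closed-form solve needs only a handful of correspondences, a method that reduces each cloud to a few semantically meaningful saliency points, as VI-Eye does, can afford to run it in real time.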