Yue Ma, Lei Miao, Haosen Wang, Yan Li, Bo Lu, Shifeng Wang
{"title":"基于激光雷达的两阶段增强行人和骑车人检测方法","authors":"Yue Ma, Lei Miao, Haosen Wang, Yan Li, Bo Lu, Shifeng Wang","doi":"10.1007/s10946-023-10158-2","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, the application scope of LIDAR has been continuously expanding, especially in object detection. Yet existing LIDAR-based methods focus on detecting vehicles on regular roadways. Scenarios with a higher prevalence of pedestrians and cyclists, such as university campuses and leisure centers, have recently received limited attention. To solve this problem, in this paper we propose a novel detection algorithm named SecondRcnn, which is built upon the SECOND algorithm and introduces a novel two-stage detection method. In the first stage, it utilizes 3D sparse convolution on the voxel LIDAR points to learn feature representations. In the second stage, regression is employed to refine the detection bounding boxes generated by the Region Of Interest pooling network. The algorithm was evaluated on the widely used KITTI data set and demonstrated significant performance improvements in detecting pedestrians (4.61% improvement) and cyclist (6.5% improvement) compared to baseline networks. Our work highlights the potential for accurate object detection in scenarios characterized by a higher presence of pedestrians and cyclists. Advancing the use of LIDAR in the field of 3D detection.</p></div>","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Two-Stage Lidar-Based Approach for Enhanced Pedestrian and Cyclist Detection\",\"authors\":\"Yue Ma, Lei Miao, Haosen Wang, Yan Li, Bo Lu, Shifeng Wang\",\"doi\":\"10.1007/s10946-023-10158-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In recent years, the application scope of LIDAR has been continuously expanding, especially in object detection. Yet existing LIDAR-based methods focus on detecting vehicles on regular roadways. Scenarios with a higher prevalence of pedestrians and cyclists, such as university campuses and leisure centers, have recently received limited attention. To solve this problem, in this paper we propose a novel detection algorithm named SecondRcnn, which is built upon the SECOND algorithm and introduces a novel two-stage detection method. In the first stage, it utilizes 3D sparse convolution on the voxel LIDAR points to learn feature representations. In the second stage, regression is employed to refine the detection bounding boxes generated by the Region Of Interest pooling network. The algorithm was evaluated on the widely used KITTI data set and demonstrated significant performance improvements in detecting pedestrians (4.61% improvement) and cyclist (6.5% improvement) compared to baseline networks. Our work highlights the potential for accurate object detection in scenarios characterized by a higher presence of pedestrians and cyclists. Advancing the use of LIDAR in the field of 3D detection.</p></div>\",\"PeriodicalId\":0,\"journal\":{\"name\":\"\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0,\"publicationDate\":\"2023-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10946-023-10158-2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s10946-023-10158-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
近年来,激光雷达的应用范围不断扩大,尤其是在物体检测方面。然而,现有的基于激光雷达的方法主要侧重于检测普通道路上的车辆。行人和骑自行车者较多的场景,如大学校园和休闲中心,最近受到的关注有限。为了解决这个问题,我们在本文中提出了一种名为 SecondRcnn 的新型检测算法,它建立在 SECOND 算法的基础上,并引入了一种新型的两阶段检测方法。在第一阶段,它利用体素激光雷达点的三维稀疏卷积来学习特征表示。在第二阶段,利用回归来完善由兴趣区域池网络生成的检测边界框。我们在广泛使用的 KITTI 数据集上对该算法进行了评估,结果表明,与基线网络相比,该算法在检测行人(提高 4.61%)和骑自行车者(提高 6.5%)方面的性能有了显著提高。我们的工作凸显了在行人和骑车人较多的场景中准确检测物体的潜力。推进激光雷达在 3D 检测领域的应用。
A Two-Stage Lidar-Based Approach for Enhanced Pedestrian and Cyclist Detection
In recent years, the application scope of LIDAR has been continuously expanding, especially in object detection. Yet existing LIDAR-based methods focus on detecting vehicles on regular roadways. Scenarios with a higher prevalence of pedestrians and cyclists, such as university campuses and leisure centers, have recently received limited attention. To solve this problem, in this paper we propose a novel detection algorithm named SecondRcnn, which is built upon the SECOND algorithm and introduces a novel two-stage detection method. In the first stage, it utilizes 3D sparse convolution on the voxel LIDAR points to learn feature representations. In the second stage, regression is employed to refine the detection bounding boxes generated by the Region Of Interest pooling network. The algorithm was evaluated on the widely used KITTI data set and demonstrated significant performance improvements in detecting pedestrians (4.61% improvement) and cyclist (6.5% improvement) compared to baseline networks. Our work highlights the potential for accurate object detection in scenarios characterized by a higher presence of pedestrians and cyclists. Advancing the use of LIDAR in the field of 3D detection.