{"title":"基于两阶段方法的自动驾驶三维目标检测","authors":"Yuhui Lu, Zhong Chen, Mingde Zhao","doi":"10.1109/ISPCE-ASIA57917.2022.9971105","DOIUrl":null,"url":null,"abstract":"As one of the hottest areas in the current technology industry, the field of autonomous driving has attracted the attention of many technology workers. How to use point cloud data for accurate multi-objective prediction is a key issue, which includes 3D object detection and multi-object tracking. CenterPoint proposes a novel anchor-free, two-stage 3D object detection method. The first stage uses a CenterNet approach, that is, using the center point to represent the object, using the feature map after feature extraction as input, and outputting a heatmap of the probability of the location of the center of the object for each category to predict the location of the target object, and obtaining other properties from the feature regression of the point location. The second stage is to extract features from the center point of the bounding box of the prediction target to refine the prediction results. However, the 3D backbone network of the CenterPoint has the disadvantages of low feature extraction accuracy and low second stage refinement accuracy. In order to solve these problems, this paper proposes to use VoxelResBackBone8x based on deep residual network Resnet as the 3D backbone network, simplify the 2D backbone network to improve feature extraction accuracy, and use the Set Abstraction Module to make the model use both the processed advanced features and the original point cloud features to further improve the accuracy.","PeriodicalId":197173,"journal":{"name":"2022 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"3D Objective Detection for Autonomous Driving based on Two-stage Approach\",\"authors\":\"Yuhui Lu, Zhong Chen, Mingde Zhao\",\"doi\":\"10.1109/ISPCE-ASIA57917.2022.9971105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As one of the hottest areas in the current technology industry, the field of autonomous driving has attracted the attention of many technology workers. How to use point cloud data for accurate multi-objective prediction is a key issue, which includes 3D object detection and multi-object tracking. CenterPoint proposes a novel anchor-free, two-stage 3D object detection method. The first stage uses a CenterNet approach, that is, using the center point to represent the object, using the feature map after feature extraction as input, and outputting a heatmap of the probability of the location of the center of the object for each category to predict the location of the target object, and obtaining other properties from the feature regression of the point location. The second stage is to extract features from the center point of the bounding box of the prediction target to refine the prediction results. However, the 3D backbone network of the CenterPoint has the disadvantages of low feature extraction accuracy and low second stage refinement accuracy. In order to solve these problems, this paper proposes to use VoxelResBackBone8x based on deep residual network Resnet as the 3D backbone network, simplify the 2D backbone network to improve feature extraction accuracy, and use the Set Abstraction Module to make the model use both the processed advanced features and the original point cloud features to further improve the accuracy.\",\"PeriodicalId\":197173,\"journal\":{\"name\":\"2022 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPCE-ASIA57917.2022.9971105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPCE-ASIA57917.2022.9971105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
3D Objective Detection for Autonomous Driving based on Two-stage Approach
As one of the hottest areas in the current technology industry, the field of autonomous driving has attracted the attention of many technology workers. How to use point cloud data for accurate multi-objective prediction is a key issue, which includes 3D object detection and multi-object tracking. CenterPoint proposes a novel anchor-free, two-stage 3D object detection method. The first stage uses a CenterNet approach, that is, using the center point to represent the object, using the feature map after feature extraction as input, and outputting a heatmap of the probability of the location of the center of the object for each category to predict the location of the target object, and obtaining other properties from the feature regression of the point location. The second stage is to extract features from the center point of the bounding box of the prediction target to refine the prediction results. However, the 3D backbone network of the CenterPoint has the disadvantages of low feature extraction accuracy and low second stage refinement accuracy. In order to solve these problems, this paper proposes to use VoxelResBackBone8x based on deep residual network Resnet as the 3D backbone network, simplify the 2D backbone network to improve feature extraction accuracy, and use the Set Abstraction Module to make the model use both the processed advanced features and the original point cloud features to further improve the accuracy.