Kai Wang;Mingliang Zhou;Qing Lin;Guanglin Niu;Xiaowei Zhang
{"title":"用于 3D 物体检测的几何引导点生成技术","authors":"Kai Wang;Mingliang Zhou;Qing Lin;Guanglin Niu;Xiaowei Zhang","doi":"10.1109/LSP.2024.3503359","DOIUrl":null,"url":null,"abstract":"Point cloud completion 3D object detectors effectively tackle the challenge of incomplete shapes in sparse point clouds by generating pseudo points to improve detection performance. However, the absence of guidance provided by the heatmap information and the geometric shape information renders the precise recovery of object shapes an arduous task. To this end, we propose a Geometry-guided Point Generation for 3D Object Detection, named GgPG. Specifically, we first design a 3D heatmap auxiliary supervision subnetwork to enhance the quality of object proposals by capturing the actual size and position of the object within the 3D heatmap representation. Moreover, we introduce a density-aware point generation module that employs Kernel Density Estimation (KDE) to embed the point density into the grid point's feature representation, thereby enabling the completion of more precise object shapes. Our GgPG achieves progressive performance in both Waymo and KITTI benchmarks, notably GgPG outperforms PGRCNN by +1.02\n<inline-formula><tex-math>$\\%$</tex-math></inline-formula>\n, +1.18\n<inline-formula><tex-math>$\\%$</tex-math></inline-formula>\n, and +0.56\n<inline-formula><tex-math>$\\%$</tex-math></inline-formula>\n on the vehicle, pedestrian, and cyclist under LEVEL\n<inline-formula><tex-math>$\\_$</tex-math></inline-formula>\n 2 mAPH classes on Waymo Open Dataset, respectively.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"136-140"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Geometry-Guided Point Generation for 3D Object Detection\",\"authors\":\"Kai Wang;Mingliang Zhou;Qing Lin;Guanglin Niu;Xiaowei Zhang\",\"doi\":\"10.1109/LSP.2024.3503359\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Point cloud completion 3D object detectors effectively tackle the challenge of incomplete shapes in sparse point clouds by generating pseudo points to improve detection performance. However, the absence of guidance provided by the heatmap information and the geometric shape information renders the precise recovery of object shapes an arduous task. To this end, we propose a Geometry-guided Point Generation for 3D Object Detection, named GgPG. Specifically, we first design a 3D heatmap auxiliary supervision subnetwork to enhance the quality of object proposals by capturing the actual size and position of the object within the 3D heatmap representation. Moreover, we introduce a density-aware point generation module that employs Kernel Density Estimation (KDE) to embed the point density into the grid point's feature representation, thereby enabling the completion of more precise object shapes. Our GgPG achieves progressive performance in both Waymo and KITTI benchmarks, notably GgPG outperforms PGRCNN by +1.02\\n<inline-formula><tex-math>$\\\\%$</tex-math></inline-formula>\\n, +1.18\\n<inline-formula><tex-math>$\\\\%$</tex-math></inline-formula>\\n, and +0.56\\n<inline-formula><tex-math>$\\\\%$</tex-math></inline-formula>\\n on the vehicle, pedestrian, and cyclist under LEVEL\\n<inline-formula><tex-math>$\\\\_$</tex-math></inline-formula>\\n 2 mAPH classes on Waymo Open Dataset, respectively.\",\"PeriodicalId\":13154,\"journal\":{\"name\":\"IEEE Signal Processing Letters\",\"volume\":\"32 \",\"pages\":\"136-140\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Signal Processing Letters\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10758765/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10758765/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Geometry-Guided Point Generation for 3D Object Detection
Point cloud completion 3D object detectors effectively tackle the challenge of incomplete shapes in sparse point clouds by generating pseudo points to improve detection performance. However, the absence of guidance provided by the heatmap information and the geometric shape information renders the precise recovery of object shapes an arduous task. To this end, we propose a Geometry-guided Point Generation for 3D Object Detection, named GgPG. Specifically, we first design a 3D heatmap auxiliary supervision subnetwork to enhance the quality of object proposals by capturing the actual size and position of the object within the 3D heatmap representation. Moreover, we introduce a density-aware point generation module that employs Kernel Density Estimation (KDE) to embed the point density into the grid point's feature representation, thereby enabling the completion of more precise object shapes. Our GgPG achieves progressive performance in both Waymo and KITTI benchmarks, notably GgPG outperforms PGRCNN by +1.02
$\%$
, +1.18
$\%$
, and +0.56
$\%$
on the vehicle, pedestrian, and cyclist under LEVEL
$\_$
2 mAPH classes on Waymo Open Dataset, respectively.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.