Yan Xu, Li Liu, Yu Meng, Chaoda Zheng, Wen Yang, Chen Sun, Rui Zhou, Dongpu Cao
{"title":"Kernel Point Non-local Networks for LiDAR Semantic Segmentation","authors":"Yan Xu, Li Liu, Yu Meng, Chaoda Zheng, Wen Yang, Chen Sun, Rui Zhou, Dongpu Cao","doi":"10.1109/CVCI54083.2021.9661234","DOIUrl":null,"url":null,"abstract":"LiDAR point cloud semantic segmentation based on convolutional neural networks has become an effective way to understand traffic scenes. Previous works mainly focus on projecting point clouds onto a plane and then use efficient 2D CNN to achieve efficient feature extraction. However, the projection process is accompanied by 3D information loss, challenging to adapt to the complex traffic environment. In this paper, we propose a point-based segmentation network based on three-dimensional convolution, which directly takes the point cloud as input, integrates a variety of distributed kernel point convolutions and introduces an attention mechanism to learn 3d point features efficiently. To evaluate our algorithm, we conducted sufficient experiments on the widely used public dataset SemanticKITTI [1]. The results show that our proposed Kernel Point Non-local module improving the accuracy of KPConv [2] from 58.8% to 61.5%, leading to new state-of-the-art among point-based methods.","PeriodicalId":419836,"journal":{"name":"2021 5th CAA International Conference on Vehicular Control and Intelligence (CVCI)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th CAA International Conference on Vehicular Control and Intelligence (CVCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVCI54083.2021.9661234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
LiDAR point cloud semantic segmentation based on convolutional neural networks has become an effective way to understand traffic scenes. Previous works mainly focus on projecting point clouds onto a plane and then use efficient 2D CNN to achieve efficient feature extraction. However, the projection process is accompanied by 3D information loss, challenging to adapt to the complex traffic environment. In this paper, we propose a point-based segmentation network based on three-dimensional convolution, which directly takes the point cloud as input, integrates a variety of distributed kernel point convolutions and introduces an attention mechanism to learn 3d point features efficiently. To evaluate our algorithm, we conducted sufficient experiments on the widely used public dataset SemanticKITTI [1]. The results show that our proposed Kernel Point Non-local module improving the accuracy of KPConv [2] from 58.8% to 61.5%, leading to new state-of-the-art among point-based methods.