Chiao-Hua Tseng, Yu-Ting Lin, Wen-Chieh Lin, Chieh-Chih Wang
{"title":"Improving LiDAR Semantic Segmentation on Minority Classes and Generalization Capability for Autonomous Driving","authors":"Chiao-Hua Tseng, Yu-Ting Lin, Wen-Chieh Lin, Chieh-Chih Wang","doi":"10.1109/TAAI57707.2022.00032","DOIUrl":null,"url":null,"abstract":"LiDARs have emerged as an important sensor in autonomous driving systems because they offer more accurate geometric measurements than cameras and radars. Therefore, LiDARs have been commonly combined with cameras or radars to tackle many perception problems in autonomous driving, such as object detection, semantic segmentation, or navigation. For semantic segmentation of LiDAR data, due to the class imbalance issue of large-scale scene, there is a performance gap between majority classes and minority classes of large-scale dataset. The minority classes usually include the crucial classes to the autonomous driving, such as “person”, “motorcyclist”, “traffic-sign”. To improve the performance of minority classes, we adopt U-Net++ as the architecture, KPConv as convolution operator, and use both dice loss and cross entropy as loss functions. We get 5.1% mIoU improvement on SemanticKITTI of all classes and 9.5% mIoU improvement of minority classes. Moreover, due to the different resolution of LiDAR sensors, we show the generalization capability of our model by training it on 64-beam dataset and testing on 32-beam and 128-beam dataset. We get 3.3% mIoU improvement on 128-beam dataset and 1.9% mIoU improvement on 32-beam dataset.","PeriodicalId":111620,"journal":{"name":"2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","volume":"208 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAAI57707.2022.00032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
LiDARs have emerged as an important sensor in autonomous driving systems because they offer more accurate geometric measurements than cameras and radars. Therefore, LiDARs have been commonly combined with cameras or radars to tackle many perception problems in autonomous driving, such as object detection, semantic segmentation, or navigation. For semantic segmentation of LiDAR data, due to the class imbalance issue of large-scale scene, there is a performance gap between majority classes and minority classes of large-scale dataset. The minority classes usually include the crucial classes to the autonomous driving, such as “person”, “motorcyclist”, “traffic-sign”. To improve the performance of minority classes, we adopt U-Net++ as the architecture, KPConv as convolution operator, and use both dice loss and cross entropy as loss functions. We get 5.1% mIoU improvement on SemanticKITTI of all classes and 9.5% mIoU improvement of minority classes. Moreover, due to the different resolution of LiDAR sensors, we show the generalization capability of our model by training it on 64-beam dataset and testing on 32-beam and 128-beam dataset. We get 3.3% mIoU improvement on 128-beam dataset and 1.9% mIoU improvement on 32-beam dataset.