{"title":"基于多模态信息的三维道路障碍物检测","authors":"Yu-Quan Wang, Yi-Ting Chen, Man Wu, Ching-Hsiang Ko, Hao-Wei Hwang, Yung-Yao Chen","doi":"10.1109/ISPACS51563.2021.9650995","DOIUrl":null,"url":null,"abstract":"Considerable resources have been devoted to developing self-driving systems in industry and academia, for which three-dimensional object detection is critical. The commonly used LiDAR-based methods, in which point clouds serve as the input representation, are marred by the problems of sparsity and inhomogeneity, which make small or distant objects difficult to detect. Accordingly, we propose a LiDAR-based road obstacle detection method assisted by RGB images, which operates as follows. First, a depth completion network is used to transform RGB images into dense depth maps that can be used to create a pseudo–point cloud through matrix operations. Subsequently, both pseudo point cloud and real point cloud are transformed into a pillar form for a pillar-wise feature encoder; this is executed to generate a two-dimensional (2D) feature tensor. Finally, a standard 2D convolutional neural network detection architecture is used to learn features. This method increases the number of point features to remedy the sparsity and inhomogeneity of the original point cloud. Our method had an improvement compared with its LiDAR-based counterpart in experiments.","PeriodicalId":359822,"journal":{"name":"2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fusing Multi-Modality Information for 3D Road Obstacle Detection\",\"authors\":\"Yu-Quan Wang, Yi-Ting Chen, Man Wu, Ching-Hsiang Ko, Hao-Wei Hwang, Yung-Yao Chen\",\"doi\":\"10.1109/ISPACS51563.2021.9650995\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Considerable resources have been devoted to developing self-driving systems in industry and academia, for which three-dimensional object detection is critical. The commonly used LiDAR-based methods, in which point clouds serve as the input representation, are marred by the problems of sparsity and inhomogeneity, which make small or distant objects difficult to detect. Accordingly, we propose a LiDAR-based road obstacle detection method assisted by RGB images, which operates as follows. First, a depth completion network is used to transform RGB images into dense depth maps that can be used to create a pseudo–point cloud through matrix operations. Subsequently, both pseudo point cloud and real point cloud are transformed into a pillar form for a pillar-wise feature encoder; this is executed to generate a two-dimensional (2D) feature tensor. Finally, a standard 2D convolutional neural network detection architecture is used to learn features. This method increases the number of point features to remedy the sparsity and inhomogeneity of the original point cloud. Our method had an improvement compared with its LiDAR-based counterpart in experiments.\",\"PeriodicalId\":359822,\"journal\":{\"name\":\"2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPACS51563.2021.9650995\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS51563.2021.9650995","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fusing Multi-Modality Information for 3D Road Obstacle Detection
Considerable resources have been devoted to developing self-driving systems in industry and academia, for which three-dimensional object detection is critical. The commonly used LiDAR-based methods, in which point clouds serve as the input representation, are marred by the problems of sparsity and inhomogeneity, which make small or distant objects difficult to detect. Accordingly, we propose a LiDAR-based road obstacle detection method assisted by RGB images, which operates as follows. First, a depth completion network is used to transform RGB images into dense depth maps that can be used to create a pseudo–point cloud through matrix operations. Subsequently, both pseudo point cloud and real point cloud are transformed into a pillar form for a pillar-wise feature encoder; this is executed to generate a two-dimensional (2D) feature tensor. Finally, a standard 2D convolutional neural network detection architecture is used to learn features. This method increases the number of point features to remedy the sparsity and inhomogeneity of the original point cloud. Our method had an improvement compared with its LiDAR-based counterpart in experiments.