{"title":"JYOLO: Joint Point Cloud for Autonomous Driving 3D Object Detection","authors":"Hongpeng Tian, Lunlun Guo","doi":"10.1109/ICSPCC55723.2022.9984261","DOIUrl":null,"url":null,"abstract":"The camera and lidar are significant sensors for automatic driving, they can provide adequate complementary information. However, 3D point cloud object detection suffers from complexity and low accuracy. In this paper, a Joint-YOLO fusion model is proposed. It provides a low-complexity joint fusion object detection framework. First, the dilated attention is designed to pay attention to the feature resolution of correlation and reduce the number of calculations. And secondly, parallel inverted residual is constructed to connect deep and rich semantic information with high-dimensional features. Finally, the model present an efficient joint fusion structure embedded with camera-lidar detector based 2D-3D bounding box geometric and semantic information for 3D point cloud object detection.","PeriodicalId":346917,"journal":{"name":"2022 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)","volume":"746 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPCC55723.2022.9984261","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The camera and lidar are significant sensors for automatic driving, they can provide adequate complementary information. However, 3D point cloud object detection suffers from complexity and low accuracy. In this paper, a Joint-YOLO fusion model is proposed. It provides a low-complexity joint fusion object detection framework. First, the dilated attention is designed to pay attention to the feature resolution of correlation and reduce the number of calculations. And secondly, parallel inverted residual is constructed to connect deep and rich semantic information with high-dimensional features. Finally, the model present an efficient joint fusion structure embedded with camera-lidar detector based 2D-3D bounding box geometric and semantic information for 3D point cloud object detection.