{"title":"基于改进EPNet的三维目标检测算法","authors":"Jiwu Tang, Xianzhao Zhu, D. Yin","doi":"10.1109/CISCE58541.2023.10142760","DOIUrl":null,"url":null,"abstract":"Early 3D object detection algorithms focused on using a single sensor. However, with the development of target detection technology, the target detection algorithm based on multi-sensor fusion has gradually entered people's field of vision. In this paper, we improve the EPNet algorithm based on image and point cloud fusion. In its image stream branch, due to its relatively simple image upsampling method, most of the image information will be lost, which reduces the detection accuracy. Therefore, the method of combining the feature pyramid network with the coordinate attention mechanism is used to make up for the problem. Extensive experiments on the KITTI datasets demonstrate this method is good.","PeriodicalId":145263,"journal":{"name":"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A 3D Object Detection Algorithm Based on Improved EPNet\",\"authors\":\"Jiwu Tang, Xianzhao Zhu, D. Yin\",\"doi\":\"10.1109/CISCE58541.2023.10142760\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Early 3D object detection algorithms focused on using a single sensor. However, with the development of target detection technology, the target detection algorithm based on multi-sensor fusion has gradually entered people's field of vision. In this paper, we improve the EPNet algorithm based on image and point cloud fusion. In its image stream branch, due to its relatively simple image upsampling method, most of the image information will be lost, which reduces the detection accuracy. Therefore, the method of combining the feature pyramid network with the coordinate attention mechanism is used to make up for the problem. Extensive experiments on the KITTI datasets demonstrate this method is good.\",\"PeriodicalId\":145263,\"journal\":{\"name\":\"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISCE58541.2023.10142760\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE58541.2023.10142760","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A 3D Object Detection Algorithm Based on Improved EPNet
Early 3D object detection algorithms focused on using a single sensor. However, with the development of target detection technology, the target detection algorithm based on multi-sensor fusion has gradually entered people's field of vision. In this paper, we improve the EPNet algorithm based on image and point cloud fusion. In its image stream branch, due to its relatively simple image upsampling method, most of the image information will be lost, which reduces the detection accuracy. Therefore, the method of combining the feature pyramid network with the coordinate attention mechanism is used to make up for the problem. Extensive experiments on the KITTI datasets demonstrate this method is good.