Huaijin Liu, Jixiang Du, Yong Zhang, Hongbo Zhang, Jiandian Zeng
{"title":"Syn-Aug:一种用于三维目标检测的有效和通用同步数据增强框架","authors":"Huaijin Liu, Jixiang Du, Yong Zhang, Hongbo Zhang, Jiandian Zeng","doi":"10.1049/cit2.70001","DOIUrl":null,"url":null,"abstract":"<p>Data augmentation plays an important role in boosting the performance of 3D models, while very few studies handle the 3D point cloud data with this technique. Global augmentation and cut-paste are commonly used augmentation techniques for point clouds, where global augmentation is applied to the entire point cloud of the scene, and cut-paste samples objects from other frames into the current frame. Both types of data augmentation can improve performance, but the cut-paste technique cannot effectively deal with the occlusion relationship between the foreground object and the background scene and the rationality of object sampling, which may be counterproductive and may hurt the overall performance. In addition, LiDAR is susceptible to signal loss, external occlusion, extreme weather and other factors, which can easily cause object shape changes, while global augmentation and cut-paste cannot effectively enhance the robustness of the model. To this end, we propose Syn-Aug, a synchronous data augmentation framework for LiDAR-based 3D object detection. Specifically, we first propose a novel rendering-based object augmentation technique (Ren-Aug) to enrich training data while enhancing scene realism. Second, we propose a local augmentation technique (Local-Aug) to generate local noise by rotating and scaling objects in the scene while avoiding collisions, which can improve generalisation performance. Finally, we make full use of the structural information of 3D labels to make the model more robust by randomly changing the geometry of objects in the training frames. We verify the proposed framework with four different types of 3D object detectors. Experimental results show that our proposed Syn-Aug significantly improves the performance of various 3D object detectors in the KITTI and nuScenes datasets, proving the effectiveness and generality of Syn-Aug. On KITTI, four different types of baseline models using Syn-Aug improved mAP by 0.89%, 1.35%, 1.61% and 1.14% respectively. On nuScenes, four different types of baseline models using Syn-Aug improved mAP by 14.93%, 10.42%, 8.47% and 6.81% respectively. The code is available at https://github.com/liuhuaijjin/Syn-Aug.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 3","pages":"912-928"},"PeriodicalIF":8.4000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.70001","citationCount":"0","resultStr":"{\"title\":\"Syn-Aug: An Effective and General Synchronous Data Augmentation Framework for 3D Object Detection\",\"authors\":\"Huaijin Liu, Jixiang Du, Yong Zhang, Hongbo Zhang, Jiandian Zeng\",\"doi\":\"10.1049/cit2.70001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Data augmentation plays an important role in boosting the performance of 3D models, while very few studies handle the 3D point cloud data with this technique. Global augmentation and cut-paste are commonly used augmentation techniques for point clouds, where global augmentation is applied to the entire point cloud of the scene, and cut-paste samples objects from other frames into the current frame. Both types of data augmentation can improve performance, but the cut-paste technique cannot effectively deal with the occlusion relationship between the foreground object and the background scene and the rationality of object sampling, which may be counterproductive and may hurt the overall performance. In addition, LiDAR is susceptible to signal loss, external occlusion, extreme weather and other factors, which can easily cause object shape changes, while global augmentation and cut-paste cannot effectively enhance the robustness of the model. To this end, we propose Syn-Aug, a synchronous data augmentation framework for LiDAR-based 3D object detection. Specifically, we first propose a novel rendering-based object augmentation technique (Ren-Aug) to enrich training data while enhancing scene realism. Second, we propose a local augmentation technique (Local-Aug) to generate local noise by rotating and scaling objects in the scene while avoiding collisions, which can improve generalisation performance. Finally, we make full use of the structural information of 3D labels to make the model more robust by randomly changing the geometry of objects in the training frames. We verify the proposed framework with four different types of 3D object detectors. Experimental results show that our proposed Syn-Aug significantly improves the performance of various 3D object detectors in the KITTI and nuScenes datasets, proving the effectiveness and generality of Syn-Aug. On KITTI, four different types of baseline models using Syn-Aug improved mAP by 0.89%, 1.35%, 1.61% and 1.14% respectively. On nuScenes, four different types of baseline models using Syn-Aug improved mAP by 14.93%, 10.42%, 8.47% and 6.81% respectively. The code is available at https://github.com/liuhuaijjin/Syn-Aug.</p>\",\"PeriodicalId\":46211,\"journal\":{\"name\":\"CAAI Transactions on Intelligence Technology\",\"volume\":\"10 3\",\"pages\":\"912-928\"},\"PeriodicalIF\":8.4000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.70001\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CAAI Transactions on Intelligence Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cit2.70001\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cit2.70001","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Syn-Aug: An Effective and General Synchronous Data Augmentation Framework for 3D Object Detection
Data augmentation plays an important role in boosting the performance of 3D models, while very few studies handle the 3D point cloud data with this technique. Global augmentation and cut-paste are commonly used augmentation techniques for point clouds, where global augmentation is applied to the entire point cloud of the scene, and cut-paste samples objects from other frames into the current frame. Both types of data augmentation can improve performance, but the cut-paste technique cannot effectively deal with the occlusion relationship between the foreground object and the background scene and the rationality of object sampling, which may be counterproductive and may hurt the overall performance. In addition, LiDAR is susceptible to signal loss, external occlusion, extreme weather and other factors, which can easily cause object shape changes, while global augmentation and cut-paste cannot effectively enhance the robustness of the model. To this end, we propose Syn-Aug, a synchronous data augmentation framework for LiDAR-based 3D object detection. Specifically, we first propose a novel rendering-based object augmentation technique (Ren-Aug) to enrich training data while enhancing scene realism. Second, we propose a local augmentation technique (Local-Aug) to generate local noise by rotating and scaling objects in the scene while avoiding collisions, which can improve generalisation performance. Finally, we make full use of the structural information of 3D labels to make the model more robust by randomly changing the geometry of objects in the training frames. We verify the proposed framework with four different types of 3D object detectors. Experimental results show that our proposed Syn-Aug significantly improves the performance of various 3D object detectors in the KITTI and nuScenes datasets, proving the effectiveness and generality of Syn-Aug. On KITTI, four different types of baseline models using Syn-Aug improved mAP by 0.89%, 1.35%, 1.61% and 1.14% respectively. On nuScenes, four different types of baseline models using Syn-Aug improved mAP by 14.93%, 10.42%, 8.47% and 6.81% respectively. The code is available at https://github.com/liuhuaijjin/Syn-Aug.
期刊介绍:
CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.