Syn-Aug：一种用于三维目标检测的有效和通用同步数据增强框架

IF 8.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology Pub Date : 2025-04-21 DOI:10.1049/cit2.70001

Huaijin Liu, Jixiang Du, Yong Zhang, Hongbo Zhang, Jiandian Zeng

{"title":"Syn-Aug：一种用于三维目标检测的有效和通用同步数据增强框架","authors":"Huaijin Liu, Jixiang Du, Yong Zhang, Hongbo Zhang, Jiandian Zeng","doi":"10.1049/cit2.70001","DOIUrl":null,"url":null,"abstract":"<p>Data augmentation plays an important role in boosting the performance of 3D models, while very few studies handle the 3D point cloud data with this technique. Global augmentation and cut-paste are commonly used augmentation techniques for point clouds, where global augmentation is applied to the entire point cloud of the scene, and cut-paste samples objects from other frames into the current frame. Both types of data augmentation can improve performance, but the cut-paste technique cannot effectively deal with the occlusion relationship between the foreground object and the background scene and the rationality of object sampling, which may be counterproductive and may hurt the overall performance. In addition, LiDAR is susceptible to signal loss, external occlusion, extreme weather and other factors, which can easily cause object shape changes, while global augmentation and cut-paste cannot effectively enhance the robustness of the model. To this end, we propose Syn-Aug, a synchronous data augmentation framework for LiDAR-based 3D object detection. Specifically, we first propose a novel rendering-based object augmentation technique (Ren-Aug) to enrich training data while enhancing scene realism. Second, we propose a local augmentation technique (Local-Aug) to generate local noise by rotating and scaling objects in the scene while avoiding collisions, which can improve generalisation performance. Finally, we make full use of the structural information of 3D labels to make the model more robust by randomly changing the geometry of objects in the training frames. We verify the proposed framework with four different types of 3D object detectors. Experimental results show that our proposed Syn-Aug significantly improves the performance of various 3D object detectors in the KITTI and nuScenes datasets, proving the effectiveness and generality of Syn-Aug. On KITTI, four different types of baseline models using Syn-Aug improved mAP by 0.89%, 1.35%, 1.61% and 1.14% respectively. On nuScenes, four different types of baseline models using Syn-Aug improved mAP by 14.93%, 10.42%, 8.47% and 6.81% respectively. The code is available at https://github.com/liuhuaijjin/Syn-Aug.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 3","pages":"912-928"},"PeriodicalIF":8.4000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.70001","citationCount":"0","resultStr":"{\"title\":\"Syn-Aug: An Effective and General Synchronous Data Augmentation Framework for 3D Object Detection\",\"authors\":\"Huaijin Liu, Jixiang Du, Yong Zhang, Hongbo Zhang, Jiandian Zeng\",\"doi\":\"10.1049/cit2.70001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Data augmentation plays an important role in boosting the performance of 3D models, while very few studies handle the 3D point cloud data with this technique. Global augmentation and cut-paste are commonly used augmentation techniques for point clouds, where global augmentation is applied to the entire point cloud of the scene, and cut-paste samples objects from other frames into the current frame. Both types of data augmentation can improve performance, but the cut-paste technique cannot effectively deal with the occlusion relationship between the foreground object and the background scene and the rationality of object sampling, which may be counterproductive and may hurt the overall performance. In addition, LiDAR is susceptible to signal loss, external occlusion, extreme weather and other factors, which can easily cause object shape changes, while global augmentation and cut-paste cannot effectively enhance the robustness of the model. To this end, we propose Syn-Aug, a synchronous data augmentation framework for LiDAR-based 3D object detection. Specifically, we first propose a novel rendering-based object augmentation technique (Ren-Aug) to enrich training data while enhancing scene realism. Second, we propose a local augmentation technique (Local-Aug) to generate local noise by rotating and scaling objects in the scene while avoiding collisions, which can improve generalisation performance. Finally, we make full use of the structural information of 3D labels to make the model more robust by randomly changing the geometry of objects in the training frames. We verify the proposed framework with four different types of 3D object detectors. Experimental results show that our proposed Syn-Aug significantly improves the performance of various 3D object detectors in the KITTI and nuScenes datasets, proving the effectiveness and generality of Syn-Aug. On KITTI, four different types of baseline models using Syn-Aug improved mAP by 0.89%, 1.35%, 1.61% and 1.14% respectively. On nuScenes, four different types of baseline models using Syn-Aug improved mAP by 14.93%, 10.42%, 8.47% and 6.81% respectively. The code is available at https://github.com/liuhuaijjin/Syn-Aug.</p>\",\"PeriodicalId\":46211,\"journal\":{\"name\":\"CAAI Transactions on Intelligence Technology\",\"volume\":\"10 3\",\"pages\":\"912-928\"},\"PeriodicalIF\":8.4000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.70001\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CAAI Transactions on Intelligence Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cit2.70001\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cit2.70001","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

数据增强在提高三维模型的性能方面起着重要的作用，但很少有研究使用该技术来处理三维点云数据。全局增强和剪切粘贴是点云常用的增强技术，对场景的整个点云进行全局增强，并将其他帧的样本对象剪切粘贴到当前帧中。两种类型的数据增强都可以提高性能，但剪切粘贴技术不能有效处理前景对象与背景场景之间的遮挡关系以及对象采样的合理性，可能适得其反，可能会损害整体性能。此外，LiDAR容易受到信号丢失、外部遮挡、极端天气等因素的影响，容易造成物体形状的变化，而全局增强和剪切粘贴不能有效增强模型的鲁棒性。为此，我们提出了基于激光雷达的三维目标检测同步数据增强框架Syn-Aug。具体来说，我们首先提出了一种新的基于渲染的物体增强技术（Ren-Aug）来丰富训练数据，同时增强场景真实感。其次，我们提出了一种局部增强技术（local - aug），通过旋转和缩放场景中的物体来产生局部噪声，同时避免碰撞，从而提高泛化性能。最后，我们充分利用三维标签的结构信息，通过随机改变训练帧中物体的几何形状来增强模型的鲁棒性。我们用四种不同类型的3D目标检测器验证了所提出的框架。实验结果表明，我们提出的Syn-Aug在KITTI和nuScenes数据集上显著提高了各种3D目标检测器的性能，证明了Syn-Aug的有效性和通用性。在KITTI上，使用Syn-Aug的4种不同类型基线模型分别提高了0.89%、1.35%、1.61%和1.14%的mAP。在nuScenes上，使用Syn-Aug的4种不同类型基线模型分别提高了14.93%、10.42%、8.47%和6.81%的mAP。代码可在https://github.com/liuhuaijjin/Syn-Aug上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Syn-Aug: An Effective and General Synchronous Data Augmentation Framework for 3D Object Detection

查看原文本刊更多论文

Syn-Aug: An Effective and General Synchronous Data Augmentation Framework for 3D Object Detection

Data augmentation plays an important role in boosting the performance of 3D models, while very few studies handle the 3D point cloud data with this technique. Global augmentation and cut-paste are commonly used augmentation techniques for point clouds, where global augmentation is applied to the entire point cloud of the scene, and cut-paste samples objects from other frames into the current frame. Both types of data augmentation can improve performance, but the cut-paste technique cannot effectively deal with the occlusion relationship between the foreground object and the background scene and the rationality of object sampling, which may be counterproductive and may hurt the overall performance. In addition, LiDAR is susceptible to signal loss, external occlusion, extreme weather and other factors, which can easily cause object shape changes, while global augmentation and cut-paste cannot effectively enhance the robustness of the model. To this end, we propose Syn-Aug, a synchronous data augmentation framework for LiDAR-based 3D object detection. Specifically, we first propose a novel rendering-based object augmentation technique (Ren-Aug) to enrich training data while enhancing scene realism. Second, we propose a local augmentation technique (Local-Aug) to generate local noise by rotating and scaling objects in the scene while avoiding collisions, which can improve generalisation performance. Finally, we make full use of the structural information of 3D labels to make the model more robust by randomly changing the geometry of objects in the training frames. We verify the proposed framework with four different types of 3D object detectors. Experimental results show that our proposed Syn-Aug significantly improves the performance of various 3D object detectors in the KITTI and nuScenes datasets, proving the effectiveness and generality of Syn-Aug. On KITTI, four different types of baseline models using Syn-Aug improved mAP by 0.89%, 1.35%, 1.61% and 1.14% respectively. On nuScenes, four different types of baseline models using Syn-Aug improved mAP by 14.93%, 10.42%, 8.47% and 6.81% respectively. The code is available at https://github.com/liuhuaijjin/Syn-Aug.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

CAAI Transactions on Intelligence Technology COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

11.00

自引率

3.90%

发文量

134

审稿时长

35 weeks

期刊介绍： CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.