Gradient-based sparse voxel attacks on point cloud object detection

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Pattern Recognition Pub Date : 2024-11-12 DOI:10.1016/j.patcog.2024.111156

Junqi Wu , Wen Yao , Shuai Jia , Tingsong Jiang , Weien Zhou , Chao Ma , Xiaoqian Chen

{"title":"Gradient-based sparse voxel attacks on point cloud object detection","authors":"Junqi Wu , Wen Yao , Shuai Jia , Tingsong Jiang , Weien Zhou , Chao Ma , Xiaoqian Chen","doi":"10.1016/j.patcog.2024.111156","DOIUrl":null,"url":null,"abstract":"<div><div>Point cloud object detection is crucial for a variety of applications, including autonomous driving and robotics. Voxel-based representation for 3D point clouds has drawn significant attention due to their efficiency and effectiveness. Recent studies have revealed the vulnerability of deep learning models to adversarial attacks, while considerably less attention is paid to the robustness of voxel-based point cloud object detectors. Existing adversarial attacks on the point cloud data involve generating fake obstacles, removing objects or producing fake predictions. Despite the demonstrated success, these approaches have three limitations. First, manipulating point data, which was originally designed for point-based representation, is inapplicable to voxel-based representation. Second, existing works that modified points in the hold scene yield redundant perturbations. Third, the evaluation primarily performed on small-scale datasets, such as KITTI, does not scale well. To address these limitations, we propose a gradient-based sparse voxel attack (GSVA) algorithm for voxel-based 3D point cloud object detectors. Two novel frameworks, i.e., re-voxelization-based voxel attack framework and light voxel attack framework, successfully modify voxel-based representation instead of raw points. In addition to KITTI, extensive experiments on large-scale datasets including nuScenes and Waymo Open Dataset demonstrate the favorable attack performance (with mAP decrease by <span><math><mrow><mn>86</mn><mo>.</mo><mn>2</mn><mtext>%</mtext><mo>∼</mo><mn>99</mn><mo>.</mo><mn>5</mn><mtext>%</mtext></mrow></math></span>) and the slight perturbation costs (with lowest modification rate of 3.5%) of our voxel attack method over the state-of-the-art approaches.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"160 ","pages":"Article 111156"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320324009075","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Point cloud object detection is crucial for a variety of applications, including autonomous driving and robotics. Voxel-based representation for 3D point clouds has drawn significant attention due to their efficiency and effectiveness. Recent studies have revealed the vulnerability of deep learning models to adversarial attacks, while considerably less attention is paid to the robustness of voxel-based point cloud object detectors. Existing adversarial attacks on the point cloud data involve generating fake obstacles, removing objects or producing fake predictions. Despite the demonstrated success, these approaches have three limitations. First, manipulating point data, which was originally designed for point-based representation, is inapplicable to voxel-based representation. Second, existing works that modified points in the hold scene yield redundant perturbations. Third, the evaluation primarily performed on small-scale datasets, such as KITTI, does not scale well. To address these limitations, we propose a gradient-based sparse voxel attack (GSVA) algorithm for voxel-based 3D point cloud object detectors. Two novel frameworks, i.e., re-voxelization-based voxel attack framework and light voxel attack framework, successfully modify voxel-based representation instead of raw points. In addition to KITTI, extensive experiments on large-scale datasets including nuScenes and Waymo Open Dataset demonstrate the favorable attack performance (with mAP decrease by

86.2 % \sim 99.5 %

) and the slight perturbation costs (with lowest modification rate of 3.5%) of our voxel attack method over the state-of-the-art approaches.

查看原文本刊更多论文

基于梯度的点云物体检测稀疏体素攻击

点云对象检测对于自动驾驶和机器人等多种应用至关重要。基于体素的三维点云表示因其高效性和有效性而备受关注。最近的研究揭示了深度学习模型在对抗性攻击面前的脆弱性，而对基于体素的点云物体检测器的鲁棒性的关注则少得多。针对点云数据的现有对抗性攻击包括生成虚假障碍物、移除物体或生成虚假预测。尽管这些方法取得了成功，但也存在三个局限性。首先，操纵点数据原本是为基于点的表示而设计的，但不适用于基于体素的表示。其次，现有的工作修改了保持场景中的点，产生了多余的扰动。第三，主要在 KITTI 等小规模数据集上进行的评估不能很好地扩展。为了解决这些局限性，我们提出了一种基于梯度的稀疏体素攻击（GSVA）算法，用于基于体素的三维点云物体检测器。两个新颖的框架，即基于再象素化的象素攻击框架和光象素攻击框架，成功地修改了基于象素的表示而不是原始点。除了 KITTI 之外，包括 nuScenes 和 Waymo Open Dataset 在内的大规模数据集的广泛实验也证明了我们的体素攻击方法与最先进的方法相比，具有良好的攻击性能（mAP 降低了 86.2%∼99.5%）和轻微的扰动成本（最低修改率为 3.5%）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.