基于输入变形和增强的深度神经网络对抗性攻击

2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC) Pub Date : 2020-01-01 DOI:10.1109/ASP-DAC47756.2020.9045107

Pengfei Qiu, Qian Wang, Dongsheng Wang, Yongqiang Lyu, Zhaojun Lu, G. Qu

{"title":"基于输入变形和增强的深度神经网络对抗性攻击","authors":"Pengfei Qiu, Qian Wang, Dongsheng Wang, Yongqiang Lyu, Zhaojun Lu, G. Qu","doi":"10.1109/ASP-DAC47756.2020.9045107","DOIUrl":null,"url":null,"abstract":"Typical Deep Neural Networks (DNN) are susceptible to adversarial attacks that add malicious perturbations to input to mislead the DNN model. Most of the state-of-theart countermeasures concentrate on the defensive distillation or parameter re-training, which require prior knowledge of the target DNN and/or the attacking methods and hence greatly limit their generality and usability. In this paper, we propose to defend against adversarial attacks by utilizing the input deformation and augmentation techniques that are currently widely utilized to enlarge the dataset during DNN’s training phase. This is based on the observation that certain input deformation and augmentation methods will have little or no impact on DNN model’s accuracy, but the adversarial attacks will fail when the maliciously induced perturbations are randomly deformed. We also use the ensemble of decisions to further improve DNN model’s accuracy and the effectiveness of defending various attacks. Our proposed mitigation method is model independent (i.e. it does not require additional training, parameter finetuning, or any structure modifications of the target DNN model) and attack independent (i.e., it does not require any knowledge of the adversarial attacks). So it has excellent generality and usability. We conduct experiments on standard CIFAR-10 dataset and three representative adversarial attacks: Fast Gradient Sign Method, Carlini and Wagner, and Jacobian-based Saliency Map Attack. Results show that the average success rate of the attacks can be reduced from 96.5% to 28.7% while the DNN model accuracy is improved by about 2%.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"672 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation\",\"authors\":\"Pengfei Qiu, Qian Wang, Dongsheng Wang, Yongqiang Lyu, Zhaojun Lu, G. Qu\",\"doi\":\"10.1109/ASP-DAC47756.2020.9045107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Typical Deep Neural Networks (DNN) are susceptible to adversarial attacks that add malicious perturbations to input to mislead the DNN model. Most of the state-of-theart countermeasures concentrate on the defensive distillation or parameter re-training, which require prior knowledge of the target DNN and/or the attacking methods and hence greatly limit their generality and usability. In this paper, we propose to defend against adversarial attacks by utilizing the input deformation and augmentation techniques that are currently widely utilized to enlarge the dataset during DNN’s training phase. This is based on the observation that certain input deformation and augmentation methods will have little or no impact on DNN model’s accuracy, but the adversarial attacks will fail when the maliciously induced perturbations are randomly deformed. We also use the ensemble of decisions to further improve DNN model’s accuracy and the effectiveness of defending various attacks. Our proposed mitigation method is model independent (i.e. it does not require additional training, parameter finetuning, or any structure modifications of the target DNN model) and attack independent (i.e., it does not require any knowledge of the adversarial attacks). So it has excellent generality and usability. We conduct experiments on standard CIFAR-10 dataset and three representative adversarial attacks: Fast Gradient Sign Method, Carlini and Wagner, and Jacobian-based Saliency Map Attack. Results show that the average success rate of the attacks can be reduced from 96.5% to 28.7% while the DNN model accuracy is improved by about 2%.\",\"PeriodicalId\":125112,\"journal\":{\"name\":\"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)\",\"volume\":\"672 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASP-DAC47756.2020.9045107\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASP-DAC47756.2020.9045107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

典型的深度神经网络(DNN)容易受到对抗性攻击，这种攻击会在输入中添加恶意扰动，从而误导DNN模型。大多数最先进的对策集中在防御蒸馏或参数再训练上，这需要事先了解目标深度神经网络和/或攻击方法，因此极大地限制了它们的通用性和可用性。在本文中，我们建议利用目前广泛用于在DNN训练阶段扩大数据集的输入变形和增强技术来防御对抗性攻击。这是基于观察到某些输入变形和增强方法对DNN模型的准确性影响很小或没有影响，但当恶意诱导的扰动随机变形时，对抗性攻击将失败。我们还使用决策集合来进一步提高DNN模型的准确性和防御各种攻击的有效性。我们提出的缓解方法是模型独立的(即它不需要额外的训练，参数微调或目标DNN模型的任何结构修改)和攻击独立的(即，它不需要任何对抗性攻击的知识)。具有良好的通用性和可用性。我们在标准CIFAR-10数据集和三种具有代表性的对抗性攻击进行了实验:快速梯度符号法、Carlini和Wagner攻击和基于jacobian的显著性图攻击。结果表明，攻击的平均成功率从96.5%降低到28.7%，DNN模型的准确率提高了约2%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation

Typical Deep Neural Networks (DNN) are susceptible to adversarial attacks that add malicious perturbations to input to mislead the DNN model. Most of the state-of-theart countermeasures concentrate on the defensive distillation or parameter re-training, which require prior knowledge of the target DNN and/or the attacking methods and hence greatly limit their generality and usability. In this paper, we propose to defend against adversarial attacks by utilizing the input deformation and augmentation techniques that are currently widely utilized to enlarge the dataset during DNN’s training phase. This is based on the observation that certain input deformation and augmentation methods will have little or no impact on DNN model’s accuracy, but the adversarial attacks will fail when the maliciously induced perturbations are randomly deformed. We also use the ensemble of decisions to further improve DNN model’s accuracy and the effectiveness of defending various attacks. Our proposed mitigation method is model independent (i.e. it does not require additional training, parameter finetuning, or any structure modifications of the target DNN model) and attack independent (i.e., it does not require any knowledge of the adversarial attacks). So it has excellent generality and usability. We conduct experiments on standard CIFAR-10 dataset and three representative adversarial attacks: Fast Gradient Sign Method, Carlini and Wagner, and Jacobian-based Saliency Map Attack. Results show that the average success rate of the attacks can be reduced from 96.5% to 28.7% while the DNN model accuracy is improved by about 2%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)

自引率

0.00%

发文量