Understanding DeepFool Adversarial Attack and Defense with Skater Interpretations
Dhivyashri Ramesh, Ishwarya Sriram, Kavya Sridhar, Snofy D. Dunston, M. V
2023 International Conference on Wireless Communications Signal Processing and Networking (WiSPNET)
Published 2023-03-29 · DOI: 10.1109/WiSPNET57748.2023.10134485
Abstract
With the incorporation of artificial intelligence into businesses, particularly capabilities such as computer vision, it has become increasingly important to ensure the robustness of the models in use. A popular technique for exploiting machine learning models is the adversarial attack, which misleads a predictive model by feeding it perturbed input. In the context of computer vision, this means introducing perturbations into an image to deceive a model. One such attack is the DeepFool attack, which seeks the minimal perturbation of an image that deceives the model. These attacks can also affect the way model interpretations are made. In this paper, we analyse the DeepFool attack and its countermeasures on a ResNet-50 model trained on the NIH malaria dataset. To assess the effectiveness of the attack and of the subsequent adversarial training, we use accuracy and loss as metrics. The nature and impact of the attack and of adversarial training are analysed using Skater, a model interpretation framework, and we also examine how the interpretations vary when adversarial attacks are in place.
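For concreteness, the sketch below shows how a DeepFool attack of this kind could be mounted against a ResNet-50 classifier in PyTorch using the Foolbox library. The paper does not specify its tooling, so the library choice, the two-class head, and the stand-in data here are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn
import torchvision.models as models
import foolbox as fb

# ResNet-50 with a two-class head (parasitized vs. uninfected), matching
# the binary NIH malaria classification task; in practice the fine-tuned
# weights from the paper's training stage would be loaded here.
model = models.resnet50()
model.fc = nn.Linear(model.fc.in_features, 2)
model.eval()

fmodel = fb.PyTorchModel(model, bounds=(0, 1))

# DeepFool iteratively pushes each input just past the nearest decision
# boundary, producing an approximately minimal L2 perturbation.
attack = fb.attacks.L2DeepFoolAttack(steps=50)

images = torch.rand(8, 3, 224, 224)   # stand-in batch of cell images in [0, 1]
labels = torch.randint(0, 2, (8,))    # stand-in ground-truth labels

# The epsilon only caps the perturbation norm; DeepFool minimizes it anyway.
raw, clipped, success = attack(fmodel, images, labels, epsilons=2.0)
print(f"fooled {success.float().mean():.0%} of the batch")

The clipped adversarial images produced this way would then feed both the adversarial-training stage and the Skater interpretation pass that the paper evaluates.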