A Data-Centric Approach for Improving Adversarial Training Through the Lens of Out-of-Distribution Detection

2023 28th International Computer Conference, Computer Society of Iran (CSICC) Pub Date : 2023-01-25 DOI:10.1109/CSICC58665.2023.10105351

Mohammad Azizmalayeri, Arman Zarei, Alireza Isavand, M. T. Manzuri, M. Rohban

{"title":"A Data-Centric Approach for Improving Adversarial Training Through the Lens of Out-of-Distribution Detection","authors":"Mohammad Azizmalayeri, Arman Zarei, Alireza Isavand, M. T. Manzuri, M. Rohban","doi":"10.1109/CSICC58665.2023.10105351","DOIUrl":null,"url":null,"abstract":"Current machine learning models achieve super-human performance in many real-world applications. Still, they are susceptible against imperceptible adversarial perturbations. The most effective solution for this problem is adversarial training that trains the model with adversarially perturbed samples instead of original ones. Various methods have been developed over recent years to improve adversarial training such as data augmentation or modifying training attacks. In this work, we examine the same problem from a new data-centric perspective. For this purpose, we first demonstrate that the existing model-based methods can be equivalent to applying smaller perturbation or optimization weights to the hard training examples. By using this finding, we propose detecting and removing these hard samples directly from the training procedure rather than applying complicated algorithms to mitigate their effects. For detection, we use maximum softmax probability as an effective method in out-of-distribution detection since we can consider the hard samples as the out-of-distribution samples for the whole data distribution. Our results on SVHN and CIFAR-10 datasets show the effectiveness of this method in improving the adversarial training without adding too much computational cost.","PeriodicalId":127277,"journal":{"name":"2023 28th International Computer Conference, Computer Society of Iran (CSICC)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 28th International Computer Conference, Computer Society of Iran (CSICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSICC58665.2023.10105351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Current machine learning models achieve super-human performance in many real-world applications. Still, they are susceptible against imperceptible adversarial perturbations. The most effective solution for this problem is adversarial training that trains the model with adversarially perturbed samples instead of original ones. Various methods have been developed over recent years to improve adversarial training such as data augmentation or modifying training attacks. In this work, we examine the same problem from a new data-centric perspective. For this purpose, we first demonstrate that the existing model-based methods can be equivalent to applying smaller perturbation or optimization weights to the hard training examples. By using this finding, we propose detecting and removing these hard samples directly from the training procedure rather than applying complicated algorithms to mitigate their effects. For detection, we use maximum softmax probability as an effective method in out-of-distribution detection since we can consider the hard samples as the out-of-distribution samples for the whole data distribution. Our results on SVHN and CIFAR-10 datasets show the effectiveness of this method in improving the adversarial training without adding too much computational cost.

查看原文本刊更多论文

一种以数据为中心的方法，通过分布外检测来改进对抗训练

当前的机器学习模型在许多现实世界的应用中实现了超人的性能。尽管如此，它们还是容易受到难以察觉的对抗性扰动的影响。解决这一问题最有效的方法是对抗性训练，即用对抗性扰动样本代替原始样本来训练模型。近年来已经开发了各种方法来改进对抗性训练，例如数据增强或修改训练攻击。在这项工作中，我们从一个新的以数据为中心的角度来研究同样的问题。为此，我们首先证明了现有的基于模型的方法可以等效于对硬训练示例应用较小的扰动或优化权重。利用这一发现，我们建议直接从训练过程中检测和删除这些硬样本，而不是应用复杂的算法来减轻它们的影响。对于检测，我们使用最大softmax概率作为一种有效的分布外检测方法，因为我们可以将硬样本视为整个数据分布的分布外样本。我们在SVHN和CIFAR-10数据集上的结果表明，该方法在不增加太多计算成本的情况下有效地改进了对抗性训练。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 28th International Computer Conference, Computer Society of Iran (CSICC)

自引率

0.00%

发文量