The Generation of Visually Credible Adversarial Examples with Genetic Algorithms

James R. Bradley, A. P. Blossom
{"title":"The Generation of Visually Credible Adversarial Examples with Genetic Algorithms","authors":"James R. Bradley, A. P. Blossom","doi":"10.1145/3582276","DOIUrl":null,"url":null,"abstract":"An adversarial example is an input that a neural network misclassifies although the input differs only slightly from an input that the network classifies correctly. Adversarial examples are used to augment neural network training data, measure the vulnerability of neural networks, and provide intuitive interpretations of neural network output that humans can understand. Although adversarial examples are defined in the literature as similar to authentic input from the perspective of humans, the literature measures similarity with mathematical norms that are not scientifically correlated with human perception. Our main contributions are to construct a genetic algorithm (GA) that generates adversarial examples more similar to authentic input than do existing methods and to demonstrate with a survey that humans perceive those adversarial examples to have greater visual similarity than existing methods. The GA incorporates a neural network, and we test many parameter sets to determine which fitness function, selection operator, mutation operator, and neural network generate adversarial examples most visually similar to authentic input. We establish which mathematical norms are most correlated with human perception, which permits future research to incorporate the human perspective without testing many norms or conducting intensive surveys with human subjects. We also document a tradeoff between speed and quality in adversarial examples generated by GAs and existing methods. Although existing adversarial methods are faster, a GA provides higher-quality adversarial examples in terms of visual similarity and feasibility of adversarial examples. We apply the GA to the Modified National Institute of Standards and Technology (MNIST) and Canadian Institute for Advanced Research (CIFAR-10) datasets.","PeriodicalId":220659,"journal":{"name":"ACM Transactions on Evolutionary Learning","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Evolutionary Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3582276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

An adversarial example is an input that a neural network misclassifies although the input differs only slightly from an input that the network classifies correctly. Adversarial examples are used to augment neural network training data, measure the vulnerability of neural networks, and provide intuitive interpretations of neural network output that humans can understand. Although adversarial examples are defined in the literature as similar to authentic input from the perspective of humans, the literature measures similarity with mathematical norms that are not scientifically correlated with human perception. Our main contributions are to construct a genetic algorithm (GA) that generates adversarial examples more similar to authentic input than existing methods do and to demonstrate with a survey that humans perceive those adversarial examples as more visually similar to authentic input than those produced by existing methods. The GA incorporates a neural network, and we test many parameter sets to determine which fitness function, selection operator, mutation operator, and neural network generate adversarial examples most visually similar to authentic input. We establish which mathematical norms are most correlated with human perception, which permits future research to incorporate the human perspective without testing many norms or conducting intensive surveys with human subjects. We also document a tradeoff between speed and quality in adversarial examples generated by GAs and existing methods: although existing methods are faster, a GA provides adversarial examples of higher quality in terms of visual similarity and feasibility. We apply the GA to the Modified National Institute of Standards and Technology (MNIST) and Canadian Institute For Advanced Research (CIFAR-10) datasets.
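To make the approach concrete, the sketch below shows the kind of GA loop the abstract describes. It is not the authors' implementation: predict_proba is a hypothetical, deterministic stand-in for a trained network, and the tournament selection, uniform crossover, Gaussian mutation, and L2 distance penalty are common defaults chosen for illustration, whereas the paper evaluates many combinations of fitness function, selection operator, mutation operator, and norm.

```python
import numpy as np

rng = np.random.default_rng(0)

def predict_proba(image):
    # Hypothetical stand-in for a trained network's softmax output,
    # deterministic so the sketch runs end to end; replace with a real
    # model's forward pass in practice.
    logits = np.array([image[i::10].sum() for i in range(10)])
    e = np.exp(logits - logits.max())
    return e / e.sum()

def fitness(candidate, original, true_label, lam=5.0):
    # Reward confident misclassification and penalize distance to the
    # authentic image. The paper compares several norms; L2 is an
    # illustrative choice here.
    probs = predict_proba(candidate)
    wrong = np.delete(probs, true_label).max()
    return wrong - lam * np.linalg.norm(candidate - original)

def tournament(pop, scores, k=3):
    # Tournament selection: return the best of k random candidates.
    idx = rng.choice(len(pop), size=k, replace=False)
    return pop[idx[np.argmax(scores[idx])]]

def mutate(candidate, rate=0.02, scale=0.1):
    # Perturb a small fraction of pixels with Gaussian noise, keeping
    # pixel values in [0, 1].
    mask = rng.random(candidate.shape) < rate
    noise = rng.normal(0.0, scale, candidate.shape)
    return np.clip(candidate + mask * noise, 0.0, 1.0)

def evolve(original, true_label, pop_size=50, generations=200):
    # Initialize the population as noisy copies of the authentic image.
    pop = np.clip(original + rng.normal(0.0, 0.05, (pop_size, original.size)),
                  0.0, 1.0)
    for _ in range(generations):
        scores = np.array([fitness(c, original, true_label) for c in pop])
        children = []
        for _ in range(pop_size):
            a = tournament(pop, scores)
            b = tournament(pop, scores)
            cross = rng.random(original.size) < 0.5  # uniform crossover
            children.append(mutate(np.where(cross, a, b)))
        pop = np.array(children)
    scores = np.array([fitness(c, original, true_label) for c in pop])
    return pop[np.argmax(scores)]

# Usage with a stand-in 28x28 MNIST-style image flattened to 784 values:
original = rng.random(784)
adversarial = evolve(original, true_label=3)
print(np.linalg.norm(adversarial - original))  # perturbation size
```

In a real run, predict_proba would be the forward pass of a trained MNIST or CIFAR-10 classifier, and the L2 penalty would be swapped for whichever norm best tracks human perception, which is the comparison the paper's survey is designed to settle.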