Understanding and Quantifying Adversarial Examples Existence in Linear Classification

2022 International Conference on Machine Learning and Cybernetics (ICMLC) Pub Date : 2019-10-27 DOI:10.1109/ICMLC56445.2022.9941315

Xupeng Shi, A. Ding

引用次数: 3

Abstract

State-of-art deep neural networks (DNN) are vulnerable to attacks by adversarial examples: a carefully designed small perturbation to the input, that is imperceptible to human, can mislead DNN. To understand the root cause of adversarial examples, we quantify the probability of adversarial example existence for linear classifiers. Previous mathematical definition of adversarial examples only involves the overall perturbation amount, and we propose a more practical relevant definition of strong adversarial examples that separately limits the perturbation along the signal direction also. We show that linear classifiers can be made robust to strong adversarial examples attack in cases where no adversarial robust linear classifiers exist under the previous definition. The results suggest that designing general strong-adversarial-robust learning systems is feasible but only through incorporating human knowledge of the underlying classification problem.

查看原文本刊更多论文

理解和量化线性分类中对抗性例子的存在

最先进的深度神经网络(DNN)很容易受到对抗性示例的攻击:对输入进行精心设计的小扰动，这是人类无法察觉的，可以误导DNN。为了理解对抗性示例的根本原因，我们量化了线性分类器对抗性示例存在的概率。以前对抗性示例的数学定义只涉及总体摄动量，我们提出了一个更实用的强对抗性示例的相关定义，该定义也单独限制了沿信号方向的摄动。我们证明了在之前的定义下不存在对抗性鲁棒线性分类器的情况下，线性分类器可以对强对抗性示例攻击具有鲁棒性。结果表明，设计一般的强对抗鲁棒学习系统是可行的，但只有通过结合人类对潜在分类问题的知识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Machine Learning and Cybernetics (ICMLC)

自引率

0.00%

发文量