Generating Textual Adversaries with Minimal Perturbation

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing Pub Date : 2022-11-12 DOI:10.48550/arXiv.2211.06571

Xingyi Zhao, Lu Zhang, Depeng Xu, Shuhan Yuan

引用次数: 0

Abstract

Many word-level adversarial attack approaches for textual data have been proposed in recent studies. However, due to the massive search space consisting of combinations of candidate words, the existing approaches face the problem of preserving the semantics of texts when crafting adversarial counterparts. In this paper, we develop a novel attack strategy to find adversarial texts with high similarity to the original texts while introducing minimal perturbation. The rationale is that we expect the adversarial texts with small perturbation can better preserve the semantic meaning of original texts. Experiments show that, compared with state-of-the-art attack approaches, our approach achieves higher success rates and lower perturbation rates in four benchmark datasets.

查看原文本刊更多论文

用最小扰动生成文本对手

近年来，研究人员提出了许多针对文本数据的词级对抗性攻击方法。然而，由于候选词组合的巨大搜索空间，现有的方法面临着在制作对抗性对应时保留文本语义的问题。在本文中，我们开发了一种新的攻击策略，在引入最小扰动的情况下找到与原始文本高度相似的对抗文本。其基本原理是我们期望扰动较小的对抗性文本能够更好地保留原文本的语义。实验表明，与最先进的攻击方法相比，我们的方法在四个基准数据集上获得了更高的成功率和更低的扰动率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing

自引率

0.00%

发文量