A Semantic, Syntactic, And Context-Aware Natural Language Adversarial Example Generator

IEEE Transactions on Dependable and Secure Computing Pub Date : 2024-03-18 DOI:10.1109/TDSC.2024.3359817

Javad Asl, Mohammad H. Rafiei, Manar Alohaly, Daniel Takabi

{"title":"A Semantic, Syntactic, And Context-Aware Natural Language Adversarial Example Generator","authors":"Javad Asl, Mohammad H. Rafiei, Manar Alohaly, Daniel Takabi","doi":"10.1109/TDSC.2024.3359817","DOIUrl":null,"url":null,"abstract":"Machine learning models are vulnerable to maliciously crafted Adversarial Examples (AEs). Training a machine learning model with AEs improves its robustness and stability against adversarial attacks. It is essential to develop models that produce high-quality AEs. Developing such models has been much slower in natural language processing (NLP) than in areas such as computer vision. This paper introduces a practical and efficient adversarial attack model called SSCAE for \\textbf{S}emantic, \\textbf{S}yntactic, and \\textbf{C}ontext-aware natural language \\textbf{AE}s generator. SSCAE identifies important words and uses a masked language model to generate an early set of substitutions. Next, two well-known language models are employed to evaluate the initial set in terms of semantic and syntactic characteristics. We introduce (1) a dynamic threshold to capture more efficient perturbations and (2) a local greedy search to generate high-quality AEs. As a black-box method, SSCAE generates humanly imperceptible and context-aware AEs that preserve semantic consistency and the source language's syntactical and grammatical requirements. The effectiveness and superiority of the proposed SSCAE model are illustrated with fifteen comparative experiments and extensive sensitivity analysis for parameter optimization. SSCAE outperforms the existing models in all experiments while maintaining a higher semantic consistency with a lower query number and a comparable perturbation rate.","PeriodicalId":508198,"journal":{"name":"IEEE Transactions on Dependable and Secure Computing","volume":"352 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Dependable and Secure Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TDSC.2024.3359817","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Machine learning models are vulnerable to maliciously crafted Adversarial Examples (AEs). Training a machine learning model with AEs improves its robustness and stability against adversarial attacks. It is essential to develop models that produce high-quality AEs. Developing such models has been much slower in natural language processing (NLP) than in areas such as computer vision. This paper introduces a practical and efficient adversarial attack model called SSCAE for \textbf{S}emantic, \textbf{S}yntactic, and \textbf{C}ontext-aware natural language \textbf{AE}s generator. SSCAE identifies important words and uses a masked language model to generate an early set of substitutions. Next, two well-known language models are employed to evaluate the initial set in terms of semantic and syntactic characteristics. We introduce (1) a dynamic threshold to capture more efficient perturbations and (2) a local greedy search to generate high-quality AEs. As a black-box method, SSCAE generates humanly imperceptible and context-aware AEs that preserve semantic consistency and the source language's syntactical and grammatical requirements. The effectiveness and superiority of the proposed SSCAE model are illustrated with fifteen comparative experiments and extensive sensitivity analysis for parameter optimization. SSCAE outperforms the existing models in all experiments while maintaining a higher semantic consistency with a lower query number and a comparable perturbation rate.

查看原文本刊更多论文

语义、句法和上下文感知自然语言对抗示例生成器

机器学习模型很容易受到恶意制作的对抗性示例（AE）的攻击。使用 AE 训练机器学习模型可以提高其在对抗恶意攻击时的鲁棒性和稳定性。开发能生成高质量 AE 的模型至关重要。与计算机视觉等领域相比，自然语言处理（NLP）领域开发此类模型的速度要慢得多。本文介绍了一种名为 SSCAE 的实用而高效的对抗攻击模型，它适用于文本语义、文本句法和文本感知的自然语言文本生成器。SSCAE 可识别重要词语，并使用屏蔽语言模型生成一组早期替换词。接下来，我们使用两个著名的语言模型从语义和句法特征方面对初始集合进行评估。我们引入了（1）动态阈值来捕捉更有效的扰动，以及（2）局部贪婪搜索来生成高质量的 AE。作为一种黑盒方法，SSCAE 可生成人类无法感知且上下文感知的 AE，这些 AE 可保持语义一致性以及源语言的句法和语法要求。通过 15 个对比实验和广泛的参数优化敏感性分析，说明了所提出的 SSCAE 模型的有效性和优越性。在所有实验中，SSCAE 的表现都优于现有模型，同时以较低的查询次数和可比的扰动率保持了较高的语义一致性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Dependable and Secure Computing

自引率

0.00%

发文量