lexguard:通过毫不费力的对抗强化来提升NLP的稳健性

Advances in Engineering Innovation Pub Date : 2023-10-07 DOI:10.54254/2977-3903/2/2023012

Marwan Omar

{"title":"lexguard:通过毫不费力的对抗强化来提升NLP的稳健性","authors":"Marwan Omar","doi":"10.54254/2977-3903/2/2023012","DOIUrl":null,"url":null,"abstract":"NLP models have demonstrated susceptibility to adversarial attacks, thereby compromising their robustness. Even slight modifications to input text possess the capacity to deceive NLP models, leading to inaccurate text classifications. In the present investigation, we introduce Lexi-Guard: an innovative method for Adversarial Text Generation. This approach facilitates the rapid and efficient generation of adversarial texts when supplied with initial input text. To illustrate, when targeting a sentiment classification model, the utilization of product categories as attributes is employed, ensuring that the sentiment of reviews remains unaltered. Empirical assessments were conducted on real-world NLP datasets to showcase the efficacy of our technique in producing adversarial texts that are both more semantically meaningful and exhibit greater diversity, surpassing the capabilities of numerous existing adversarial text generation methodologies. Furthermore, we leverage the generated adversarial instances to enhance models through adversarial training, demonstrating the heightened resilience of our generated attacks against model retraining endeavors and diverse model architectures.","PeriodicalId":476183,"journal":{"name":"Advances in Engineering Innovation","volume":"1 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LexiGuard: Elevating NLP robustness through effortless adversarial fortification\",\"authors\":\"Marwan Omar\",\"doi\":\"10.54254/2977-3903/2/2023012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"NLP models have demonstrated susceptibility to adversarial attacks, thereby compromising their robustness. Even slight modifications to input text possess the capacity to deceive NLP models, leading to inaccurate text classifications. In the present investigation, we introduce Lexi-Guard: an innovative method for Adversarial Text Generation. This approach facilitates the rapid and efficient generation of adversarial texts when supplied with initial input text. To illustrate, when targeting a sentiment classification model, the utilization of product categories as attributes is employed, ensuring that the sentiment of reviews remains unaltered. Empirical assessments were conducted on real-world NLP datasets to showcase the efficacy of our technique in producing adversarial texts that are both more semantically meaningful and exhibit greater diversity, surpassing the capabilities of numerous existing adversarial text generation methodologies. Furthermore, we leverage the generated adversarial instances to enhance models through adversarial training, demonstrating the heightened resilience of our generated attacks against model retraining endeavors and diverse model architectures.\",\"PeriodicalId\":476183,\"journal\":{\"name\":\"Advances in Engineering Innovation\",\"volume\":\"1 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Engineering Innovation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54254/2977-3903/2/2023012\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Engineering Innovation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54254/2977-3903/2/2023012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

NLP模型已经证明了对抗性攻击的易感性，从而损害了它们的鲁棒性。即使是对输入文本的轻微修改也有可能欺骗NLP模型，导致不准确的文本分类。在本研究中，我们介绍了Lexi-Guard:一种对抗文本生成的创新方法。当提供初始输入文本时，这种方法有助于快速有效地生成对抗性文本。为了说明，当针对情感分类模型时，使用产品类别作为属性，确保评论的情感保持不变。在真实世界的NLP数据集上进行了实证评估，以展示我们的技术在生成对抗性文本方面的有效性，这些文本在语义上更有意义，表现出更大的多样性，超越了许多现有对抗性文本生成方法的能力。此外，我们利用生成的对抗性实例通过对抗性训练来增强模型，展示了我们生成的攻击对模型再训练努力和不同模型架构的高弹性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

LexiGuard: Elevating NLP robustness through effortless adversarial fortification

NLP models have demonstrated susceptibility to adversarial attacks, thereby compromising their robustness. Even slight modifications to input text possess the capacity to deceive NLP models, leading to inaccurate text classifications. In the present investigation, we introduce Lexi-Guard: an innovative method for Adversarial Text Generation. This approach facilitates the rapid and efficient generation of adversarial texts when supplied with initial input text. To illustrate, when targeting a sentiment classification model, the utilization of product categories as attributes is employed, ensuring that the sentiment of reviews remains unaltered. Empirical assessments were conducted on real-world NLP datasets to showcase the efficacy of our technique in producing adversarial texts that are both more semantically meaningful and exhibit greater diversity, surpassing the capabilities of numerous existing adversarial text generation methodologies. Furthermore, we leverage the generated adversarial instances to enhance models through adversarial training, demonstrating the heightened resilience of our generated attacks against model retraining endeavors and diverse model architectures.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advances in Engineering Innovation

自引率

0.00%

发文量