Multi-target Backdoor Attacks for Code Pre-trained Models

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-06-14 DOI:10.48550/arXiv.2306.08350

Yanzhou Li, Shangqing Liu, Kangjie Chen, Xiaofei Xie, Tianwei Zhang, Yang Liu

引用次数: 3

Abstract

Backdoor attacks for neural code models have gained considerable attention due to the advancement of code intelligence. However, most existing works insert triggers into task-specific data for code-related downstream tasks, thereby limiting the scope of attacks. Moreover, the majority of attacks for pre-trained models are designed for understanding tasks. In this paper, we propose task-agnostic backdoor attacks for code pre-trained models. Our backdoored model is pre-trained with two learning strategies (i.e., Poisoned Seq2Seq learning and token representation learning) to support the multi-target attack of downstream code understanding and generation tasks. During the deployment phase, the implanted backdoors in the victim models can be activated by the designed triggers to achieve the targeted attack. We evaluate our approach on two code understanding tasks and three code generation tasks over seven datasets. Extensive experimental results demonstrate that our approach effectively and stealthily attacks code-related downstream tasks.

查看原文本刊更多论文

代码预训练模型的多目标后门攻击

随着代码智能的发展，神经代码模型的后门攻击受到了越来越多的关注。然而，大多数现有的工作将触发器插入到与代码相关的下游任务的特定于任务的数据中，从而限制了攻击的范围。此外，大多数针对预训练模型的攻击都是为了理解任务而设计的。在本文中，我们提出了针对代码预训练模型的任务不可知后门攻击。我们的后门模型使用两种学习策略(即中毒Seq2Seq学习和令牌表示学习)进行预训练，以支持下游代码理解和生成任务的多目标攻击。在部署阶段，受害者模型中植入的后门可以被设计的触发器激活，以实现目标攻击。我们在七个数据集上对两个代码理解任务和三个代码生成任务进行了评估。大量的实验结果表明，我们的方法有效且隐蔽地攻击与代码相关的下游任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annual Meeting of the Association for Computational Linguistics

自引率

0.00%

发文量