BABE: Backdoor attack with bokeh effects via latent separation suppression

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2024-10-22 DOI:10.1016/j.engappai.2024.109462

Junjian Li , Honglong Chen , Yudong Gao , Shaozhong Guo , Kai Lin , Yuping Liu , Peng Sun

{"title":"BABE: Backdoor attack with bokeh effects via latent separation suppression","authors":"Junjian Li , Honglong Chen , Yudong Gao , Shaozhong Guo , Kai Lin , Yuping Liu , Peng Sun","doi":"10.1016/j.engappai.2024.109462","DOIUrl":null,"url":null,"abstract":"<div><div>The escalating menace of backdoor attacks constitutes a formidable obstacle to the ongoing advancement of deep neural networks (DNNs), particularly in the security-sensitive applications such as face recognition and self-driving. Backdoored models render deliberately incorrect predictions on the inputs with the crafted triggers while behaving normally with the benign ones. Despite demonstrating the varying degrees of threat, existing backdoor attack strategies often prioritize stealthiness and defense evasions but neglect the practical feasibility in the real-world deployment scenarios. In this paper, we develop a backdoor attack leveraging bokeh effects (<span><math><mrow><mi>B</mi><mi>A</mi><mi>B</mi><mi>E</mi></mrow></math></span>), which introduces the bokeh effects as the trigger. Once the backdoored model is deployed in the vision application, the model’s malicious behaviors can be activated only by utilizing the captured bokeh images without any other modifications. Specially, we employ the saliency and depth estimation maps to derive the bokeh images, thereby serving as the poisoned samples. Furthermore, to avoid the latent separation of the generated poisoned images, we propose distinct attack strategies on the basis of the adversary’s prior abilities. For the adversary only with the data manipulation, we retain the original semantic labels for a subset of poisoned data during the training process. For the adversary with the manipulation of both the data and models, we construct a reference model trained on the clean samples to impose constraints on the latent representations of the poisoned images. Extensive experiments demonstrate the attack effects of the proposed <span><math><mrow><mi>B</mi><mi>A</mi><mi>B</mi><mi>E</mi></mrow></math></span>, even on the bokeh photos captured from Digital Still Cameras (DSC) and smartphones.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"138 ","pages":"Article 109462"},"PeriodicalIF":7.5000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016208","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The escalating menace of backdoor attacks constitutes a formidable obstacle to the ongoing advancement of deep neural networks (DNNs), particularly in the security-sensitive applications such as face recognition and self-driving. Backdoored models render deliberately incorrect predictions on the inputs with the crafted triggers while behaving normally with the benign ones. Despite demonstrating the varying degrees of threat, existing backdoor attack strategies often prioritize stealthiness and defense evasions but neglect the practical feasibility in the real-world deployment scenarios. In this paper, we develop a backdoor attack leveraging bokeh effects (

B A B E

), which introduces the bokeh effects as the trigger. Once the backdoored model is deployed in the vision application, the model’s malicious behaviors can be activated only by utilizing the captured bokeh images without any other modifications. Specially, we employ the saliency and depth estimation maps to derive the bokeh images, thereby serving as the poisoned samples. Furthermore, to avoid the latent separation of the generated poisoned images, we propose distinct attack strategies on the basis of the adversary’s prior abilities. For the adversary only with the data manipulation, we retain the original semantic labels for a subset of poisoned data during the training process. For the adversary with the manipulation of both the data and models, we construct a reference model trained on the clean samples to impose constraints on the latent representations of the poisoned images. Extensive experiments demonstrate the attack effects of the proposed

B A B E

, even on the bokeh photos captured from Digital Still Cameras (DSC) and smartphones.

查看原文本刊更多论文

BABE：通过潜在分离抑制实现虚化效果的后门攻击

不断升级的后门攻击威胁对深度神经网络（DNN）的持续发展构成了巨大障碍，尤其是在人脸识别和自动驾驶等对安全敏感的应用领域。受后门攻击的模型会故意用精心制作的触发器对输入做出错误的预测，而对良性触发器则表现正常。尽管存在不同程度的威胁，但现有的后门攻击策略往往优先考虑隐蔽性和防御规避，却忽视了在现实世界部署场景中的实际可行性。本文开发了一种利用虚化效果的后门攻击（BABE），引入虚化效果作为触发器。一旦在视觉应用中部署了后门模型，只需利用捕捉到的虚化图像就能激活模型的恶意行为，而无需做任何其他修改。特别是，我们利用显著性和深度估计图来获取虚化图像，从而作为中毒样本。此外，为了避免对生成的中毒图像进行潜在分离，我们根据对手的先验能力提出了不同的攻击策略。对于只具有数据操作能力的对手，我们在训练过程中保留了中毒数据子集的原始语义标签。对于同时操纵数据和模型的对手，我们构建一个在干净样本上训练的参考模型，对中毒图像的潜在表示施加约束。广泛的实验证明了所提出的 BABE 的攻击效果，即使是在从数码相机（DSC）和智能手机捕获的虚化照片上也是如此。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.