基于扩散模型的真实世界补丁攻击对抗防御系统

Xingxing Wei, Caixin Kang, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su
{"title":"基于扩散模型的真实世界补丁攻击对抗防御系统","authors":"Xingxing Wei, Caixin Kang, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su","doi":"arxiv-2409.09406","DOIUrl":null,"url":null,"abstract":"Adversarial patches present significant challenges to the robustness of deep\nlearning models, making the development of effective defenses become critical\nfor real-world applications. This paper introduces DIFFender, a novel\nDIFfusion-based DeFender framework that leverages the power of a text-guided\ndiffusion model to counter adversarial patch attacks. At the core of our\napproach is the discovery of the Adversarial Anomaly Perception (AAP)\nphenomenon, which enables the diffusion model to accurately detect and locate\nadversarial patches by analyzing distributional anomalies. DIFFender seamlessly\nintegrates the tasks of patch localization and restoration within a unified\ndiffusion model framework, enhancing defense efficacy through their close\ninteraction. Additionally, DIFFender employs an efficient few-shot\nprompt-tuning algorithm, facilitating the adaptation of the pre-trained\ndiffusion model to defense tasks without the need for extensive retraining. Our\ncomprehensive evaluation, covering image classification and face recognition\ntasks, as well as real-world scenarios, demonstrates DIFFender's robust\nperformance against adversarial attacks. The framework's versatility and\ngeneralizability across various settings, classifiers, and attack methodologies\nmark a significant advancement in adversarial patch defense strategies. Except\nfor the popular visible domain, we have identified another advantage of\nDIFFender: its capability to easily expand into the infrared domain.\nConsequently, we demonstrate the good flexibility of DIFFender, which can\ndefend against both infrared and visible adversarial patch attacks\nalternatively using a universal defense framework.","PeriodicalId":501332,"journal":{"name":"arXiv - CS - Cryptography and Security","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-world Adversarial Defense against Patch Attacks based on Diffusion Model\",\"authors\":\"Xingxing Wei, Caixin Kang, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su\",\"doi\":\"arxiv-2409.09406\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Adversarial patches present significant challenges to the robustness of deep\\nlearning models, making the development of effective defenses become critical\\nfor real-world applications. This paper introduces DIFFender, a novel\\nDIFfusion-based DeFender framework that leverages the power of a text-guided\\ndiffusion model to counter adversarial patch attacks. At the core of our\\napproach is the discovery of the Adversarial Anomaly Perception (AAP)\\nphenomenon, which enables the diffusion model to accurately detect and locate\\nadversarial patches by analyzing distributional anomalies. DIFFender seamlessly\\nintegrates the tasks of patch localization and restoration within a unified\\ndiffusion model framework, enhancing defense efficacy through their close\\ninteraction. Additionally, DIFFender employs an efficient few-shot\\nprompt-tuning algorithm, facilitating the adaptation of the pre-trained\\ndiffusion model to defense tasks without the need for extensive retraining. Our\\ncomprehensive evaluation, covering image classification and face recognition\\ntasks, as well as real-world scenarios, demonstrates DIFFender's robust\\nperformance against adversarial attacks. The framework's versatility and\\ngeneralizability across various settings, classifiers, and attack methodologies\\nmark a significant advancement in adversarial patch defense strategies. Except\\nfor the popular visible domain, we have identified another advantage of\\nDIFFender: its capability to easily expand into the infrared domain.\\nConsequently, we demonstrate the good flexibility of DIFFender, which can\\ndefend against both infrared and visible adversarial patch attacks\\nalternatively using a universal defense framework.\",\"PeriodicalId\":501332,\"journal\":{\"name\":\"arXiv - CS - Cryptography and Security\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Cryptography and Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09406\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Cryptography and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09406","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

对抗性补丁对深度学习模型的鲁棒性提出了巨大挑战,因此开发有效的防御措施对现实世界的应用至关重要。本文介绍了 DIFFender,这是一种基于 DIFfusion 的新型 DeFender 框架,它利用文本引导扩散模型的强大功能来对抗对抗性补丁攻击。我们方法的核心是发现了对抗性异常感知(AAP)现象,它使扩散模型能够通过分析分布异常来准确检测和定位对抗性补丁。DIFFender 在统一的扩散模型框架内无缝集成了补丁定位和恢复任务,通过它们之间的紧密交互提高了防御效率。此外,DIFFender 还采用了一种高效的几发提示调整算法,使预先训练好的扩散模型能够适应防御任务,而无需进行大量的重新训练。我们的综合评估涵盖了图像分类和人脸识别任务以及现实世界的各种场景,证明了 DIFFender 在对抗恶意攻击方面的强大性能。该框架在各种环境、分类器和攻击方法中的通用性和通用性标志着对抗性补丁防御策略的重大进步。因此,我们展示了 DIFFender 的良好灵活性,它可以利用通用防御框架同时防御红外和可见光对抗性补丁攻击。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Real-world Adversarial Defense against Patch Attacks based on Diffusion Model
Adversarial patches present significant challenges to the robustness of deep learning models, making the development of effective defenses become critical for real-world applications. This paper introduces DIFFender, a novel DIFfusion-based DeFender framework that leverages the power of a text-guided diffusion model to counter adversarial patch attacks. At the core of our approach is the discovery of the Adversarial Anomaly Perception (AAP) phenomenon, which enables the diffusion model to accurately detect and locate adversarial patches by analyzing distributional anomalies. DIFFender seamlessly integrates the tasks of patch localization and restoration within a unified diffusion model framework, enhancing defense efficacy through their close interaction. Additionally, DIFFender employs an efficient few-shot prompt-tuning algorithm, facilitating the adaptation of the pre-trained diffusion model to defense tasks without the need for extensive retraining. Our comprehensive evaluation, covering image classification and face recognition tasks, as well as real-world scenarios, demonstrates DIFFender's robust performance against adversarial attacks. The framework's versatility and generalizability across various settings, classifiers, and attack methodologies mark a significant advancement in adversarial patch defense strategies. Except for the popular visible domain, we have identified another advantage of DIFFender: its capability to easily expand into the infrared domain. Consequently, we demonstrate the good flexibility of DIFFender, which can defend against both infrared and visible adversarial patch attacks alternatively using a universal defense framework.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信