LoRID：逆向纯化的低链迭代扩散

arXiv - CS - Machine Learning Pub Date : 2024-09-12 DOI:arxiv-2409.08255

Geigh Zollicoffer, Minh Vu, Ben Nebgen, Juan Castorena, Boian Alexandrov, Manish Bhattarai

{"title":"LoRID：逆向纯化的低链迭代扩散","authors":"Geigh Zollicoffer, Minh Vu, Ben Nebgen, Juan Castorena, Boian Alexandrov, Manish Bhattarai","doi":"arxiv-2409.08255","DOIUrl":null,"url":null,"abstract":"This work presents an information-theoretic examination of diffusion-based\npurification methods, the state-of-the-art adversarial defenses that utilize\ndiffusion models to remove malicious perturbations in adversarial examples. By\ntheoretically characterizing the inherent purification errors associated with\nthe Markov-based diffusion purifications, we introduce LoRID, a novel Low-Rank\nIterative Diffusion purification method designed to remove adversarial\nperturbation with low intrinsic purification errors. LoRID centers around a\nmulti-stage purification process that leverages multiple rounds of\ndiffusion-denoising loops at the early time-steps of the diffusion models, and\nthe integration of Tucker decomposition, an extension of matrix factorization,\nto remove adversarial noise at high-noise regimes. Consequently, LoRID\nincreases the effective diffusion time-steps and overcomes strong adversarial\nattacks, achieving superior robustness performance in CIFAR-10/100, CelebA-HQ,\nand ImageNet datasets under both white-box and black-box settings.","PeriodicalId":501301,"journal":{"name":"arXiv - CS - Machine Learning","volume":"6 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LoRID: Low-Rank Iterative Diffusion for Adversarial Purification\",\"authors\":\"Geigh Zollicoffer, Minh Vu, Ben Nebgen, Juan Castorena, Boian Alexandrov, Manish Bhattarai\",\"doi\":\"arxiv-2409.08255\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work presents an information-theoretic examination of diffusion-based\\npurification methods, the state-of-the-art adversarial defenses that utilize\\ndiffusion models to remove malicious perturbations in adversarial examples. By\\ntheoretically characterizing the inherent purification errors associated with\\nthe Markov-based diffusion purifications, we introduce LoRID, a novel Low-Rank\\nIterative Diffusion purification method designed to remove adversarial\\nperturbation with low intrinsic purification errors. LoRID centers around a\\nmulti-stage purification process that leverages multiple rounds of\\ndiffusion-denoising loops at the early time-steps of the diffusion models, and\\nthe integration of Tucker decomposition, an extension of matrix factorization,\\nto remove adversarial noise at high-noise regimes. Consequently, LoRID\\nincreases the effective diffusion time-steps and overcomes strong adversarial\\nattacks, achieving superior robustness performance in CIFAR-10/100, CelebA-HQ,\\nand ImageNet datasets under both white-box and black-box settings.\",\"PeriodicalId\":501301,\"journal\":{\"name\":\"arXiv - CS - Machine Learning\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.08255\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本研究从信息论角度对基于扩散的净化方法进行了研究，这些方法是最先进的对抗防御手段，利用扩散模型来消除对抗示例中的恶意扰动。通过从理论上描述与基于马尔可夫的扩散净化相关的固有净化误差，我们引入了 LoRID，这是一种新型的低阶迭代扩散净化方法，旨在以较低的固有净化误差消除对抗性扰动。LoRID 以多级净化过程为中心，在扩散模型的早期时间步骤利用多轮扩散-去噪循环，并结合矩阵因式分解的扩展--塔克分解，以去除高噪声状态下的对抗性噪声。因此，LoRID 增加了有效的扩散时间步数，克服了强大的对抗性攻击，在 CIFAR-10/100、CelebA-HQ 和 ImageNet 数据集的白盒和黑盒设置下都取得了卓越的鲁棒性表现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

LoRID: Low-Rank Iterative Diffusion for Adversarial Purification

This work presents an information-theoretic examination of diffusion-based purification methods, the state-of-the-art adversarial defenses that utilize diffusion models to remove malicious perturbations in adversarial examples. By theoretically characterizing the inherent purification errors associated with the Markov-based diffusion purifications, we introduce LoRID, a novel Low-Rank Iterative Diffusion purification method designed to remove adversarial perturbation with low intrinsic purification errors. LoRID centers around a multi-stage purification process that leverages multiple rounds of diffusion-denoising loops at the early time-steps of the diffusion models, and the integration of Tucker decomposition, an extension of matrix factorization, to remove adversarial noise at high-noise regimes. Consequently, LoRID increases the effective diffusion time-steps and overcomes strong adversarial attacks, achieving superior robustness performance in CIFAR-10/100, CelebA-HQ, and ImageNet datasets under both white-box and black-box settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Machine Learning

自引率

0.00%

发文量