APR-Net: Defense Against Adversarial Examples Based on Universal Adversarial Perturbation Removal Network

IEEE transactions on artificial intelligence Pub Date : 2024-11-22 DOI:10.1109/TAI.2024.3504478

Wenxing Liao;Zhuxian Liu;Minghuang Shen;Riqing Chen;Xiaolong Liu

{"title":"APR-Net: Defense Against Adversarial Examples Based on Universal Adversarial Perturbation Removal Network","authors":"Wenxing Liao;Zhuxian Liu;Minghuang Shen;Riqing Chen;Xiaolong Liu","doi":"10.1109/TAI.2024.3504478","DOIUrl":null,"url":null,"abstract":"Adversarial attack, a bleeding-edge technique that attempts to fool deep learning classification model by generating adversarial examples with imperceptible perturbations, is becoming a growing threat in artificial intelligence fields. Preprocessing models that remove perturbations are an effective approach for enhancing the robustness of classification models. However, most existing methods overlook a critical issue: although powerful preprocessing operations can remove adversarial perturbations, they may also weaken the representation of key features in the image, leading to decreased defense performance. To address this, we propose a novel universal defense model, APR-Net, which aims to remove adversarial perturbations while effectively preserving high-quality images. The key innovation of APR-Net lies in its dual-module design, which consists of a denoising module and an image restoration module. This design not only effectively eliminates imperceptible adversarial perturbations but also ensures the restoration of high-quality images. Unlike existing methods, APR-Net does not require modifications to the classifier architecture or specialized adversarial training, making it highly versatile. Extensive experiments on the ImageNet dataset demonstrate that APR-Net provides strong defense against various adversarial attack algorithms, significantly improves image quality, and outperforms other state-of-the-art defense methods in terms of overall performance.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 4","pages":"945-954"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10765144/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Adversarial attack, a bleeding-edge technique that attempts to fool deep learning classification model by generating adversarial examples with imperceptible perturbations, is becoming a growing threat in artificial intelligence fields. Preprocessing models that remove perturbations are an effective approach for enhancing the robustness of classification models. However, most existing methods overlook a critical issue: although powerful preprocessing operations can remove adversarial perturbations, they may also weaken the representation of key features in the image, leading to decreased defense performance. To address this, we propose a novel universal defense model, APR-Net, which aims to remove adversarial perturbations while effectively preserving high-quality images. The key innovation of APR-Net lies in its dual-module design, which consists of a denoising module and an image restoration module. This design not only effectively eliminates imperceptible adversarial perturbations but also ensures the restoration of high-quality images. Unlike existing methods, APR-Net does not require modifications to the classifier architecture or specialized adversarial training, making it highly versatile. Extensive experiments on the ImageNet dataset demonstrate that APR-Net provides strong defense against various adversarial attack algorithms, significantly improves image quality, and outperforms other state-of-the-art defense methods in terms of overall performance.

查看原文本刊更多论文

APR-Net：基于通用对抗性扰动去除网络的对抗性实例防御

对抗性攻击（Adversarial attack）是一种试图通过生成具有难以察觉扰动的对抗性示例来欺骗深度学习分类模型的前沿技术，正在成为人工智能领域日益增长的威胁。去除扰动的预处理模型是提高分类模型鲁棒性的有效方法。然而，大多数现有方法忽略了一个关键问题：尽管强大的预处理操作可以消除对抗性扰动，但它们也可能削弱图像中关键特征的表示，导致防御性能下降。为了解决这个问题，我们提出了一种新的通用防御模型，APR-Net，旨在消除对抗性扰动，同时有效地保留高质量的图像。APR-Net的创新之处在于它的双模块设计，即去噪模块和图像恢复模块。这种设计不仅有效地消除了难以察觉的对抗性扰动，而且保证了高质量图像的恢复。与现有方法不同，APR-Net不需要修改分类器架构或专门的对抗训练，使其具有高度的通用性。在ImageNet数据集上进行的大量实验表明，APR-Net提供了针对各种对抗性攻击算法的强大防御，显著提高了图像质量，并且在整体性能方面优于其他最先进的防御方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on artificial intelligence

CiteScore

7.70

自引率

0.00%

发文量