{"title":"AFD: Defending Convolutional Neural Networks Without Using Adversarial Samples","authors":"Nupur Thakur;Yuzhen Ding;Baoxin Li","doi":"10.1109/OJSP.2025.3571681","DOIUrl":null,"url":null,"abstract":"The vulnerability of deep neural networks to adversarial attacks has attracted much research effort. Still, studies have shown that it is challenging to simultaneously achieve both strong robustness to adversarial attacks and low degradation in the performance on the original task, as there is always a trade-off between the two objectives. In this paper, we present a novel training strategy named Adversarial-Free Defense (AFD), which introduces a minimal change to a network architecture (by modifying the first convolution layer) while employing a learning algorithm that leads to special properties of the first-layer kernels. We show how this learning strategy enhances the robustness of the network to adversarial attacks (without using adversarial samples) while maintaining a reasonable performance on the original task. Empirical results including analysis in terms of the effective Lipschitz constant of the learned network suggest that, compared to most existing methods that rely on elaborate regularization schemes imposed on all layers, our seemingly simplistic approach demonstrates high effectiveness.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"571-580"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11007011","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE open journal of signal processing","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11007011/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The vulnerability of deep neural networks to adversarial attacks has attracted much research effort. Still, studies have shown that it is challenging to simultaneously achieve both strong robustness to adversarial attacks and low degradation in the performance on the original task, as there is always a trade-off between the two objectives. In this paper, we present a novel training strategy named Adversarial-Free Defense (AFD), which introduces a minimal change to a network architecture (by modifying the first convolution layer) while employing a learning algorithm that leads to special properties of the first-layer kernels. We show how this learning strategy enhances the robustness of the network to adversarial attacks (without using adversarial samples) while maintaining a reasonable performance on the original task. Empirical results including analysis in terms of the effective Lipschitz constant of the learned network suggest that, compared to most existing methods that rely on elaborate regularization schemes imposed on all layers, our seemingly simplistic approach demonstrates high effectiveness.