SAT: Improving Adversarial Training via Curriculum-Based Loss Smoothing

Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security Pub Date : 2020-03-18 DOI:10.1145/3474369.3486878

Chawin Sitawarin, S. Chakraborty, David A. Wagner

{"title":"SAT: Improving Adversarial Training via Curriculum-Based Loss Smoothing","authors":"Chawin Sitawarin, S. Chakraborty, David A. Wagner","doi":"10.1145/3474369.3486878","DOIUrl":null,"url":null,"abstract":"Adversarial training (AT) has become a popular choice for training robust networks. However, it tends to sacrifice clean accuracy heavily in favor of robustness and suffers from a large generalization error. To address these concerns, we propose Smooth Adversarial Training (SAT), guided by our analysis on the eigenspectrum of the loss Hessian. We find that curriculum learning, a scheme that emphasizes on starting \"easy'' and gradually ramping up on the \"difficulty'' of training, smooths the adversarial loss landscape for a suitably chosen difficulty metric. We present a general formulation for curriculum learning in the adversarial setting and propose two difficulty metrics based on the maximal Hessian eigenvalue (H-SAT) and the softmax probability (P-SA). We demonstrate that SAT stabilizes network training even for a large perturbation norm and allows the network to operate at a better clean accuracy versus robustness trade-off curve compared to AT. This leads to a significant improvement in both clean accuracy and robustness compared to AT, TRADES, and other baselines. To highlight a few results, our best model improves normal and robust accuracy by 6% and 1% on CIFAR-100 compared to AT, respectively. On Imagenette, a ten-class subset of ImageNet, our model outperforms AT by 23% and 3% on normal and robust accuracy respectively.","PeriodicalId":411057,"journal":{"name":"Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3474369.3486878","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 29

Abstract

Adversarial training (AT) has become a popular choice for training robust networks. However, it tends to sacrifice clean accuracy heavily in favor of robustness and suffers from a large generalization error. To address these concerns, we propose Smooth Adversarial Training (SAT), guided by our analysis on the eigenspectrum of the loss Hessian. We find that curriculum learning, a scheme that emphasizes on starting "easy'' and gradually ramping up on the "difficulty'' of training, smooths the adversarial loss landscape for a suitably chosen difficulty metric. We present a general formulation for curriculum learning in the adversarial setting and propose two difficulty metrics based on the maximal Hessian eigenvalue (H-SAT) and the softmax probability (P-SA). We demonstrate that SAT stabilizes network training even for a large perturbation norm and allows the network to operate at a better clean accuracy versus robustness trade-off curve compared to AT. This leads to a significant improvement in both clean accuracy and robustness compared to AT, TRADES, and other baselines. To highlight a few results, our best model improves normal and robust accuracy by 6% and 1% on CIFAR-100 compared to AT, respectively. On Imagenette, a ten-class subset of ImageNet, our model outperforms AT by 23% and 3% on normal and robust accuracy respectively.

查看原文本刊更多论文

SAT:通过基于课程的损失平滑改进对抗性训练

对抗训练(AT)已成为训练鲁棒网络的一种流行选择。然而，它倾向于牺牲干净的准确性来支持鲁棒性，并遭受很大的泛化误差。为了解决这些问题，我们提出了平滑对抗训练(SAT)，以我们对损失黑森特征谱的分析为指导。我们发现课程学习(一种强调从“简单”开始并逐渐提高训练“难度”的方案)能够通过适当选择的难度度量来平滑对抗性损失。我们提出了对抗性环境下课程学习的一般公式，并提出了基于最大Hessian特征值(H-SAT)和softmax概率(P-SA)的两个难度指标。我们证明，即使对于大扰动范数，SAT也可以稳定网络训练，并且与at相比，SAT允许网络以更好的干净精度和鲁棒性权衡曲线运行。与AT、TRADES和其他基线相比，这在清洁精度和稳健性方面都有显著提高。为了突出显示一些结果，与AT相比，我们最好的模型在CIFAR-100上的正常和鲁棒精度分别提高了6%和1%。在ImageNet的10类子集Imagenette上，我们的模型在正常精度和鲁棒精度上分别比AT高出23%和3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security

自引率

0.00%

发文量