Generalization Bounds for Adversarial Contrastive Learning

J. Mach. Learn. Res. Pub Date : 2023-02-21 DOI:10.48550/arXiv.2302.10633

Xin Zou, Weiwei Liu

引用次数: 2

Abstract

Deep networks are well-known to be fragile to adversarial attacks, and adversarial training is one of the most popular methods used to train a robust model. To take advantage of unlabeled data, recent works have applied adversarial training to contrastive learning (Adversarial Contrastive Learning; ACL for short) and obtain promising robust performance. However, the theory of ACL is not well understood. To fill this gap, we leverage the Rademacher complexity to analyze the generalization performance of ACL, with a particular focus on linear models and multi-layer neural networks under $\ell_p$ attack ($p \ge 1$). Our theory shows that the average adversarial risk of the downstream tasks can be upper bounded by the adversarial unsupervised risk of the upstream task. The experimental results validate our theory.

查看原文本刊更多论文

对抗性对比学习的泛化界限

众所周知，深度网络容易受到对抗性攻击，而对抗性训练是训练鲁棒模型最常用的方法之一。为了利用未标记数据，最近的研究将对抗性训练应用于对比学习(对抗性对比学习;ACL(简称ACL)，获得了良好的鲁棒性能。然而，ACL的理论并没有得到很好的理解。为了填补这一空白，我们利用Rademacher复杂度来分析ACL的泛化性能，特别关注线性模型和多层神经网络在$\ell_p$攻击($p \ge 1$)下的性能。我们的理论表明，下游任务的平均对抗风险可以由上游任务的对抗无监督风险上界。实验结果验证了我们的理论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

J. Mach. Learn. Res.

自引率

0.00%

发文量