Defending Against Targeted Poisoning Attacks in Federated Learning

2022 IEEE 4th International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (TPS-ISA) Pub Date : 2022-12-01 DOI:10.1109/TPS-ISA56441.2022.00033

Pinar Erbil, M. E. Gursoy

{"title":"Defending Against Targeted Poisoning Attacks in Federated Learning","authors":"Pinar Erbil, M. E. Gursoy","doi":"10.1109/TPS-ISA56441.2022.00033","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) enables multiple participants to collaboratively train a deep neural network (DNN) model. To combat malicious participants in FL, Byzantine-resilient aggregation rules (AGRs) have been developed. However, although Byzantine-resilient AGRs are effective against untargeted attacks, they become suboptimal when attacks are stealthy and targeted. In this paper, we study the problem of defending against targeted data poisoning attacks in FL and make three main contributions. First, we propose a method for selective extraction of DNN parameters from FL participants’ update vectors that are indicative of attack, and embedding them into low-dimensional latent space. We show that the effectiveness of Byzantine-resilient AGRs such as Trimmed Mean and Krum can be improved if they are used in combination with our proposed method. Second, we develop a clustering-based defense using X-Means for separating items into malicious versus benign clusters in latent space. Such separation allows identification of malicious versus benign updates. Third, using the separation from the previous step, we show that a \"clean\" model (i.e., a model that is not negatively impacted by the attack) can be trained using only the benign updates. We experimentally evaluate our defense methods on Fashion-MNIST and CIFAR-10 datasets. Results show that our methods can achieve up to 95% true positive rate and 99% accuracy in malicious update identification across various settings. In addition, the clean models trained using our approach achieve similar accuracy compared to a baseline scenario without poisoning.","PeriodicalId":427887,"journal":{"name":"2022 IEEE 4th International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (TPS-ISA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 4th International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (TPS-ISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TPS-ISA56441.2022.00033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Federated learning (FL) enables multiple participants to collaboratively train a deep neural network (DNN) model. To combat malicious participants in FL, Byzantine-resilient aggregation rules (AGRs) have been developed. However, although Byzantine-resilient AGRs are effective against untargeted attacks, they become suboptimal when attacks are stealthy and targeted. In this paper, we study the problem of defending against targeted data poisoning attacks in FL and make three main contributions. First, we propose a method for selective extraction of DNN parameters from FL participants’ update vectors that are indicative of attack, and embedding them into low-dimensional latent space. We show that the effectiveness of Byzantine-resilient AGRs such as Trimmed Mean and Krum can be improved if they are used in combination with our proposed method. Second, we develop a clustering-based defense using X-Means for separating items into malicious versus benign clusters in latent space. Such separation allows identification of malicious versus benign updates. Third, using the separation from the previous step, we show that a "clean" model (i.e., a model that is not negatively impacted by the attack) can be trained using only the benign updates. We experimentally evaluate our defense methods on Fashion-MNIST and CIFAR-10 datasets. Results show that our methods can achieve up to 95% true positive rate and 99% accuracy in malicious update identification across various settings. In addition, the clean models trained using our approach achieve similar accuracy compared to a baseline scenario without poisoning.

查看原文本刊更多论文

防御联邦学习中的针对性中毒攻击

联邦学习(FL)使多个参与者能够协同训练深度神经网络(DNN)模型。为了打击FL中的恶意参与者，开发了拜占庭弹性聚合规则(agr)。然而，尽管拜占庭弹性agr对非目标攻击有效，但当攻击是隐形的和有目标的时，它们就变得次优了。本文主要研究了FL中目标数据中毒攻击的防御问题，并做出了三个主要贡献。首先，我们提出了一种从FL参与者的指示攻击的更新向量中选择性提取DNN参数并将其嵌入到低维潜在空间的方法。我们表明，如果与我们提出的方法结合使用，拜占庭弹性agr(如trim Mean和Krum)的有效性可以得到提高。其次，我们开发了一个基于聚类的防御，使用X-Means将项目分为潜在空间中的恶意和良性聚类。这种分离允许识别恶意更新和良性更新。第三，使用与前一步的分离，我们展示了一个“干净”模型(即，一个没有受到攻击负面影响的模型)可以只使用良性更新来训练。我们在Fashion-MNIST和CIFAR-10数据集上实验评估了我们的防御方法。结果表明，我们的方法可以在各种设置下实现高达95%的真阳性率和99%的准确率的恶意更新识别。此外，与没有中毒的基线情景相比，使用我们的方法训练的清洁模型达到了相似的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 4th International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (TPS-ISA)

自引率

0.00%

发文量