Joint Client-and-Sample Selection for Federated Learning via Bi-Level Optimization

Impact Factor: 7.7 | CAS Tier 2 (Computer Science) | JCR Q1 (Computer Science, Information Systems)
Anran Li;Guangjing Wang;Ming Hu;Jianfei Sun;Lan Zhang;Luu Anh Tuan;Han Yu
{"title":"Joint Client-and-Sample Selection for Federated Learning via Bi-Level Optimization","authors":"Anran Li;Guangjing Wang;Ming Hu;Jianfei Sun;Lan Zhang;Luu Anh Tuan;Han Yu","doi":"10.1109/TMC.2024.3455331","DOIUrl":null,"url":null,"abstract":"Federated Learning (FL) enables massive local data owners to collaboratively train a deep learning model without disclosing their private data. The importance of local data samples from various data owners to FL models varies widely. This is exacerbated by the presence of noisy data that exhibit large losses similar to important (hard) samples. Currently, there lacks an FL approach that can effectively distinguish hard samples (which are beneficial) from noisy samples (which are harmful). To bridge this gap, we propose the joint Federated Meta-Weighting based Client and Sample Selection (FedMW-CSS) approach to simultaneously mitigate label noise and hard sample selection. It is a bilevel optimization approach for FL client-and-sample selection and global model construction to achieve hard sample-aware noise-robust learning in a privacy preserving manner. It performs meta-learning based online approximation to iteratively update global FL models, select the most positively influential samples and deal with training data noise. To utilize both the instance-level information and class-level information for better performance improvements, FedMW-CSS efficiently learns a class-level weight by manipulating gradients at the class level, e.g., it performs a gradient descent step on class-level weights, which only relies on intermediate gradients. Theoretically, we analyze the privacy guarantees and convergence of FedMW-CSS. Extensive experiments comparison against eight state-of-the-art baselines on six real-world datasets in the presence of data noise and heterogeneity shows that FedMW-CSS achieves up to 28.5% higher test accuracy, while saving communication and computation costs by at least 49.3% and 1.2%, respectively.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"23 12","pages":"15196-15209"},"PeriodicalIF":7.7000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Mobile Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10668811/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Citations: 0

Abstract

Federated Learning (FL) enables a massive number of local data owners to collaboratively train a deep learning model without disclosing their private data. The importance of local data samples from various data owners to FL models varies widely. This is exacerbated by the presence of noisy data that exhibit large losses similar to important (hard) samples. Currently, no FL approach can effectively distinguish hard samples (which are beneficial) from noisy samples (which are harmful). To bridge this gap, we propose the joint Federated Meta-Weighting based Client and Sample Selection (FedMW-CSS) approach to simultaneously perform hard sample selection and mitigate label noise. It is a bilevel optimization approach for FL client-and-sample selection and global model construction that achieves hard sample-aware, noise-robust learning in a privacy-preserving manner. It performs meta-learning based online approximation to iteratively update global FL models, select the most positively influential samples, and handle training data noise. To utilize both instance-level and class-level information for better performance, FedMW-CSS efficiently learns class-level weights by manipulating gradients at the class level, e.g., it performs a gradient descent step on class-level weights that relies only on intermediate gradients. Theoretically, we analyze the privacy guarantees and convergence of FedMW-CSS. Extensive experimental comparisons against eight state-of-the-art baselines on six real-world datasets in the presence of data noise and heterogeneity show that FedMW-CSS achieves up to 28.5% higher test accuracy, while saving communication and computation costs by at least 49.3% and 1.2%, respectively.
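The meta-learning based online approximation mentioned in the abstract can be illustrated with a small, self-contained sketch. The snippet below is a hypothetical PyTorch illustration of generic bilevel sample re-weighting (in the spirit of learning-to-reweight methods), not the authors' FedMW-CSS implementation; the function name `meta_reweight_step`, the trusted "meta" batch, the clamping/normalization rule, and the toy data are all assumptions made for illustration only.

```python
# Hypothetical sketch of bilevel, meta-learning based sample re-weighting.
# NOT the FedMW-CSS implementation; it only illustrates the generic idea of
# learning per-sample weights so that samples whose upweighting hurts
# performance on a small trusted batch receive low or zero weight.
import torch
import torch.nn.functional as F
from torch.func import functional_call  # requires PyTorch >= 2.0


def meta_reweight_step(model, optimizer, noisy_x, noisy_y, meta_x, meta_y, inner_lr=0.1):
    # Inner problem: virtual one-step update with learnable per-sample weights.
    per_sample_loss = F.cross_entropy(model(noisy_x), noisy_y, reduction="none")
    eps = torch.zeros_like(per_sample_loss, requires_grad=True)  # per-sample weights
    params = dict(model.named_parameters())
    grads = torch.autograd.grad((eps * per_sample_loss).sum(),
                                list(params.values()), create_graph=True)
    virtual_params = {name: p - inner_lr * g
                      for (name, p), g in zip(params.items(), grads)}

    # Outer problem: evaluate the virtually updated model on a trusted meta batch.
    meta_logits = functional_call(model, virtual_params, (meta_x,))
    meta_loss = F.cross_entropy(meta_logits, meta_y)
    eps_grad = torch.autograd.grad(meta_loss, eps)[0]

    # Samples whose upweighting would reduce the meta loss get positive weight;
    # likely-noisy samples are clamped to zero, then weights are normalized.
    w = torch.clamp(-eps_grad, min=0.0)
    w = w / (w.sum() + 1e-8)

    # Real model update with the learned (detached) weights.
    optimizer.zero_grad()
    loss = (w.detach() * F.cross_entropy(model(noisy_x), noisy_y, reduction="none")).sum()
    loss.backward()
    optimizer.step()
    return loss.item()


# Toy usage: a linear classifier on random data with a small trusted meta batch.
if __name__ == "__main__":
    torch.manual_seed(0)
    model = torch.nn.Linear(20, 5)
    opt = torch.optim.SGD(model.parameters(), lr=0.05)
    noisy_x, noisy_y = torch.randn(64, 20), torch.randint(0, 5, (64,))
    meta_x, meta_y = torch.randn(16, 20), torch.randint(0, 5, (16,))
    for _ in range(5):
        print(meta_reweight_step(model, opt, noisy_x, noisy_y, meta_x, meta_y))
```

A class-level variant, as described in the abstract, could in principle aggregate such per-sample signals by class before the weighted update, but the exact formulation used by FedMW-CSS is given in the paper itself.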
Source Journal
IEEE Transactions on Mobile Computing
Category: Engineering & Technology - Telecommunications
CiteScore: 12.90
Self-citation rate: 2.50%
Annual publications: 403
Review time: 6.6 months
Journal Description: IEEE Transactions on Mobile Computing addresses key technical issues related to various aspects of mobile computing, including (a) architectures, (b) support services, (c) algorithm/protocol design and analysis, (d) mobile environments, (e) mobile communication systems, (f) applications, and (g) emerging technologies. Topics of interest span a wide range, covering mobile networks and hosts, mobility management, multimedia, operating system support, power management, online and mobile environments, security, scalability, reliability, and emerging technologies such as wearable computers, body area networks, and wireless sensor networks. The journal serves as a comprehensive platform for advancements in mobile computing research.