基于异构数据和资源驱动的高效联邦学习客户端选择算法

2023 IEEE International Conference on Joint Cloud Computing (JCC) Pub Date : 2023-07-01 DOI:10.1109/JCC59055.2023.00012

Ruilin Zhang, Zhenan Xu, Hao Yin

{"title":"基于异构数据和资源驱动的高效联邦学习客户端选择算法","authors":"Ruilin Zhang, Zhenan Xu, Hao Yin","doi":"10.1109/JCC59055.2023.00012","DOIUrl":null,"url":null,"abstract":"Federated Learning is a novel distributed machine learning paradigm that leverages the computing power of numerous decentralized data sources for jointly training machine learning models while ensuring user privacy. In the most commonly used cross-device scenarios, the client cluster typically cover a vast number of heterogeneous end devices. Due to physical limitations such as bandwidth, only a few clients can participate in each round of training. The core issue of the client selection is to determine an appropriate client set for each training round. However, existing selection algorithms, especially the widely adopted random selection, suffer from a number of issues that prevent them from achieving a good balance between training efficiency and speed. Therefore, we propose Scout, which utilizes the heterogeneity features of clients’ data and resources to jointly model the utility function, and enhances the utilization of correlation among clients and the diversity among selected clients to achieve better training efficiency and speed. Furthermore, Scout maintains the scalability and fairness. Our experiments demonstrate that in large-scale heterogeneous clients scenarios, Scout outperforms three baseline algorithms and the state-of-the-art dual-feature dimension algorithm Oort in evaluation metrics.","PeriodicalId":117254,"journal":{"name":"2023 IEEE International Conference on Joint Cloud Computing (JCC)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Scout: An Efficient Federated Learning Client Selection Algorithm Driven by Heterogeneous Data and Resource\",\"authors\":\"Ruilin Zhang, Zhenan Xu, Hao Yin\",\"doi\":\"10.1109/JCC59055.2023.00012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated Learning is a novel distributed machine learning paradigm that leverages the computing power of numerous decentralized data sources for jointly training machine learning models while ensuring user privacy. In the most commonly used cross-device scenarios, the client cluster typically cover a vast number of heterogeneous end devices. Due to physical limitations such as bandwidth, only a few clients can participate in each round of training. The core issue of the client selection is to determine an appropriate client set for each training round. However, existing selection algorithms, especially the widely adopted random selection, suffer from a number of issues that prevent them from achieving a good balance between training efficiency and speed. Therefore, we propose Scout, which utilizes the heterogeneity features of clients’ data and resources to jointly model the utility function, and enhances the utilization of correlation among clients and the diversity among selected clients to achieve better training efficiency and speed. Furthermore, Scout maintains the scalability and fairness. Our experiments demonstrate that in large-scale heterogeneous clients scenarios, Scout outperforms three baseline algorithms and the state-of-the-art dual-feature dimension algorithm Oort in evaluation metrics.\",\"PeriodicalId\":117254,\"journal\":{\"name\":\"2023 IEEE International Conference on Joint Cloud Computing (JCC)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Joint Cloud Computing (JCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JCC59055.2023.00012\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Joint Cloud Computing (JCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCC59055.2023.00012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

联邦学习是一种新型的分布式机器学习范式，它利用众多分散数据源的计算能力来联合训练机器学习模型，同时确保用户隐私。在最常用的跨设备场景中，客户端集群通常覆盖大量异构终端设备。由于带宽等物理限制，每轮培训只能有少数客户参加。客户选择的核心问题是为每一轮培训确定一个合适的客户集。然而，现有的选择算法，尤其是被广泛采用的随机选择算法，存在许多问题，使得它们无法在训练效率和速度之间取得良好的平衡。因此，我们提出Scout，利用客户数据和资源的异质性特征共同建模效用函数，增强客户之间的相关性和所选客户之间的多样性的利用，以达到更好的培训效率和速度。此外，Scout还保持了可扩展性和公平性。我们的实验表明，在大规模异构客户端场景中，Scout在评估指标上优于三种基线算法和最先进的双特征维度算法Oort。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Scout: An Efficient Federated Learning Client Selection Algorithm Driven by Heterogeneous Data and Resource

Federated Learning is a novel distributed machine learning paradigm that leverages the computing power of numerous decentralized data sources for jointly training machine learning models while ensuring user privacy. In the most commonly used cross-device scenarios, the client cluster typically cover a vast number of heterogeneous end devices. Due to physical limitations such as bandwidth, only a few clients can participate in each round of training. The core issue of the client selection is to determine an appropriate client set for each training round. However, existing selection algorithms, especially the widely adopted random selection, suffer from a number of issues that prevent them from achieving a good balance between training efficiency and speed. Therefore, we propose Scout, which utilizes the heterogeneity features of clients’ data and resources to jointly model the utility function, and enhances the utilization of correlation among clients and the diversity among selected clients to achieve better training efficiency and speed. Furthermore, Scout maintains the scalability and fairness. Our experiments demonstrate that in large-scale heterogeneous clients scenarios, Scout outperforms three baseline algorithms and the state-of-the-art dual-feature dimension algorithm Oort in evaluation metrics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 IEEE International Conference on Joint Cloud Computing (JCC)

自引率

0.00%

发文量