基于异构数据和资源驱动的高效联邦学习客户端选择算法

Ruilin Zhang, Zhenan Xu, Hao Yin
{"title":"基于异构数据和资源驱动的高效联邦学习客户端选择算法","authors":"Ruilin Zhang, Zhenan Xu, Hao Yin","doi":"10.1109/JCC59055.2023.00012","DOIUrl":null,"url":null,"abstract":"Federated Learning is a novel distributed machine learning paradigm that leverages the computing power of numerous decentralized data sources for jointly training machine learning models while ensuring user privacy. In the most commonly used cross-device scenarios, the client cluster typically cover a vast number of heterogeneous end devices. Due to physical limitations such as bandwidth, only a few clients can participate in each round of training. The core issue of the client selection is to determine an appropriate client set for each training round. However, existing selection algorithms, especially the widely adopted random selection, suffer from a number of issues that prevent them from achieving a good balance between training efficiency and speed. Therefore, we propose Scout, which utilizes the heterogeneity features of clients’ data and resources to jointly model the utility function, and enhances the utilization of correlation among clients and the diversity among selected clients to achieve better training efficiency and speed. Furthermore, Scout maintains the scalability and fairness. Our experiments demonstrate that in large-scale heterogeneous clients scenarios, Scout outperforms three baseline algorithms and the state-of-the-art dual-feature dimension algorithm Oort in evaluation metrics.","PeriodicalId":117254,"journal":{"name":"2023 IEEE International Conference on Joint Cloud Computing (JCC)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Scout: An Efficient Federated Learning Client Selection Algorithm Driven by Heterogeneous Data and Resource\",\"authors\":\"Ruilin Zhang, Zhenan Xu, Hao Yin\",\"doi\":\"10.1109/JCC59055.2023.00012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated Learning is a novel distributed machine learning paradigm that leverages the computing power of numerous decentralized data sources for jointly training machine learning models while ensuring user privacy. In the most commonly used cross-device scenarios, the client cluster typically cover a vast number of heterogeneous end devices. Due to physical limitations such as bandwidth, only a few clients can participate in each round of training. The core issue of the client selection is to determine an appropriate client set for each training round. However, existing selection algorithms, especially the widely adopted random selection, suffer from a number of issues that prevent them from achieving a good balance between training efficiency and speed. Therefore, we propose Scout, which utilizes the heterogeneity features of clients’ data and resources to jointly model the utility function, and enhances the utilization of correlation among clients and the diversity among selected clients to achieve better training efficiency and speed. Furthermore, Scout maintains the scalability and fairness. Our experiments demonstrate that in large-scale heterogeneous clients scenarios, Scout outperforms three baseline algorithms and the state-of-the-art dual-feature dimension algorithm Oort in evaluation metrics.\",\"PeriodicalId\":117254,\"journal\":{\"name\":\"2023 IEEE International Conference on Joint Cloud Computing (JCC)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Joint Cloud Computing (JCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JCC59055.2023.00012\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Joint Cloud Computing (JCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCC59055.2023.00012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

联邦学习是一种新型的分布式机器学习范式,它利用众多分散数据源的计算能力来联合训练机器学习模型,同时确保用户隐私。在最常用的跨设备场景中,客户端集群通常覆盖大量异构终端设备。由于带宽等物理限制,每轮培训只能有少数客户参加。客户选择的核心问题是为每一轮培训确定一个合适的客户集。然而,现有的选择算法,尤其是被广泛采用的随机选择算法,存在许多问题,使得它们无法在训练效率和速度之间取得良好的平衡。因此,我们提出Scout,利用客户数据和资源的异质性特征共同建模效用函数,增强客户之间的相关性和所选客户之间的多样性的利用,以达到更好的培训效率和速度。此外,Scout还保持了可扩展性和公平性。我们的实验表明,在大规模异构客户端场景中,Scout在评估指标上优于三种基线算法和最先进的双特征维度算法Oort。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Scout: An Efficient Federated Learning Client Selection Algorithm Driven by Heterogeneous Data and Resource
Federated Learning is a novel distributed machine learning paradigm that leverages the computing power of numerous decentralized data sources for jointly training machine learning models while ensuring user privacy. In the most commonly used cross-device scenarios, the client cluster typically cover a vast number of heterogeneous end devices. Due to physical limitations such as bandwidth, only a few clients can participate in each round of training. The core issue of the client selection is to determine an appropriate client set for each training round. However, existing selection algorithms, especially the widely adopted random selection, suffer from a number of issues that prevent them from achieving a good balance between training efficiency and speed. Therefore, we propose Scout, which utilizes the heterogeneity features of clients’ data and resources to jointly model the utility function, and enhances the utilization of correlation among clients and the diversity among selected clients to achieve better training efficiency and speed. Furthermore, Scout maintains the scalability and fairness. Our experiments demonstrate that in large-scale heterogeneous clients scenarios, Scout outperforms three baseline algorithms and the state-of-the-art dual-feature dimension algorithm Oort in evaluation metrics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信