{"title":"Research and Implementation of Asynchronous Transmission and Update Strategy for Federated Learning","authors":"Yudong Jia, Ningbo Zhang","doi":"10.1109/ICCC56324.2022.10065902","DOIUrl":null,"url":null,"abstract":"In today's era, there is a growing concern about data privacy and security. Data is the foundation of machine learning, and due to the complexity of the reality, data often exists in silos. The great development of computing and storage capacity of edge devices has made it possible to train machine learning models on edge devices with data. Federated learning is a special kind of distributed machine learning in which multiple distributed clients use local data to train a model and upload the resulting model parameters to a central server for aggregation and iteration to finally obtain a global model with good generalization performance. The interaction between the clients and the server is model parameters rather than the data itself, which effectively protects data privacy and security. After research, it is found that Federated Learning is mainly divided into two aggregation models: synchronous aggregation and asynchronous aggregation. In this paper, the classical FedAvg synchronous federated average aggregation algorithm is analyzed. The central server needs to wait for all participating nodes to be trained before aggregating and updating the model, which is inconvenient to achieve global synchronization on resource heterogeneous clients. To improve the utilization of computational resources, FedAsync, an asynchronous federated learning aggregation mechanism with a receivable window based on dual weights, which are data volume size weight and staleness weight, is proposed. And FedAvg and FedAsync are tested and analyzed. Based on this, the impact on federated learning due to asynchronous data arrival rate and limited communication resources is further explored.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
Data privacy and security are growing concerns. Data is the foundation of machine learning, yet in practice it often exists in isolated silos. Advances in the computing and storage capacity of edge devices have made it possible to train machine learning models directly on the devices that hold the data. Federated learning is a special kind of distributed machine learning in which multiple distributed clients train a model on local data and upload the resulting model parameters to a central server, which aggregates them and iterates until a global model with good generalization performance is obtained. The clients and the server exchange model parameters rather than the data itself, which effectively protects data privacy and security. Federated learning aggregation schemes fall mainly into two categories: synchronous aggregation and asynchronous aggregation. This paper analyzes the classical synchronous federated averaging algorithm, FedAvg, in which the central server must wait for all participating nodes to finish training before it aggregates and updates the model; this makes global synchronization difficult to achieve across clients with heterogeneous resources. To improve the utilization of computational resources, we propose FedAsync, an asynchronous federated learning aggregation mechanism with a receivable window based on dual weights: a data-volume weight and a staleness weight. FedAvg and FedAsync are tested and compared experimentally. On this basis, the impact of asynchronous data arrival rates and limited communication resources on federated learning is further explored.
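To make the synchronous baseline concrete, below is a minimal sketch of FedAvg-style aggregation in Python, assuming each client update is a NumPy array and that clients are weighted by data volume as the abstract describes; the function and variable names are illustrative, not the paper's.

```python
import numpy as np

def fedavg_aggregate(client_weights, client_sizes):
    """One synchronous FedAvg round: average the client models,
    weighting each client by its share of the total data volume."""
    total = sum(client_sizes)
    return sum((n / total) * w for w, n in zip(client_weights, client_sizes))

# Example: two clients; the client holding more data dominates the average.
global_model = fedavg_aggregate(
    [np.array([1.0, 2.0]), np.array([3.0, 4.0])],
    [100, 300],
)
```

Note that the server blocks until every participating client's update has arrived, which is exactly the bottleneck on resource-heterogeneous clients that motivates the asynchronous design.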
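The asynchronous mechanism can be sketched in the same style. The following is one plausible reading of the dual-weight update with a receivable window: the polynomial staleness decay, the base mixing rate base_alpha, and the window of 5 rounds are all assumptions, since the abstract does not specify the exact functional forms.

```python
import numpy as np

def staleness_weight(staleness, a=0.5):
    # Assumed polynomial decay: fresher updates get a weight closer to 1.
    return (1.0 + staleness) ** (-a)

def fedasync_update(global_w, client_w, client_size, total_size,
                    server_round, client_round,
                    base_alpha=0.6, window=5):
    """Asynchronous update with dual weights (sketch): a data-volume
    weight and a staleness weight jointly scale how far the global
    model moves toward the newly arrived client model."""
    staleness = server_round - client_round
    if staleness > window:
        return global_w  # outside the receivable window: drop the stale update
    alpha = base_alpha * staleness_weight(staleness) * (client_size / total_size)
    return (1.0 - alpha) * global_w + alpha * client_w
```

Because the server applies each update as it arrives instead of waiting for stragglers, fast clients' compute is never idle, while the staleness weight and the receivable window limit how much a badly outdated update can perturb the global model.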