Topology Design and Graph Embedding for Decentralized Federated Learning

Yubin Duan, Xiuqi Li, Jie Wu
{"title":"Topology Design and Graph Embedding for Decentralized Federated Learning","authors":"Yubin Duan;Xiuqi Li;Jie Wu","doi":"10.23919/ICN.2024.0008","DOIUrl":null,"url":null,"abstract":"Federated learning has been widely employed in many applications to protect the data privacy of participating clients. Although the dataset is decentralized among training devices in federated learning, the model parameters are usually stored in a centralized manner. Centralized federated learning is easy to implement; however, a centralized scheme causes a communication bottleneck at the central server, which may significantly slow down the training process. To improve training efficiency, we investigate the decentralized federated learning scheme. The decentralized scheme has become feasible with the rapid development of device-to-device communication techniques under 5G. Nevertheless, the convergence rate of learning models in the decentralized scheme depends on the network topology design. We propose optimizing the topology design to improve training efficiency for decentralized federated learning, which is a non-trivial problem, especially when considering data heterogeneity. In this paper, we first demonstrate the advantage of hypercube topology and present a hypercube graph construction method to reduce data heterogeneity by carefully selecting neighbors of each training device—a process that resembles classic graph embedding. In addition, we propose a heuristic method for generating torus graphs. Moreover, we have explored the communication patterns in hypercube topology and propose a sequential synchronization scheme to reduce communication cost during training. A batch synchronization scheme is presented to fine-tune the communication pattern for hypercube topology. Experiments on real-world datasets show that our proposed graph construction methods can accelerate the training process, and our sequential synchronization scheme can significantly reduce the overall communication traffic during training.","PeriodicalId":100681,"journal":{"name":"Intelligent and Converged Networks","volume":"5 2","pages":"100-115"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10601665","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent and Converged Networks","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10601665/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Federated learning has been widely employed in many applications to protect the data privacy of participating clients. Although the datasets are decentralized among training devices in federated learning, the model parameters are usually stored in a centralized manner. Centralized federated learning is easy to implement; however, a centralized scheme causes a communication bottleneck at the central server, which may significantly slow down the training process. To improve training efficiency, we investigate the decentralized federated learning scheme, which has become feasible with the rapid development of device-to-device communication techniques under 5G. Nevertheless, the convergence rate of learning models in the decentralized scheme depends on the network topology design. We propose optimizing the topology design to improve the training efficiency of decentralized federated learning, a non-trivial problem, especially when data heterogeneity is considered. In this paper, we first demonstrate the advantages of the hypercube topology and present a hypercube graph construction method that reduces data heterogeneity by carefully selecting the neighbors of each training device, a process that resembles classic graph embedding. In addition, we propose a heuristic method for generating torus graphs. Moreover, we explore the communication patterns in the hypercube topology and propose a sequential synchronization scheme to reduce the communication cost during training. A batch synchronization scheme is also presented to fine-tune the communication pattern for the hypercube topology. Experiments on real-world datasets show that our proposed graph construction methods accelerate the training process and that our sequential synchronization scheme significantly reduces the overall communication traffic during training.
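
The hypercube idea described above places the 2^d training devices on the vertices of a d-dimensional hypercube, so each device exchanges model parameters only with the d neighbors whose binary indices differ from its own in exactly one bit, and devices are assigned to vertices so that each neighborhood mixes heterogeneous local data. The sketch below is a minimal illustration of that placement idea, not the paper's algorithm: the functions `hypercube_neighbors`, `label_histogram`, and `greedy_embedding` are hypothetical helpers, and the greedy farthest-histogram heuristic is only a stand-in for the graph-embedding-style construction the abstract refers to.

```python
# Minimal illustrative sketch (not the paper's algorithm): build a hypercube
# overlay for 2^d devices and greedily assign devices to vertices so that
# neighboring devices have dissimilar label distributions.
import numpy as np


def hypercube_neighbors(vertex: int, dim: int) -> list[int]:
    """Vertices of a d-dimensional hypercube are neighbors iff their
    binary indices differ in exactly one bit."""
    return [vertex ^ (1 << b) for b in range(dim)]


def label_histogram(labels: np.ndarray, num_classes: int) -> np.ndarray:
    """Normalized label distribution of one device's local dataset."""
    hist = np.bincount(labels, minlength=num_classes).astype(float)
    return hist / hist.sum()


def greedy_embedding(device_hists: list[np.ndarray], dim: int) -> list[int]:
    """Hypothetical greedy placement: each vertex receives the remaining
    device whose label histogram is farthest (L1 distance) from the average
    histogram of its already-placed neighbors, so neighborhoods mix
    heterogeneous data."""
    num_vertices = 1 << dim
    assert len(device_hists) == num_vertices
    placement = [-1] * num_vertices          # vertex index -> device index
    unplaced = set(range(num_vertices))
    for v in range(num_vertices):
        placed_nb = [placement[u] for u in hypercube_neighbors(v, dim)
                     if u < v and placement[u] >= 0]
        if not placed_nb:
            dev = min(unplaced)              # arbitrary seed for the first vertex
        else:
            nb_avg = np.mean([device_hists[d] for d in placed_nb], axis=0)
            dev = max(unplaced,
                      key=lambda d: np.abs(device_hists[d] - nb_avg).sum())
        placement[v] = dev
        unplaced.remove(dev)
    return placement


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dim, num_classes = 3, 10                 # 8 devices, 10-class labels
    # Synthetic non-IID local datasets: each device is skewed toward a few classes.
    hists = [label_histogram(
                 rng.choice(num_classes, size=500,
                            p=rng.dirichlet(0.3 * np.ones(num_classes))),
                 num_classes)
             for _ in range(1 << dim)]
    placement = greedy_embedding(hists, dim)
    for v in range(1 << dim):
        print(f"vertex {v:03b} -> device {placement[v]}, "
              f"neighbors {[format(u, '03b') for u in hypercube_neighbors(v, dim)]}")
```

Running the example prints each vertex's assigned device and its d neighbors; in an actual decentralized training round, each device would average model parameters with exactly these neighbors, and the sequential or batch synchronization schemes mentioned above would schedule when each dimension of the hypercube is used for that exchange.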