Tackling Intertwined Data and Device Heterogeneities in Federated Learning with Unlimited Staleness

Haoming Wang, Wei Gao
{"title":"Tackling Intertwined Data and Device Heterogeneities in Federated Learning with Unlimited Staleness.","authors":"Haoming Wang, Wei Gao","doi":"10.1609/aaai.v39i20.35405","DOIUrl":null,"url":null,"abstract":"<p><p>Federated Learning (FL) can be affected by data and device heterogeneities, caused by clients' different local data distributions and latencies in uploading model updates (i.e., staleness). Traditional schemes consider these heterogeneities as two separate and independent aspects, but this assumption is unrealistic in practical FL scenarios where these heterogeneities are intertwined. In these cases, traditional FL schemes are ineffective, and a better approach is to convert a stale model update into a unstale one. In this paper, we present a new FL framework that ensures the accuracy and computational efficiency of this conversion, hence effectively tackling the intertwined heterogeneities that may cause unlimited staleness in model updates. Our basic idea is to estimate the distributions of clients' local training data from their uploaded stale model updates, and use these estimations to compute unstale client model updates. In this way, our approach does not require any auxiliary dataset nor the clients' local models to be fully trained, and does not incur any additional computation or communication overhead at client devices. We compared our approach with the existing FL strategies on mainstream datasets and models, and showed that our approach can improve the trained model accuracy by up to 25% and reduce the number of required training epochs by up to 35%. Source codes can be found at: https://github.com/pittisl/FL-with-intertwined-heterogeneity.</p>","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"39 20","pages":"21080-21089"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12116981/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/aaai.v39i20.35405","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/11 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Federated Learning (FL) can be affected by data and device heterogeneities, caused by clients' different local data distributions and latencies in uploading model updates (i.e., staleness). Traditional schemes treat these heterogeneities as two separate and independent aspects, but this assumption is unrealistic in practical FL scenarios where the two are intertwined. In these cases, traditional FL schemes are ineffective, and a better approach is to convert a stale model update into an unstale one. In this paper, we present a new FL framework that ensures the accuracy and computational efficiency of this conversion, hence effectively tackling the intertwined heterogeneities that may cause unlimited staleness in model updates. Our basic idea is to estimate the distribution of each client's local training data from its uploaded stale model update, and to use these estimates to compute unstale client model updates. In this way, our approach requires neither an auxiliary dataset nor fully trained client local models, and incurs no additional computation or communication overhead at client devices. We compared our approach with existing FL strategies on mainstream datasets and models, and showed that it can improve trained model accuracy by up to 25% and reduce the number of required training epochs by up to 35%. Source code is available at: https://github.com/pittisl/FL-with-intertwined-heterogeneity.
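The abstract describes this conversion only at a high level. As one concrete reading, the Python sketch below illustrates a possible server-side flow under stated assumptions: the helper names (`estimate_label_distribution`, `compute_unstale_update`), the `fc_bias` entry in each update, and the bias-gradient heuristic are all illustrative inventions for this sketch, not the authors' algorithm, which the abstract does not specify.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax."""
    e = np.exp(x - x.max())
    return e / e.sum()

def estimate_label_distribution(stale_update, num_classes):
    """Placeholder estimator. One common heuristic (not necessarily the
    paper's) reads the last-layer bias delta: classes present in the local
    data tend to receive more negative bias gradients."""
    scores = np.maximum(-stale_update["fc_bias"], 0.0)
    total = scores.sum()
    if total == 0.0:  # degenerate update: fall back to a uniform estimate
        return np.full(num_classes, 1.0 / num_classes)
    return scores / total

def compute_unstale_update(current_global, label_dist, lr=0.1):
    """Placeholder conversion. A real system would optimize the *current*
    global model against data or gradients synthesized from the estimated
    distribution; here we only nudge the classifier bias toward the
    estimated label distribution to keep the sketch self-contained."""
    update = {name: np.zeros_like(p) for name, p in current_global.items()}
    update["fc_bias"] = lr * (label_dist - softmax(current_global["fc_bias"]))
    return update

def server_round(current_global, stale_updates, num_classes):
    """Convert every stale client update to an unstale one, then aggregate
    FedAvg-style. All work happens on the server, so clients see no extra
    computation or communication."""
    converted = [
        compute_unstale_update(current_global,
                               estimate_label_distribution(u, num_classes))
        for u in stale_updates
    ]
    return {name: current_global[name] +
                  np.mean([c[name] for c in converted], axis=0)
            for name in current_global}

# Toy usage: one stale report for a 10-class linear classifier head.
num_classes = 10
current_global = {"fc_weight": np.zeros((num_classes, 64)),
                  "fc_bias": np.zeros(num_classes)}
stale_updates = [{"fc_weight": 0.01 * np.random.randn(num_classes, 64),
                  "fc_bias": 0.01 * np.random.randn(num_classes)}]
new_global = server_round(current_global, stale_updates, num_classes)
```

The property this sketch is meant to preserve is that the entire conversion runs on the server, matching the abstract's claim of zero added client-side computation or communication overhead.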
