Data-Aware Gradient Compression for FL in Communication-Constrained Mobile Computing

IF 7.7 · CAS Tier 2, Computer Science · Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Rongwei Lu;Yutong Jiang;Yinan Mao;Chen Tang;Bin Chen;Laizhong Cui;Zhi Wang
DOI: 10.1109/TMC.2024.3504284
Journal: IEEE Transactions on Mobile Computing, vol. 24, no. 4, pp. 2755-2768
Published: 2024-11-21 (Journal Article)
Citations: 0

Abstract

Federated Learning (FL) in mobile environments faces significant communication bottlenecks. Gradient compression has proven to be an effective solution to this issue, offering substantial benefits in environments with limited bandwidth and metered data. Yet it suffers severe performance drops in non-IID environments due to a one-size-fits-all compression approach, which does not account for the varying data volumes across workers. Assigning different compression ratios to workers with distinct data distributions and volumes is therefore a promising solution. This work derives the convergence rate of distributed SGD with non-uniform compression, which reveals the intricate relationship between model convergence and the compression ratios applied to individual workers. Accordingly, we frame the relative compression ratio assignment as an $n$-variable chi-squared nonlinear optimization problem, constrained by a limited communication budget. We propose DAGC-R, which assigns conservative compression to workers handling larger data volumes. Recognizing the computational limitations of mobile devices, we also propose DAGC-A, which is computationally less demanding and enhances the robustness of compression in non-IID scenarios. Our experiments confirm that DAGC-R and DAGC-A can accelerate training by up to 25.43% and 16.65% respectively, compared to uniform compression, when dealing with highly imbalanced data volume distributions and restricted communication.
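To make the core idea concrete, the following is a minimal, hypothetical sketch of non-uniform Top-k gradient sparsification: workers with larger data volumes receive milder compression (a larger keep ratio), while the average ratio across workers stays within a fixed communication budget. The proportional assignment heuristic here is illustrative only and is not the paper's chi-squared optimization (DAGC-R) or its lightweight variant (DAGC-A); all function names are assumptions.

```python
import numpy as np

def top_k_compress(grad, ratio):
    """Top-k sparsification: keep only the largest-magnitude
    `ratio` fraction of gradient entries, zeroing the rest."""
    k = max(1, int(len(grad) * ratio))
    idx = np.argpartition(np.abs(grad), -k)[-k:]
    sparse = np.zeros_like(grad)
    sparse[idx] = grad[idx]
    return sparse

def assign_ratios(data_volumes, budget):
    """Assign per-worker keep ratios proportional to data volume,
    scaled so the mean ratio equals the communication budget.
    (Illustrative heuristic, not DAGC's actual optimization.)"""
    volumes = np.asarray(data_volumes, dtype=float)
    shares = volumes / volumes.sum()          # more data -> milder compression
    ratios = shares * budget * len(volumes)   # mean(ratios) == budget
    return np.clip(ratios, 1e-4, 1.0)

# Example: 4 workers with a skewed data distribution, 10% average budget.
ratios = assign_ratios([1000, 300, 150, 50], budget=0.10)
grad = np.random.randn(10_000)
compressed = top_k_compress(grad, ratios[0])  # largest worker, mildest compression
```

Under uniform compression every worker would use ratio 0.10; here the worker holding most of the data keeps more coordinates, which is the intuition behind assigning conservative compression to data-rich workers.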
Source journal

IEEE Transactions on Mobile Computing (Engineering & Technology – Telecommunications)
CiteScore: 12.90
Self-citation rate: 2.50%
Articles per year: 403
Review time: 6.6 months
Journal description: IEEE Transactions on Mobile Computing addresses key technical issues related to various aspects of mobile computing. This includes (a) architectures, (b) support services, (c) algorithm/protocol design and analysis, (d) mobile environments, (e) mobile communication systems, (f) applications, and (g) emerging technologies. Topics of interest span a wide range, covering aspects like mobile networks and hosts, mobility management, multimedia, operating system support, power management, online and mobile environments, security, scalability, reliability, and emerging technologies such as wearable computers, body area networks, and wireless sensor networks. The journal serves as a comprehensive platform for advancements in mobile computing research.