Communication-Efficient Vertical Federated Learning via Compressed Error Feedback

Impact Factor: 4.6 · CAS Tier 2 (Engineering & Technology) · JCR Q1 (Engineering, Electrical & Electronic)
Pedro Valdeira, João Xavier, Cláudia Soares, Yuejie Chi
{"title":"Communication-Efficient Vertical Federated Learning via Compressed Error Feedback","authors":"Pedro Valdeira;João Xavier;Cláudia Soares;Yuejie Chi","doi":"10.1109/TSP.2025.3540655","DOIUrl":null,"url":null,"abstract":"Communication overhead is a known bottleneck in federated learning (FL). To address this, lossy compression is commonly used on the information communicated between the server and clients during training. In horizontal FL, where each client holds a subset of the samples, such communication-compressed training methods have recently seen significant progress. However, in their vertical FL counterparts, where each client holds a subset of the features, our understanding remains limited. To address this, we propose an error feedback compressed vertical federated learning (<monospace>EF-VFL</monospace>) method to train split neural networks. In contrast to previous communication-compressed methods for vertical FL, <monospace>EF-VFL</monospace> does not require a vanishing compression error for the gradient norm to converge to zero for smooth nonconvex problems. By leveraging error feedback, our method can achieve a <inline-formula><tex-math>$\\mathcal{O}({1}/{T})$</tex-math></inline-formula> convergence rate for a sufficiently large batch size, improving over the state-of-the-art <inline-formula><tex-math>$\\mathcal{O}({1}/{\\sqrt{T}})$</tex-math></inline-formula> rate under <inline-formula><tex-math>$\\mathcal{O}({1}/{\\sqrt{T}})$</tex-math></inline-formula> compression error, and matching the rate of uncompressed methods. Further, when the objective function satisfies the Polyak-Łojasiewicz inequality, our method converges linearly. In addition to improving convergence, our method also supports the use of private labels. Numerical experiments show that <monospace>EF-VFL</monospace> significantly improves over the prior art, confirming our theoretical results.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"73 ","pages":"1065-1080"},"PeriodicalIF":4.6000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10880110/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Citations: 0

Abstract

Communication overhead is a known bottleneck in federated learning (FL). To address this, lossy compression is commonly used on the information communicated between the server and clients during training. In horizontal FL, where each client holds a subset of the samples, such communication-compressed training methods have recently seen significant progress. However, in their vertical FL counterparts, where each client holds a subset of the features, our understanding remains limited. To bridge this gap, we propose an error feedback compressed vertical federated learning (EF-VFL) method to train split neural networks. In contrast to previous communication-compressed methods for vertical FL, EF-VFL does not require a vanishing compression error for the gradient norm to converge to zero for smooth nonconvex problems. By leveraging error feedback, our method can achieve an $\mathcal{O}(1/T)$ convergence rate for a sufficiently large batch size, improving over the state-of-the-art $\mathcal{O}(1/\sqrt{T})$ rate under $\mathcal{O}(1/\sqrt{T})$ compression error, and matching the rate of uncompressed methods. Further, when the objective function satisfies the Polyak-Łojasiewicz inequality, our method converges linearly. In addition to improving convergence, our method also supports the use of private labels. Numerical experiments show that EF-VFL significantly improves over the prior art, confirming our theoretical results.
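To make the error-feedback idea in the abstract concrete, below is a minimal sketch of the generic mechanism in Python. This illustrates error feedback with a top-k sparsifier under our own assumptions, not the paper's EF-VFL algorithm: the names top_k and ErrorFeedbackCompressor are hypothetical, and in EF-VFL the compression is applied to the representations and gradients exchanged while training a split neural network, not to a standalone vector as here.

```python
import numpy as np

def top_k(v: np.ndarray, k: int) -> np.ndarray:
    """Lossy compressor: keep the k largest-magnitude entries, zero the rest."""
    out = np.zeros_like(v)
    idx = np.argpartition(np.abs(v), -k)[-k:]
    out[idx] = v[idx]
    return out

class ErrorFeedbackCompressor:
    """Generic error feedback (illustrative sketch, not the paper's EF-VFL):
    the residual lost to compression is stored and folded into the next
    message, so no information is permanently discarded."""

    def __init__(self, dim: int, k: int):
        self.error = np.zeros(dim)  # accumulated compression residual
        self.k = k

    def compress(self, v: np.ndarray) -> np.ndarray:
        corrected = v + self.error      # fold past residual into current message
        msg = top_k(corrected, self.k)  # transmit only the compressed message
        self.error = corrected - msg    # remember what was lost for next round
        return msg

# Toy usage: compress a stream of gradient-like vectors.
rng = np.random.default_rng(0)
ef = ErrorFeedbackCompressor(dim=8, k=2)
for _ in range(3):
    g = rng.standard_normal(8)
    msg = ef.compress(g)
    print(msg, float(np.linalg.norm(ef.error)))
```

Because the residual is re-injected into later messages, the lost information is eventually delivered; this is the intuition behind the abstract's claim that convergence does not require the per-step compression error to vanish.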
Source journal

IEEE Transactions on Signal Processing (Engineering: Electrical & Electronic)
CiteScore: 11.20
Self-citation rate: 9.30%
Articles per year: 310
Review time: 3.0 months

Journal description: The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.