Communication-Efficient Vertical Federated Learning via Compressed Error Feedback

IF 4.6 2区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Signal Processing Pub Date : 2025-02-11 DOI:10.1109/TSP.2025.3540655

Pedro Valdeira;João Xavier;Cláudia Soares;Yuejie Chi

{"title":"Communication-Efficient Vertical Federated Learning via Compressed Error Feedback","authors":"Pedro Valdeira;João Xavier;Cláudia Soares;Yuejie Chi","doi":"10.1109/TSP.2025.3540655","DOIUrl":null,"url":null,"abstract":"Communication overhead is a known bottleneck in federated learning (FL). To address this, lossy compression is commonly used on the information communicated between the server and clients during training. In horizontal FL, where each client holds a subset of the samples, such communication-compressed training methods have recently seen significant progress. However, in their vertical FL counterparts, where each client holds a subset of the features, our understanding remains limited. To address this, we propose an error feedback compressed vertical federated learning (<monospace>EF-VFL</monospace>) method to train split neural networks. In contrast to previous communication-compressed methods for vertical FL, <monospace>EF-VFL</monospace> does not require a vanishing compression error for the gradient norm to converge to zero for smooth nonconvex problems. By leveraging error feedback, our method can achieve a <inline-formula><tex-math>$\\mathcal{O}({1}/{T})$</tex-math></inline-formula> convergence rate for a sufficiently large batch size, improving over the state-of-the-art <inline-formula><tex-math>$\\mathcal{O}({1}/{\\sqrt{T}})$</tex-math></inline-formula> rate under <inline-formula><tex-math>$\\mathcal{O}({1}/{\\sqrt{T}})$</tex-math></inline-formula> compression error, and matching the rate of uncompressed methods. Further, when the objective function satisfies the Polyak-Łojasiewicz inequality, our method converges linearly. In addition to improving convergence, our method also supports the use of private labels. Numerical experiments show that <monospace>EF-VFL</monospace> significantly improves over the prior art, confirming our theoretical results.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"73 ","pages":"1065-1080"},"PeriodicalIF":4.6000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10880110/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Communication overhead is a known bottleneck in federated learning (FL). To address this, lossy compression is commonly used on the information communicated between the server and clients during training. In horizontal FL, where each client holds a subset of the samples, such communication-compressed training methods have recently seen significant progress. However, in their vertical FL counterparts, where each client holds a subset of the features, our understanding remains limited. To address this, we propose an error feedback compressed vertical federated learning (EF-VFL) method to train split neural networks. In contrast to previous communication-compressed methods for vertical FL, EF-VFL does not require a vanishing compression error for the gradient norm to converge to zero for smooth nonconvex problems. By leveraging error feedback, our method can achieve a

$\mathcal{O}({1}/{T})$

convergence rate for a sufficiently large batch size, improving over the state-of-the-art

$\mathcal{O}({1}/{\sqrt{T}})$

rate under

$\mathcal{O}({1}/{\sqrt{T}})$

compression error, and matching the rate of uncompressed methods. Further, when the objective function satisfies the Polyak-Łojasiewicz inequality, our method converges linearly. In addition to improving convergence, our method also supports the use of private labels. Numerical experiments show that EF-VFL significantly improves over the prior art, confirming our theoretical results.

查看原文本刊更多论文

基于压缩误差反馈的高效通信垂直联邦学习

通信开销是联邦学习（FL）中一个已知的瓶颈。为了解决这个问题，在训练期间，通常对服务器和客户端之间通信的信息使用有损压缩。在水平FL中，每个客户端持有样本的子集，这种通信压缩训练方法最近取得了重大进展。然而，在他们的垂直FL对应物中，每个客户都拥有特征的子集，我们的理解仍然有限。为了解决这个问题，我们提出了一种误差反馈压缩垂直联邦学习（EF-VFL）方法来训练分裂神经网络。相对于以前的垂直FL的通信压缩方法，EF-VFL不需要一个消失的压缩误差梯度范数收敛到零的光滑非凸问题。通过利用错误反馈，我们的方法可以在足够大的批大小下实现$\mathcal{O}({1}/{T})$的收敛率，提高了最先进的$\mathcal{O}({1}/{\sqrt{T}})$压缩错误下的$\mathcal{O}({1}/{\sqrt{T}})$的收敛率，并与未压缩方法的速率相匹配。进一步，当目标函数满足Polyak-Łojasiewicz不等式时，我们的方法线性收敛。除了提高收敛性外，我们的方法还支持使用私有标签。数值实验表明，与现有技术相比，EF-VFL有了明显的改善，证实了我们的理论结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Signal Processing 工程技术-工程：电子与电气

CiteScore

11.20

自引率

9.30%

发文量

310

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.