Federated Fine-Tuning on Heterogeneous LoRAs With Error-Compensated Aggregation.

IF 10.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Wanyi Ning,Jingyu Wang,Qi Qi,Haifeng Sun,Daixuan Cheng,Cong Liu,Lei Zhang,Zirui Zhuang,Jianxin Liao
{"title":"Federated Fine-Tuning on Heterogeneous LoRAs With Error-Compensated Aggregation.","authors":"Wanyi Ning,Jingyu Wang,Qi Qi,Haifeng Sun,Daixuan Cheng,Cong Liu,Lei Zhang,Zirui Zhuang,Jianxin Liao","doi":"10.1109/tnnls.2025.3586545","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) has recently been applied to the parameter-efficient fine-tuning (PEFT) of large language models (LLMs). While promising, client resource heterogeneity has imposed the challenge of the \"bucket effect\" to FL, where model configuration must cater to the client with the fewest resources. To tackle this issue, heterogeneous low-rank adaptation (LoRA) has recently emerged in FL, which enables clients to do local fine-tuning with different LoRA ranks. However, existing works in this area typically adopt zero-padding, stacking, or singular value decomposition (SVD) for LoRA aggregation, which often incur precision loss or significant overhead, limiting their practicality. In this article, we propose ECLoRA, a novel method for federated fine-tuning with heterogeneous LoRA settings across clients. ECLoRA employs randomized SVD (RSVD) to dramatically reduce aggregation overhead while introducing an error compensation (EC) mechanism that incorporates the decomposition error from previous rounds to improve aggregation precision. Extensive experiments on four widely used foundation models across six public tasks demonstrate the effectiveness of ECLoRA. Specifically, ECLoRA is: (1) accurate, significantly improving the final model performance; (2) fast, accelerating convergence with an average speedup of $1.54\\times $ to $3.01\\times $ ; and (3) practical, reducing aggregation time by approximately $40\\times $ compared to classical SVD.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"109 1","pages":""},"PeriodicalIF":10.2000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/tnnls.2025.3586545","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Federated learning (FL) has recently been applied to the parameter-efficient fine-tuning (PEFT) of large language models (LLMs). While promising, client resource heterogeneity has imposed the challenge of the "bucket effect" to FL, where model configuration must cater to the client with the fewest resources. To tackle this issue, heterogeneous low-rank adaptation (LoRA) has recently emerged in FL, which enables clients to do local fine-tuning with different LoRA ranks. However, existing works in this area typically adopt zero-padding, stacking, or singular value decomposition (SVD) for LoRA aggregation, which often incur precision loss or significant overhead, limiting their practicality. In this article, we propose ECLoRA, a novel method for federated fine-tuning with heterogeneous LoRA settings across clients. ECLoRA employs randomized SVD (RSVD) to dramatically reduce aggregation overhead while introducing an error compensation (EC) mechanism that incorporates the decomposition error from previous rounds to improve aggregation precision. Extensive experiments on four widely used foundation models across six public tasks demonstrate the effectiveness of ECLoRA. Specifically, ECLoRA is: (1) accurate, significantly improving the final model performance; (2) fast, accelerating convergence with an average speedup of $1.54\times $ to $3.01\times $ ; and (3) practical, reducing aggregation time by approximately $40\times $ compared to classical SVD.
基于误差补偿聚合的异构LoRAs的联邦微调。
联邦学习(FL)最近被应用于大型语言模型(llm)的参数有效微调(PEFT)。虽然前景看好,但客户端资源的异质性给FL带来了“桶效应”的挑战,即模型配置必须以最少的资源来满足客户端。为了解决这个问题,最近在FL中出现了异构低级别适应(LoRA),它使客户端能够使用不同的LoRA级别进行本地微调。然而,该领域的现有工作通常采用零填充、堆叠或奇异值分解(SVD)进行LoRA聚合,这通常会导致精度损失或显著的开销,限制了它们的实用性。在本文中,我们提出了ECLoRA,这是一种用于跨客户机使用异构LoRA设置进行联邦微调的新方法。ECLoRA采用随机SVD (RSVD)来显著降低聚合开销,同时引入了误差补偿(EC)机制,该机制结合了前几轮的分解误差,以提高聚合精度。在6个公共任务的4个广泛使用的基础模型上进行的大量实验证明了ECLoRA的有效性。具体而言,ECLoRA具有:(1)准确性,显著提高了最终模型的性能;(2)快速,加速收敛,平均速度从1.54倍提高到3.01倍;(3)实用性,与经典SVD相比,聚合时间缩短了约40倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE transactions on neural networks and learning systems
IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
CiteScore
23.80
自引率
9.60%
发文量
2102
审稿时长
3-8 weeks
期刊介绍: The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信