{"title":"用于减轻联邦学习中数据异构的基于原型的微调","authors":"Liming Chai, Jun Xie, Nanrun Zhou","doi":"10.1016/j.future.2025.107831","DOIUrl":null,"url":null,"abstract":"<div><div>In federated learning with data heterogeneity, the global model often exhibits a severe imbalance in fitting data from different categories, and clients may not be able to obtain useful information from the impaired global model. To address this challenge, Federated Learning Based on Model Repair (FedMR) is proposed to repair the global model by a set of prototypes with minimal divergence. The repair step of FedMR is executed after global aggregation and before local training. Different clients first obtain similar local prototypes on the same feature extractor, and then fine-tune the global classifier with these local prototypes. The repaired classifier is aggregated at the server and broadcast to all clients, enabling them to start local training from a consensus point. This approach effectively mitigates the adverse effects of uneven sample distribution. In most experimental configurations, FedMR outperforms the state-of-the-art federated learning algorithms.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"170 ","pages":"Article 107831"},"PeriodicalIF":6.2000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prototype-based fine-tuning for mitigating data heterogeneity in federated learning\",\"authors\":\"Liming Chai, Jun Xie, Nanrun Zhou\",\"doi\":\"10.1016/j.future.2025.107831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In federated learning with data heterogeneity, the global model often exhibits a severe imbalance in fitting data from different categories, and clients may not be able to obtain useful information from the impaired global model. To address this challenge, Federated Learning Based on Model Repair (FedMR) is proposed to repair the global model by a set of prototypes with minimal divergence. The repair step of FedMR is executed after global aggregation and before local training. Different clients first obtain similar local prototypes on the same feature extractor, and then fine-tune the global classifier with these local prototypes. The repaired classifier is aggregated at the server and broadcast to all clients, enabling them to start local training from a consensus point. This approach effectively mitigates the adverse effects of uneven sample distribution. In most experimental configurations, FedMR outperforms the state-of-the-art federated learning algorithms.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"170 \",\"pages\":\"Article 107831\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X25001268\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25001268","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
摘要
在具有数据异质性的联邦学习中,全局模型在拟合不同类别的数据时往往表现出严重的不平衡,客户端可能无法从受损的全局模型中获得有用的信息。为了解决这一挑战,提出了基于模型修复的联邦学习(federal Learning Based on Model Repair, federmr),通过一组分歧最小的原型来修复全局模型。FedMR的修复步骤在全局聚合之后,局部训练之前执行。不同的客户端首先在相同的特征提取器上获得相似的局部原型,然后利用这些局部原型对全局分类器进行微调。修复后的分类器在服务器上聚合并广播到所有客户端,使它们能够从共识点开始本地训练。这种方法有效地减轻了样本分布不均匀的不利影响。在大多数实验配置中,FedMR优于最先进的联邦学习算法。
Prototype-based fine-tuning for mitigating data heterogeneity in federated learning
In federated learning with data heterogeneity, the global model often exhibits a severe imbalance in fitting data from different categories, and clients may not be able to obtain useful information from the impaired global model. To address this challenge, Federated Learning Based on Model Repair (FedMR) is proposed to repair the global model by a set of prototypes with minimal divergence. The repair step of FedMR is executed after global aggregation and before local training. Different clients first obtain similar local prototypes on the same feature extractor, and then fine-tune the global classifier with these local prototypes. The repaired classifier is aggregated at the server and broadcast to all clients, enabling them to start local training from a consensus point. This approach effectively mitigates the adverse effects of uneven sample distribution. In most experimental configurations, FedMR outperforms the state-of-the-art federated learning algorithms.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.