FedMDD: Multi-deliberation based calibration for federated long-tailed learning

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-05-26 DOI:10.1016/j.knosys.2025.113741

Yiwen Wang , Jiaxin Li , Heye Zhang , Jingfeng Zhang , Feng Wan , Anqi Qiu , Zhifan Gao

{"title":"FedMDD: Multi-deliberation based calibration for federated long-tailed learning","authors":"Yiwen Wang , Jiaxin Li , Heye Zhang , Jingfeng Zhang , Feng Wan , Anqi Qiu , Zhifan Gao","doi":"10.1016/j.knosys.2025.113741","DOIUrl":null,"url":null,"abstract":"<div><div>Federated learning is a decentralized framework enabling collaborative training of machine learning models across distributed data clients while ensuring privacy protection. Despite its advantages, traditional federated learning faces the global long-tailed imbalance, leading to poor performance by overemphasizing head classes and under-representing tail classes. While planned (pre-hoc) and post-hoc imbalance adjustments have been explored, post-hoc methods often require auxiliary data or suffer from overconfident decision boundaries, which limits their effectiveness. To address the overconfidence and out-of-distribution in existing solutions, we propose a multi-deliberation based post-hoc calibration method (FedMDD) tailored for the federated long-tailed problem. FedMDD calibrates the global decision boundary for balance. It incorporates a local–global feature contrast constraint to generate effective features and uses consistency across client models to deliberate a model-aware margin. This margin promotes a large relative distance between tail classes and the decision boundary, preserving privacy by leveraging model performance without requiring access to local class distributions. Extensive experiments demonstrate that FedMDD outperforms existing methods in balancing decision boundaries and enhancing privacy protection, achieving superior performance on long-tailed data distributions.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"323 ","pages":"Article 113741"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125007877","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Federated learning is a decentralized framework enabling collaborative training of machine learning models across distributed data clients while ensuring privacy protection. Despite its advantages, traditional federated learning faces the global long-tailed imbalance, leading to poor performance by overemphasizing head classes and under-representing tail classes. While planned (pre-hoc) and post-hoc imbalance adjustments have been explored, post-hoc methods often require auxiliary data or suffer from overconfident decision boundaries, which limits their effectiveness. To address the overconfidence and out-of-distribution in existing solutions, we propose a multi-deliberation based post-hoc calibration method (FedMDD) tailored for the federated long-tailed problem. FedMDD calibrates the global decision boundary for balance. It incorporates a local–global feature contrast constraint to generate effective features and uses consistency across client models to deliberate a model-aware margin. This margin promotes a large relative distance between tail classes and the decision boundary, preserving privacy by leveraging model performance without requiring access to local class distributions. Extensive experiments demonstrate that FedMDD outperforms existing methods in balancing decision boundaries and enhancing privacy protection, achieving superior performance on long-tailed data distributions.

查看原文本刊更多论文

FedMDD：基于多考虑的联邦长尾学习校准

联邦学习是一个去中心化的框架，支持跨分布式数据客户端协作训练机器学习模型，同时确保隐私保护。传统的联邦学习虽然具有诸多优势，但由于过度强调头类，对尾类的代表性不足，导致学习效果不佳。虽然已经探索了计划的（事前的）和事后的不平衡调整，但事后的方法通常需要辅助数据或遭受过度自信的决策边界，这限制了它们的有效性。为了解决现有解决方案中的过度置信度和分布外问题，我们提出了一种针对联邦长尾问题的基于多审议的事后校准方法（FedMDD）。FedMDD为平衡校准全局决策边界。它结合了局部-全局特征对比约束来生成有效的特征，并使用跨客户端模型的一致性来考虑模型感知边界。这个余量促进了尾类和决策边界之间的较大相对距离，通过利用模型性能而不需要访问局部类分布来保护隐私。大量实验表明，FedMDD在平衡决策边界和增强隐私保护方面优于现有方法，在长尾数据分布上取得了优异的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.