Yiwen Wang , Jiaxin Li , Heye Zhang , Jingfeng Zhang , Feng Wan , Anqi Qiu , Zhifan Gao
{"title":"FedMDD: Multi-deliberation based calibration for federated long-tailed learning","authors":"Yiwen Wang , Jiaxin Li , Heye Zhang , Jingfeng Zhang , Feng Wan , Anqi Qiu , Zhifan Gao","doi":"10.1016/j.knosys.2025.113741","DOIUrl":null,"url":null,"abstract":"<div><div>Federated learning is a decentralized framework enabling collaborative training of machine learning models across distributed data clients while ensuring privacy protection. Despite its advantages, traditional federated learning faces the global long-tailed imbalance, leading to poor performance by overemphasizing head classes and under-representing tail classes. While planned (pre-hoc) and post-hoc imbalance adjustments have been explored, post-hoc methods often require auxiliary data or suffer from overconfident decision boundaries, which limits their effectiveness. To address the overconfidence and out-of-distribution in existing solutions, we propose a multi-deliberation based post-hoc calibration method (FedMDD) tailored for the federated long-tailed problem. FedMDD calibrates the global decision boundary for balance. It incorporates a local–global feature contrast constraint to generate effective features and uses consistency across client models to deliberate a model-aware margin. This margin promotes a large relative distance between tail classes and the decision boundary, preserving privacy by leveraging model performance without requiring access to local class distributions. Extensive experiments demonstrate that FedMDD outperforms existing methods in balancing decision boundaries and enhancing privacy protection, achieving superior performance on long-tailed data distributions.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"323 ","pages":"Article 113741"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125007877","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Federated learning is a decentralized framework enabling collaborative training of machine learning models across distributed data clients while ensuring privacy protection. Despite its advantages, traditional federated learning faces the global long-tailed imbalance, leading to poor performance by overemphasizing head classes and under-representing tail classes. While planned (pre-hoc) and post-hoc imbalance adjustments have been explored, post-hoc methods often require auxiliary data or suffer from overconfident decision boundaries, which limits their effectiveness. To address the overconfidence and out-of-distribution in existing solutions, we propose a multi-deliberation based post-hoc calibration method (FedMDD) tailored for the federated long-tailed problem. FedMDD calibrates the global decision boundary for balance. It incorporates a local–global feature contrast constraint to generate effective features and uses consistency across client models to deliberate a model-aware margin. This margin promotes a large relative distance between tail classes and the decision boundary, preserving privacy by leveraging model performance without requiring access to local class distributions. Extensive experiments demonstrate that FedMDD outperforms existing methods in balancing decision boundaries and enhancing privacy protection, achieving superior performance on long-tailed data distributions.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.