{"title":"HDP-FedCD: Data-quality-driven hierarchical federated learning for optimizing privacy protection in non-IID data","authors":"Chunxiao Yin , Kai He , Jiaoli Shi","doi":"10.1016/j.future.2025.108140","DOIUrl":null,"url":null,"abstract":"<div><div>With the proliferation of Internet of Things (IoT) devices, Federated Learning (FL) has become a key paradigm for collaborative machine learning on decentralized edge data. However, FL remains vulnerable to inference attacks, posing significant privacy concerns, particularly in scenarios with diverse data quality and distribution. Existing privacy protection methods often neglect such heterogeneity, resulting in suboptimal trade-offs between privacy and performance. We propose a Hierarchical Differential Privacy protection scheme in Federated Learning based on Core-Degree (HDP-FedCD), which achieves an optimal balance between privacy and utility by leveraging core-degree as a measure of data quality to dynamically adjust noise levels. Using an adaptive core-degree threshold, HDP-FedCD layers local datasets into core and non-core layers, tailoring noise intensity to data quality: low-intensity noise preserves utility for core-layer data, while high-intensity noise enhances privacy for non-core-layer data. Furthermore, the adaptive threshold mechanism responds to dynamic data distribution changes, ensuring robustness across diverse FL scenarios. Empirical evaluations on image classification tasks demonstrate that HDP-FedCD outperforms state-of-the-art methods in model accuracy and resistance to inference attacks, offering an innovative solution for privacy-preserving federated learning.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"176 ","pages":"Article 108140"},"PeriodicalIF":6.2000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25004340","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
With the proliferation of Internet of Things (IoT) devices, Federated Learning (FL) has become a key paradigm for collaborative machine learning on decentralized edge data. However, FL remains vulnerable to inference attacks, posing significant privacy concerns, particularly in scenarios with diverse data quality and distribution. Existing privacy protection methods often neglect such heterogeneity, resulting in suboptimal trade-offs between privacy and performance. We propose a Hierarchical Differential Privacy protection scheme in Federated Learning based on Core-Degree (HDP-FedCD), which achieves an optimal balance between privacy and utility by leveraging core-degree as a measure of data quality to dynamically adjust noise levels. Using an adaptive core-degree threshold, HDP-FedCD layers local datasets into core and non-core layers, tailoring noise intensity to data quality: low-intensity noise preserves utility for core-layer data, while high-intensity noise enhances privacy for non-core-layer data. Furthermore, the adaptive threshold mechanism responds to dynamic data distribution changes, ensuring robustness across diverse FL scenarios. Empirical evaluations on image classification tasks demonstrate that HDP-FedCD outperforms state-of-the-art methods in model accuracy and resistance to inference attacks, offering an innovative solution for privacy-preserving federated learning.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.