Jiaqi Shi , Xiaoling Huang , Xianjie Guo , Kui Yu , Chengxiang Hu , Peng Zhou
{"title":"Federated causal structure learning with missing data","authors":"Jiaqi Shi , Xiaoling Huang , Xianjie Guo , Kui Yu , Chengxiang Hu , Peng Zhou","doi":"10.1016/j.knosys.2025.114601","DOIUrl":null,"url":null,"abstract":"<div><div>Federated causal structure learning (CSL) is an emerging research direction that aims to discover causal relationships from decentralized data across multiple clients, while preserving data privacy. Existing federated CSL algorithms primarily focus on complete datasets and often overlook data-quality issues, such as missing data, which are common in real-world scenarios. Moreover, client diversity can destabilize federated CSL, and this challenge is further worsened by missing data. To address these issues, we propose FedImpCSL, a novel federated CSL method, for effectively handling missing data. Our approach consists of two key components: (1) a local-to-global missing data imputation strategy that reconstructs imputed and accurate datasets from missing samples, and (2) a dynamic client weighting and weighted aggregation strategy to address inter-client differences, enhancing the CSL accuracy without utilizing each client’s original data. We demonstrate the effectiveness of FedImpCSL through comprehensive experiments on various types of datasets, showing its superior performance over existing federated CSL methods in handling missing data scenarios.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114601"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125016405","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Federated causal structure learning (CSL) is an emerging research direction that aims to discover causal relationships from decentralized data across multiple clients, while preserving data privacy. Existing federated CSL algorithms primarily focus on complete datasets and often overlook data-quality issues, such as missing data, which are common in real-world scenarios. Moreover, client diversity can destabilize federated CSL, and this challenge is further worsened by missing data. To address these issues, we propose FedImpCSL, a novel federated CSL method, for effectively handling missing data. Our approach consists of two key components: (1) a local-to-global missing data imputation strategy that reconstructs imputed and accurate datasets from missing samples, and (2) a dynamic client weighting and weighted aggregation strategy to address inter-client differences, enhancing the CSL accuracy without utilizing each client’s original data. We demonstrate the effectiveness of FedImpCSL through comprehensive experiments on various types of datasets, showing its superior performance over existing federated CSL methods in handling missing data scenarios.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.