{"title":"FedDQA: A novel regularization-based deep learning method for data quality assessment in federated learning","authors":"Zongxiang Zhang , Gang Chen , Yunjie Xu , Lihua Huang , Chenghong Zhang , Shuaiyong Xiao","doi":"10.1016/j.dss.2024.114183","DOIUrl":null,"url":null,"abstract":"<div><p>Researchers strive to design artificial intelligence (AI) models that can fully utilize the potentials of data while protecting privacy. Federated learning is a promising solution because it utilizes data but shields them from those who do not own them. However, assessing data quality becomes a challenge in federated learning. We propose a data quality assessment method, Federated Data Quality Assessment (FedDQA), and compare it with traditional federated learning methods. FedDQA identifies low-quality data from participants and reduces their influence on the global model. We integrate data quality regularization strategies at the instance, feature, and participant levels into federate learning model. In various data poisoning settings, FedDQA outperforms existing federated learning methods in prediction performance and the accuracy in detecting low-quality data.</p></div>","PeriodicalId":55181,"journal":{"name":"Decision Support Systems","volume":"180 ","pages":"Article 114183"},"PeriodicalIF":6.8000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Support Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167923624000162","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Researchers strive to design artificial intelligence (AI) models that can fully utilize the potentials of data while protecting privacy. Federated learning is a promising solution because it utilizes data but shields them from those who do not own them. However, assessing data quality becomes a challenge in federated learning. We propose a data quality assessment method, Federated Data Quality Assessment (FedDQA), and compare it with traditional federated learning methods. FedDQA identifies low-quality data from participants and reduces their influence on the global model. We integrate data quality regularization strategies at the instance, feature, and participant levels into federate learning model. In various data poisoning settings, FedDQA outperforms existing federated learning methods in prediction performance and the accuracy in detecting low-quality data.
期刊介绍:
The common thread of articles published in Decision Support Systems is their relevance to theoretical and technical issues in the support of enhanced decision making. The areas addressed may include foundations, functionality, interfaces, implementation, impacts, and evaluation of decision support systems (DSSs).