Chin-Feng Lai , Ying-Hsun Lai , Ming-Chin Kao , Mu-Yen Chen
{"title":"Enhancing global model accuracy in federated learning with deep neuro-fuzzy clustering cyclic algorithm","authors":"Chin-Feng Lai , Ying-Hsun Lai , Ming-Chin Kao , Mu-Yen Chen","doi":"10.1016/j.aej.2024.10.093","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, with the increasing importance of privacy protection, many laws and regulations have standardized data usage, requiring companies to obtain user consent to access personal data. This has become more challenging for models that require large amounts of data for training. Therefore, the concept of federated learning was proposed in 2016, aiming to train models with different clients without sharing data to ensure data privacy. However, federated learning faces several challenges, including heterogeneous devices, data security, data heterogeneity, communication costs, and training time costs. This study focuses on addressing the issue of data heterogeneity, where the data distribution among participating clients differs significantly, leading to poor performance of the aggregated model after training. To tackle this problem, we propose a federated clustering cyclic algorithm, which involves two-step clustering of clients to make the data distribution of each cluster approach independent and identically distributed. We also introduce deep neural fuzzy methods to handle fuzzy, uncertain, or incomplete data. According to experimental results, the proposed deep neuro-fuzzy clustered cyclic algorithm outperforms methods such as FedAvg, FedProx, and CyclicFL on various non-IID datasets, with accuracy approaching that of centralized learning in certain experiments. This indicates that the deep neural fuzzy methods and clustering cyclic algorithm DNCC presented in this study can improve the accuracy of global models, especially in increasingly non-IID scenarios. Furthermore, we extend this method to big data processing to cope with more complex data environments.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"112 ","pages":"Pages 474-486"},"PeriodicalIF":6.2000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016824012584","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, with the increasing importance of privacy protection, many laws and regulations have standardized data usage, requiring companies to obtain user consent to access personal data. This has become more challenging for models that require large amounts of data for training. Therefore, the concept of federated learning was proposed in 2016, aiming to train models with different clients without sharing data to ensure data privacy. However, federated learning faces several challenges, including heterogeneous devices, data security, data heterogeneity, communication costs, and training time costs. This study focuses on addressing the issue of data heterogeneity, where the data distribution among participating clients differs significantly, leading to poor performance of the aggregated model after training. To tackle this problem, we propose a federated clustering cyclic algorithm, which involves two-step clustering of clients to make the data distribution of each cluster approach independent and identically distributed. We also introduce deep neural fuzzy methods to handle fuzzy, uncertain, or incomplete data. According to experimental results, the proposed deep neuro-fuzzy clustered cyclic algorithm outperforms methods such as FedAvg, FedProx, and CyclicFL on various non-IID datasets, with accuracy approaching that of centralized learning in certain experiments. This indicates that the deep neural fuzzy methods and clustering cyclic algorithm DNCC presented in this study can improve the accuracy of global models, especially in increasingly non-IID scenarios. Furthermore, we extend this method to big data processing to cope with more complex data environments.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering