{"title":"An Efficient Federated Learning Framework for Enhancing Data Diversity and Communication in Industrial IoT Fault Diagnosis","authors":"Xuehua Sun;Zengsen Yuan;Xianguang Kong;Liang Xue;Han Cheng;Zhong Chen","doi":"10.1109/JIOT.2025.3583081","DOIUrl":null,"url":null,"abstract":"In the field of smart manufacturing, the widespread application of Industrial Internet of Things (IIoT) has prompted various data silos to generate and accumulate huge amounts of industrial data. To ensure the security of multiparty data, the industry has introduced federated learning (FL) technology to realize the circulation of data value by training machine learning models across silos. However, the nonindependent and identically distributed (Non-IID) nature of data distribution among different silos in real industrial environments, which results in degradation of model performance and increase in communication overheads, has become a key challenge that needs to be addressed. In this article, we propose a sparse quantization compression solution based on FL multiagent generative adversarial network (MAD-GAN) named SQC-FLMGAN, aiming to alleviate the problem of Non-IID data across silos in industrial environments and to improve communication efficiency. The scheme generates data samples with diversity and high quality for each client by training MAD-GAN models in an FL environment. This process effectively mitigates the modal collapse problem and improves the generalization ability and diagnostic accuracy of the model. In order to further reduce communication overhead, we have introduced an advanced model compression technique that combines Top-k sparsification and quantization methods. The Top-k algorithm selects crucial model parameters for transmission, while quantization further reduces their precision, eliminating redundancy in communication reducing the amount of data transmitted. Through experiments on the bearing dataset in industrial scenarios, we have demonstrated that the SQC-FLMGAN scheme significantly reduces communication overhead while maintaining model performance. The code framework is available at <uri>https://github.com/sqcflmgan/SQC-FLMGAN</uri>.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 17","pages":"36562-36576"},"PeriodicalIF":8.9000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11050896/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of smart manufacturing, the widespread application of Industrial Internet of Things (IIoT) has prompted various data silos to generate and accumulate huge amounts of industrial data. To ensure the security of multiparty data, the industry has introduced federated learning (FL) technology to realize the circulation of data value by training machine learning models across silos. However, the nonindependent and identically distributed (Non-IID) nature of data distribution among different silos in real industrial environments, which results in degradation of model performance and increase in communication overheads, has become a key challenge that needs to be addressed. In this article, we propose a sparse quantization compression solution based on FL multiagent generative adversarial network (MAD-GAN) named SQC-FLMGAN, aiming to alleviate the problem of Non-IID data across silos in industrial environments and to improve communication efficiency. The scheme generates data samples with diversity and high quality for each client by training MAD-GAN models in an FL environment. This process effectively mitigates the modal collapse problem and improves the generalization ability and diagnostic accuracy of the model. In order to further reduce communication overhead, we have introduced an advanced model compression technique that combines Top-k sparsification and quantization methods. The Top-k algorithm selects crucial model parameters for transmission, while quantization further reduces their precision, eliminating redundancy in communication reducing the amount of data transmitted. Through experiments on the bearing dataset in industrial scenarios, we have demonstrated that the SQC-FLMGAN scheme significantly reduces communication overhead while maintaining model performance. The code framework is available at https://github.com/sqcflmgan/SQC-FLMGAN.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.