An Efficient Federated Learning Framework for Enhancing Data Diversity and Communication in Industrial IoT Fault Diagnosis

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2025-06-25 DOI:10.1109/JIOT.2025.3583081

Xuehua Sun;Zengsen Yuan;Xianguang Kong;Liang Xue;Han Cheng;Zhong Chen

{"title":"An Efficient Federated Learning Framework for Enhancing Data Diversity and Communication in Industrial IoT Fault Diagnosis","authors":"Xuehua Sun;Zengsen Yuan;Xianguang Kong;Liang Xue;Han Cheng;Zhong Chen","doi":"10.1109/JIOT.2025.3583081","DOIUrl":null,"url":null,"abstract":"In the field of smart manufacturing, the widespread application of Industrial Internet of Things (IIoT) has prompted various data silos to generate and accumulate huge amounts of industrial data. To ensure the security of multiparty data, the industry has introduced federated learning (FL) technology to realize the circulation of data value by training machine learning models across silos. However, the nonindependent and identically distributed (Non-IID) nature of data distribution among different silos in real industrial environments, which results in degradation of model performance and increase in communication overheads, has become a key challenge that needs to be addressed. In this article, we propose a sparse quantization compression solution based on FL multiagent generative adversarial network (MAD-GAN) named SQC-FLMGAN, aiming to alleviate the problem of Non-IID data across silos in industrial environments and to improve communication efficiency. The scheme generates data samples with diversity and high quality for each client by training MAD-GAN models in an FL environment. This process effectively mitigates the modal collapse problem and improves the generalization ability and diagnostic accuracy of the model. In order to further reduce communication overhead, we have introduced an advanced model compression technique that combines Top-k sparsification and quantization methods. The Top-k algorithm selects crucial model parameters for transmission, while quantization further reduces their precision, eliminating redundancy in communication reducing the amount of data transmitted. Through experiments on the bearing dataset in industrial scenarios, we have demonstrated that the SQC-FLMGAN scheme significantly reduces communication overhead while maintaining model performance. The code framework is available at <uri>https://github.com/sqcflmgan/SQC-FLMGAN</uri>.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 17","pages":"36562-36576"},"PeriodicalIF":8.9000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11050896/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In the field of smart manufacturing, the widespread application of Industrial Internet of Things (IIoT) has prompted various data silos to generate and accumulate huge amounts of industrial data. To ensure the security of multiparty data, the industry has introduced federated learning (FL) technology to realize the circulation of data value by training machine learning models across silos. However, the nonindependent and identically distributed (Non-IID) nature of data distribution among different silos in real industrial environments, which results in degradation of model performance and increase in communication overheads, has become a key challenge that needs to be addressed. In this article, we propose a sparse quantization compression solution based on FL multiagent generative adversarial network (MAD-GAN) named SQC-FLMGAN, aiming to alleviate the problem of Non-IID data across silos in industrial environments and to improve communication efficiency. The scheme generates data samples with diversity and high quality for each client by training MAD-GAN models in an FL environment. This process effectively mitigates the modal collapse problem and improves the generalization ability and diagnostic accuracy of the model. In order to further reduce communication overhead, we have introduced an advanced model compression technique that combines Top-k sparsification and quantization methods. The Top-k algorithm selects crucial model parameters for transmission, while quantization further reduces their precision, eliminating redundancy in communication reducing the amount of data transmitted. Through experiments on the bearing dataset in industrial scenarios, we have demonstrated that the SQC-FLMGAN scheme significantly reduces communication overhead while maintaining model performance. The code framework is available at https://github.com/sqcflmgan/SQC-FLMGAN.

查看原文本刊更多论文

工业物联网故障诊断中增强数据多样性和通信的高效联邦学习框架

在智能制造领域，工业物联网（IIoT）的广泛应用促使各种数据孤岛产生并积累了海量的工业数据。为了确保多方数据的安全性，业界引入了联邦学习（FL）技术，通过跨孤岛训练机器学习模型来实现数据价值的流通。然而，在实际工业环境中，数据分布在不同筒仓之间的非独立和同分布（Non-IID）性质会导致模型性能下降和通信开销增加，这已成为需要解决的关键挑战。本文提出了一种基于FL多智能体生成对抗网络（MAD-GAN）的稀疏量化压缩方案SQC-FLMGAN，旨在缓解工业环境中非iid数据跨筒仓的问题，提高通信效率。该方案通过在FL环境中训练MAD-GAN模型，为每个客户端生成具有多样性和高质量的数据样本。该方法有效地缓解了模态崩溃问题，提高了模型的泛化能力和诊断精度。为了进一步减少通信开销，我们引入了一种先进的模型压缩技术，它结合了Top-k稀疏化和量化方法。Top-k算法选择关键的模型参数进行传输，量化进一步降低了它们的精度，消除了通信中的冗余，减少了传输的数据量。通过工业场景的轴承数据集实验，我们证明了SQC-FLMGAN方案在保持模型性能的同时显着降低了通信开销。代码框架可从https://github.com/sqcflmgan/SQC-FLMGAN获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.