工业大规模诊断模型与轻量级定制部署分布式多个非iid诊断任务

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2025-06-03 DOI:10.1109/JSEN.2025.3574226

Xinming Jia;Na Qin;Deqing Huang;Jiahao Du;Tianwei Wang

{"title":"工业大规模诊断模型与轻量级定制部署分布式多个非iid诊断任务","authors":"Xinming Jia;Na Qin;Deqing Huang;Jiahao Du;Tianwei Wang","doi":"10.1109/JSEN.2025.3574226","DOIUrl":null,"url":null,"abstract":"Industrial fault diagnosis frequently confronts the challenge that both the data and targets are not independent and identically distributed (Non-IID). Conventional fault diagnosis often requires tailoring the model to specific task scenarios, thereby suffering from high costs and limited generalization capabilities. An industrial large-scale diagnostic model (ILDM) is proposed, with lightweight learning and customized deployment for distributed multiple Non-IID fault diagnosis tasks. ILDM can acquire knowledge from large public datasets that are completely irrelevant to downstream tasks and have the advantages of low learning cost, robust task adaptability, and high-quality deployment customization. First, a large-scale foundation model is built utilizing time-series embedding and 1-D–2-D–1-D variation, enhancing feature extraction capability while preserving the original temporal information. Second, through designed preprocessors and postprocessors, the constructed foundation model is pretrained on numerous public Non-IID datasets, ensuring that the model acquires generic classification capabilities. Finally, the pretrained large-scale model (LM) is tailored for deployment at different low-computing-power terminals to conduct various process/system-specific Non-IID diagnostic tasks, with lightweight customization achieved through fine-tuning and label smoothing based on terminal datasets. The performance of ILDM is evaluated across three representative Non-IID industrial diagnostic tasks in few-shot and full-shot experiments, as well as in experiments with varying sample lengths. In addition, this article discusses the potential of large-scale models and longer data transmission chains. Code is available at this repository: <uri>https://github.com/JMu-Jia/ILDM</uri>","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 14","pages":"27043-27055"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Industrial Large-Scale Diagnostic Model With Lightweight Customized Deployment for Distributed Multiple Non-IID Diagnostic Tasks\",\"authors\":\"Xinming Jia;Na Qin;Deqing Huang;Jiahao Du;Tianwei Wang\",\"doi\":\"10.1109/JSEN.2025.3574226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Industrial fault diagnosis frequently confronts the challenge that both the data and targets are not independent and identically distributed (Non-IID). Conventional fault diagnosis often requires tailoring the model to specific task scenarios, thereby suffering from high costs and limited generalization capabilities. An industrial large-scale diagnostic model (ILDM) is proposed, with lightweight learning and customized deployment for distributed multiple Non-IID fault diagnosis tasks. ILDM can acquire knowledge from large public datasets that are completely irrelevant to downstream tasks and have the advantages of low learning cost, robust task adaptability, and high-quality deployment customization. First, a large-scale foundation model is built utilizing time-series embedding and 1-D–2-D–1-D variation, enhancing feature extraction capability while preserving the original temporal information. Second, through designed preprocessors and postprocessors, the constructed foundation model is pretrained on numerous public Non-IID datasets, ensuring that the model acquires generic classification capabilities. Finally, the pretrained large-scale model (LM) is tailored for deployment at different low-computing-power terminals to conduct various process/system-specific Non-IID diagnostic tasks, with lightweight customization achieved through fine-tuning and label smoothing based on terminal datasets. The performance of ILDM is evaluated across three representative Non-IID industrial diagnostic tasks in few-shot and full-shot experiments, as well as in experiments with varying sample lengths. In addition, this article discusses the potential of large-scale models and longer data transmission chains. Code is available at this repository: <uri>https://github.com/JMu-Jia/ILDM</uri>\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 14\",\"pages\":\"27043-27055\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11023071/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11023071/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

工业故障诊断经常面临数据和目标非独立同分布（Non-IID）的挑战。传统的故障诊断通常需要根据特定的任务场景定制模型，因此成本高，泛化能力有限。针对分布式多非iid故障诊断任务，提出了一种轻量级学习和定制部署的工业大规模诊断模型（ILDM）。ILDM可以从与下游任务完全无关的大型公共数据集中获取知识，具有学习成本低、任务适应性强、高质量部署定制等优点。首先，利用时间序列嵌入和1-D-2-D-1-D变化构建大规模基础模型，在保留原始时间信息的同时增强特征提取能力；其次，通过设计预处理器和后处理器，对构建的基础模型在大量公共Non-IID数据集上进行预训练，确保模型获得通用分类能力。最后，针对不同低计算能力终端的部署，对预训练的大规模模型（LM）进行定制，以执行各种特定于流程/系统的非iid诊断任务，并通过基于终端数据集的微调和标签平滑实现轻量级定制。ILDM的性能在三个具有代表性的非iid工业诊断任务中进行了评估，包括少量和完整的实验，以及不同样本长度的实验。此外，本文还讨论了大规模模型和更长的数据传输链的潜力。代码可在此存储库中获得：https://github.com/JMu-Jia/ILDM

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Industrial Large-Scale Diagnostic Model With Lightweight Customized Deployment for Distributed Multiple Non-IID Diagnostic Tasks

Industrial fault diagnosis frequently confronts the challenge that both the data and targets are not independent and identically distributed (Non-IID). Conventional fault diagnosis often requires tailoring the model to specific task scenarios, thereby suffering from high costs and limited generalization capabilities. An industrial large-scale diagnostic model (ILDM) is proposed, with lightweight learning and customized deployment for distributed multiple Non-IID fault diagnosis tasks. ILDM can acquire knowledge from large public datasets that are completely irrelevant to downstream tasks and have the advantages of low learning cost, robust task adaptability, and high-quality deployment customization. First, a large-scale foundation model is built utilizing time-series embedding and 1-D–2-D–1-D variation, enhancing feature extraction capability while preserving the original temporal information. Second, through designed preprocessors and postprocessors, the constructed foundation model is pretrained on numerous public Non-IID datasets, ensuring that the model acquires generic classification capabilities. Finally, the pretrained large-scale model (LM) is tailored for deployment at different low-computing-power terminals to conduct various process/system-specific Non-IID diagnostic tasks, with lightweight customization achieved through fine-tuning and label smoothing based on terminal datasets. The performance of ILDM is evaluated across three representative Non-IID industrial diagnostic tasks in few-shot and full-shot experiments, as well as in experiments with varying sample lengths. In addition, this article discusses the potential of large-scale models and longer data transmission chains. Code is available at this repository: https://github.com/JMu-Jia/ILDM

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice