Xinming Jia;Na Qin;Deqing Huang;Jiahao Du;Tianwei Wang
{"title":"工业大规模诊断模型与轻量级定制部署分布式多个非iid诊断任务","authors":"Xinming Jia;Na Qin;Deqing Huang;Jiahao Du;Tianwei Wang","doi":"10.1109/JSEN.2025.3574226","DOIUrl":null,"url":null,"abstract":"Industrial fault diagnosis frequently confronts the challenge that both the data and targets are not independent and identically distributed (Non-IID). Conventional fault diagnosis often requires tailoring the model to specific task scenarios, thereby suffering from high costs and limited generalization capabilities. An industrial large-scale diagnostic model (ILDM) is proposed, with lightweight learning and customized deployment for distributed multiple Non-IID fault diagnosis tasks. ILDM can acquire knowledge from large public datasets that are completely irrelevant to downstream tasks and have the advantages of low learning cost, robust task adaptability, and high-quality deployment customization. First, a large-scale foundation model is built utilizing time-series embedding and 1-D–2-D–1-D variation, enhancing feature extraction capability while preserving the original temporal information. Second, through designed preprocessors and postprocessors, the constructed foundation model is pretrained on numerous public Non-IID datasets, ensuring that the model acquires generic classification capabilities. Finally, the pretrained large-scale model (LM) is tailored for deployment at different low-computing-power terminals to conduct various process/system-specific Non-IID diagnostic tasks, with lightweight customization achieved through fine-tuning and label smoothing based on terminal datasets. The performance of ILDM is evaluated across three representative Non-IID industrial diagnostic tasks in few-shot and full-shot experiments, as well as in experiments with varying sample lengths. In addition, this article discusses the potential of large-scale models and longer data transmission chains. Code is available at this repository: <uri>https://github.com/JMu-Jia/ILDM</uri>","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 14","pages":"27043-27055"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Industrial Large-Scale Diagnostic Model With Lightweight Customized Deployment for Distributed Multiple Non-IID Diagnostic Tasks\",\"authors\":\"Xinming Jia;Na Qin;Deqing Huang;Jiahao Du;Tianwei Wang\",\"doi\":\"10.1109/JSEN.2025.3574226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Industrial fault diagnosis frequently confronts the challenge that both the data and targets are not independent and identically distributed (Non-IID). Conventional fault diagnosis often requires tailoring the model to specific task scenarios, thereby suffering from high costs and limited generalization capabilities. An industrial large-scale diagnostic model (ILDM) is proposed, with lightweight learning and customized deployment for distributed multiple Non-IID fault diagnosis tasks. ILDM can acquire knowledge from large public datasets that are completely irrelevant to downstream tasks and have the advantages of low learning cost, robust task adaptability, and high-quality deployment customization. First, a large-scale foundation model is built utilizing time-series embedding and 1-D–2-D–1-D variation, enhancing feature extraction capability while preserving the original temporal information. Second, through designed preprocessors and postprocessors, the constructed foundation model is pretrained on numerous public Non-IID datasets, ensuring that the model acquires generic classification capabilities. Finally, the pretrained large-scale model (LM) is tailored for deployment at different low-computing-power terminals to conduct various process/system-specific Non-IID diagnostic tasks, with lightweight customization achieved through fine-tuning and label smoothing based on terminal datasets. The performance of ILDM is evaluated across three representative Non-IID industrial diagnostic tasks in few-shot and full-shot experiments, as well as in experiments with varying sample lengths. In addition, this article discusses the potential of large-scale models and longer data transmission chains. Code is available at this repository: <uri>https://github.com/JMu-Jia/ILDM</uri>\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 14\",\"pages\":\"27043-27055\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11023071/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11023071/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Industrial Large-Scale Diagnostic Model With Lightweight Customized Deployment for Distributed Multiple Non-IID Diagnostic Tasks
Industrial fault diagnosis frequently confronts the challenge that both the data and targets are not independent and identically distributed (Non-IID). Conventional fault diagnosis often requires tailoring the model to specific task scenarios, thereby suffering from high costs and limited generalization capabilities. An industrial large-scale diagnostic model (ILDM) is proposed, with lightweight learning and customized deployment for distributed multiple Non-IID fault diagnosis tasks. ILDM can acquire knowledge from large public datasets that are completely irrelevant to downstream tasks and have the advantages of low learning cost, robust task adaptability, and high-quality deployment customization. First, a large-scale foundation model is built utilizing time-series embedding and 1-D–2-D–1-D variation, enhancing feature extraction capability while preserving the original temporal information. Second, through designed preprocessors and postprocessors, the constructed foundation model is pretrained on numerous public Non-IID datasets, ensuring that the model acquires generic classification capabilities. Finally, the pretrained large-scale model (LM) is tailored for deployment at different low-computing-power terminals to conduct various process/system-specific Non-IID diagnostic tasks, with lightweight customization achieved through fine-tuning and label smoothing based on terminal datasets. The performance of ILDM is evaluated across three representative Non-IID industrial diagnostic tasks in few-shot and full-shot experiments, as well as in experiments with varying sample lengths. In addition, this article discusses the potential of large-scale models and longer data transmission chains. Code is available at this repository: https://github.com/JMu-Jia/ILDM
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice