{"title":"将知识与时间序列对齐:跨模态领域知识激活用于llm支持的零故障诊断","authors":"Jiancheng Zhao , Chunhui Zhao , Jiaqi Yue","doi":"10.1016/j.jprocont.2025.103534","DOIUrl":null,"url":null,"abstract":"<div><div>The existing zero-shot fault diagnosis methods identify unseen categories that have no training samples by predicting fault attributes from samples. Although these methods alleviated the data scarcity issue, they raised a new challenge, i.e., professional fault attributes annotation. We recognize that the essence of fault attributes lies in describing the connections and differences between categories. Therefore, we propose a novel LLM-enabled zero-shot fault diagnosis paradigm, and the large language models (LLMs) fine-tuned based on domain-specific knowledge can capture similar information to replace manual annotation. It unlocks the potential of LLMs to handle zero-shot tasks related to industrial time-series data in a cross-modal manner. It aims to address the burden of semantic knowledge annotation posed by the existing attribute-enabled paradigm. Moreover, the domain shift problem (DSP) arising from the shortage of training samples for unseen faults is also tackled by leveraging the cross-modal activation of relevant knowledge that has been learned from domain-specific documents. Firstly, to address the issue that LLMs pretrained on general knowledge are lacking in the knowledge of the industrial field, we design prompts for industrial faults and fine-tune the LLM with domain knowledge from diagnosis reports. Subsequently, considering that LLMs lack the ability to process time-series data, we design a cross-modal transformation module to align the time-series modality with the text modality. Moreover, we propose a knowledge distillation strategy to further align these two modalities, so the unseen fault text descriptions can serve as substitutes for the unavailable samples to address the DSP. We conduct experiments on a real thermal power plant, and the proposed method achieves an average improvement of 9.83% in terms of the diagnosis accuracy of unseen faults.</div></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"155 ","pages":"Article 103534"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Align knowledge with time-series: Cross-modal domain knowledge activation for LLM-enabled zero-shot fault diagnosis\",\"authors\":\"Jiancheng Zhao , Chunhui Zhao , Jiaqi Yue\",\"doi\":\"10.1016/j.jprocont.2025.103534\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The existing zero-shot fault diagnosis methods identify unseen categories that have no training samples by predicting fault attributes from samples. Although these methods alleviated the data scarcity issue, they raised a new challenge, i.e., professional fault attributes annotation. We recognize that the essence of fault attributes lies in describing the connections and differences between categories. Therefore, we propose a novel LLM-enabled zero-shot fault diagnosis paradigm, and the large language models (LLMs) fine-tuned based on domain-specific knowledge can capture similar information to replace manual annotation. It unlocks the potential of LLMs to handle zero-shot tasks related to industrial time-series data in a cross-modal manner. It aims to address the burden of semantic knowledge annotation posed by the existing attribute-enabled paradigm. Moreover, the domain shift problem (DSP) arising from the shortage of training samples for unseen faults is also tackled by leveraging the cross-modal activation of relevant knowledge that has been learned from domain-specific documents. Firstly, to address the issue that LLMs pretrained on general knowledge are lacking in the knowledge of the industrial field, we design prompts for industrial faults and fine-tune the LLM with domain knowledge from diagnosis reports. Subsequently, considering that LLMs lack the ability to process time-series data, we design a cross-modal transformation module to align the time-series modality with the text modality. Moreover, we propose a knowledge distillation strategy to further align these two modalities, so the unseen fault text descriptions can serve as substitutes for the unavailable samples to address the DSP. We conduct experiments on a real thermal power plant, and the proposed method achieves an average improvement of 9.83% in terms of the diagnosis accuracy of unseen faults.</div></div>\",\"PeriodicalId\":50079,\"journal\":{\"name\":\"Journal of Process Control\",\"volume\":\"155 \",\"pages\":\"Article 103534\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Process Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0959152425001623\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152425001623","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Align knowledge with time-series: Cross-modal domain knowledge activation for LLM-enabled zero-shot fault diagnosis
The existing zero-shot fault diagnosis methods identify unseen categories that have no training samples by predicting fault attributes from samples. Although these methods alleviated the data scarcity issue, they raised a new challenge, i.e., professional fault attributes annotation. We recognize that the essence of fault attributes lies in describing the connections and differences between categories. Therefore, we propose a novel LLM-enabled zero-shot fault diagnosis paradigm, and the large language models (LLMs) fine-tuned based on domain-specific knowledge can capture similar information to replace manual annotation. It unlocks the potential of LLMs to handle zero-shot tasks related to industrial time-series data in a cross-modal manner. It aims to address the burden of semantic knowledge annotation posed by the existing attribute-enabled paradigm. Moreover, the domain shift problem (DSP) arising from the shortage of training samples for unseen faults is also tackled by leveraging the cross-modal activation of relevant knowledge that has been learned from domain-specific documents. Firstly, to address the issue that LLMs pretrained on general knowledge are lacking in the knowledge of the industrial field, we design prompts for industrial faults and fine-tune the LLM with domain knowledge from diagnosis reports. Subsequently, considering that LLMs lack the ability to process time-series data, we design a cross-modal transformation module to align the time-series modality with the text modality. Moreover, we propose a knowledge distillation strategy to further align these two modalities, so the unseen fault text descriptions can serve as substitutes for the unavailable samples to address the DSP. We conduct experiments on a real thermal power plant, and the proposed method achieves an average improvement of 9.83% in terms of the diagnosis accuracy of unseen faults.
期刊介绍:
This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others.
Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques.
Topics covered include:
• Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods
Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.