将知识与时间序列对齐：跨模态领域知识激活用于llm支持的零故障诊断

IF 3.9 2区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Journal of Process Control Pub Date : 2025-09-09 DOI:10.1016/j.jprocont.2025.103534

Jiancheng Zhao , Chunhui Zhao , Jiaqi Yue

{"title":"将知识与时间序列对齐：跨模态领域知识激活用于llm支持的零故障诊断","authors":"Jiancheng Zhao , Chunhui Zhao , Jiaqi Yue","doi":"10.1016/j.jprocont.2025.103534","DOIUrl":null,"url":null,"abstract":"<div><div>The existing zero-shot fault diagnosis methods identify unseen categories that have no training samples by predicting fault attributes from samples. Although these methods alleviated the data scarcity issue, they raised a new challenge, i.e., professional fault attributes annotation. We recognize that the essence of fault attributes lies in describing the connections and differences between categories. Therefore, we propose a novel LLM-enabled zero-shot fault diagnosis paradigm, and the large language models (LLMs) fine-tuned based on domain-specific knowledge can capture similar information to replace manual annotation. It unlocks the potential of LLMs to handle zero-shot tasks related to industrial time-series data in a cross-modal manner. It aims to address the burden of semantic knowledge annotation posed by the existing attribute-enabled paradigm. Moreover, the domain shift problem (DSP) arising from the shortage of training samples for unseen faults is also tackled by leveraging the cross-modal activation of relevant knowledge that has been learned from domain-specific documents. Firstly, to address the issue that LLMs pretrained on general knowledge are lacking in the knowledge of the industrial field, we design prompts for industrial faults and fine-tune the LLM with domain knowledge from diagnosis reports. Subsequently, considering that LLMs lack the ability to process time-series data, we design a cross-modal transformation module to align the time-series modality with the text modality. Moreover, we propose a knowledge distillation strategy to further align these two modalities, so the unseen fault text descriptions can serve as substitutes for the unavailable samples to address the DSP. We conduct experiments on a real thermal power plant, and the proposed method achieves an average improvement of 9.83% in terms of the diagnosis accuracy of unseen faults.</div></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"155 ","pages":"Article 103534"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Align knowledge with time-series: Cross-modal domain knowledge activation for LLM-enabled zero-shot fault diagnosis\",\"authors\":\"Jiancheng Zhao , Chunhui Zhao , Jiaqi Yue\",\"doi\":\"10.1016/j.jprocont.2025.103534\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The existing zero-shot fault diagnosis methods identify unseen categories that have no training samples by predicting fault attributes from samples. Although these methods alleviated the data scarcity issue, they raised a new challenge, i.e., professional fault attributes annotation. We recognize that the essence of fault attributes lies in describing the connections and differences between categories. Therefore, we propose a novel LLM-enabled zero-shot fault diagnosis paradigm, and the large language models (LLMs) fine-tuned based on domain-specific knowledge can capture similar information to replace manual annotation. It unlocks the potential of LLMs to handle zero-shot tasks related to industrial time-series data in a cross-modal manner. It aims to address the burden of semantic knowledge annotation posed by the existing attribute-enabled paradigm. Moreover, the domain shift problem (DSP) arising from the shortage of training samples for unseen faults is also tackled by leveraging the cross-modal activation of relevant knowledge that has been learned from domain-specific documents. Firstly, to address the issue that LLMs pretrained on general knowledge are lacking in the knowledge of the industrial field, we design prompts for industrial faults and fine-tune the LLM with domain knowledge from diagnosis reports. Subsequently, considering that LLMs lack the ability to process time-series data, we design a cross-modal transformation module to align the time-series modality with the text modality. Moreover, we propose a knowledge distillation strategy to further align these two modalities, so the unseen fault text descriptions can serve as substitutes for the unavailable samples to address the DSP. We conduct experiments on a real thermal power plant, and the proposed method achieves an average improvement of 9.83% in terms of the diagnosis accuracy of unseen faults.</div></div>\",\"PeriodicalId\":50079,\"journal\":{\"name\":\"Journal of Process Control\",\"volume\":\"155 \",\"pages\":\"Article 103534\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-09-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Process Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0959152425001623\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152425001623","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

现有的零射击故障诊断方法通过预测样本的故障属性来识别没有训练样本的未知类别。这些方法虽然缓解了数据稀缺性问题，但也提出了新的挑战，即专业的故障属性标注。我们认识到故障属性的本质在于描述类别之间的联系和差异。因此，我们提出了一种新的基于llm的零故障诊断范式，并且基于特定领域知识进行微调的大语言模型（llm）可以捕获类似的信息，以取代人工注释。它释放了llm以跨模态方式处理与工业时间序列数据相关的零射击任务的潜力。它旨在解决现有属性支持范式给语义知识标注带来的负担。此外，通过利用从特定领域文档中学习到的相关知识的跨模态激活，还解决了由于未见故障的训练样本不足而引起的领域转移问题。首先，为了解决一般知识预训练的LLM缺乏工业领域知识的问题，我们设计了工业故障提示，并利用诊断报告中的领域知识对LLM进行微调。随后，考虑到llm缺乏处理时间序列数据的能力，我们设计了一个跨模态转换模块，将时间序列模态与文本模态对齐。此外，我们提出了一种知识蒸馏策略来进一步协调这两种模式，因此看不见的故障文本描述可以替代不可用的样本来解决DSP问题。在实际火电厂进行了实验，该方法对未见故障的诊断准确率平均提高了9.83%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Align knowledge with time-series: Cross-modal domain knowledge activation for LLM-enabled zero-shot fault diagnosis

The existing zero-shot fault diagnosis methods identify unseen categories that have no training samples by predicting fault attributes from samples. Although these methods alleviated the data scarcity issue, they raised a new challenge, i.e., professional fault attributes annotation. We recognize that the essence of fault attributes lies in describing the connections and differences between categories. Therefore, we propose a novel LLM-enabled zero-shot fault diagnosis paradigm, and the large language models (LLMs) fine-tuned based on domain-specific knowledge can capture similar information to replace manual annotation. It unlocks the potential of LLMs to handle zero-shot tasks related to industrial time-series data in a cross-modal manner. It aims to address the burden of semantic knowledge annotation posed by the existing attribute-enabled paradigm. Moreover, the domain shift problem (DSP) arising from the shortage of training samples for unseen faults is also tackled by leveraging the cross-modal activation of relevant knowledge that has been learned from domain-specific documents. Firstly, to address the issue that LLMs pretrained on general knowledge are lacking in the knowledge of the industrial field, we design prompts for industrial faults and fine-tune the LLM with domain knowledge from diagnosis reports. Subsequently, considering that LLMs lack the ability to process time-series data, we design a cross-modal transformation module to align the time-series modality with the text modality. Moreover, we propose a knowledge distillation strategy to further align these two modalities, so the unseen fault text descriptions can serve as substitutes for the unavailable samples to address the DSP. We conduct experiments on a real thermal power plant, and the proposed method achieves an average improvement of 9.83% in terms of the diagnosis accuracy of unseen faults.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Process Control 工程技术-工程：化工

CiteScore

7.00

自引率

11.90%

发文量

159

审稿时长

74 days

期刊介绍： This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others. Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques. Topics covered include: • Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.