Hybrid semi-supervised fault detection framework under limited high-confidence data scenarios

IF 3.9 2区工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computers & Chemical Engineering Pub Date : 2025-05-14 DOI:10.1016/j.compchemeng.2025.109186

Selorme Agbleze , Lawrence J. Shadle , Fernando V. Lima

{"title":"Hybrid semi-supervised fault detection framework under limited high-confidence data scenarios","authors":"Selorme Agbleze , Lawrence J. Shadle , Fernando V. Lima","doi":"10.1016/j.compchemeng.2025.109186","DOIUrl":null,"url":null,"abstract":"<div><div>In the broad field of fault detection, approaches utilizing process conditions are established for systems with adequate datasets. However, systems including recently commissioned and novel processes have limited datasets available for model-based fault detection. Moreover, these systems have far greater proportions of normal operating data than adequate fault examples due to the time for which they have been operated. In this work, a combined hybrid framework for fault detection is developed that enables augmentation of the limited dataset available with HAZOP data, allowing for the utilization of both human expert knowledge and generated pseudo-process data. Additionally, the generation of artificial data is performed for reducing false positives in adversarial training. A semi-supervised distance variant of center loss is used to improve the consistency of deep feature activations from paired and unpaired data. A comparison between the proposed approach and an approach utilizing only process data in the limited data case is presented. Overall, the proposed approach shows 4.1 % and 8.8 % improvements in average detection rate when compared to the state-of-the-art supervised method for the Tennessee Eastman process and subcritical coal-fired power plant case studies, respectively, enabling the use of unlabeled data to supplement labeled process data for fault detection.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"200 ","pages":"Article 109186"},"PeriodicalIF":3.9000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425001905","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

In the broad field of fault detection, approaches utilizing process conditions are established for systems with adequate datasets. However, systems including recently commissioned and novel processes have limited datasets available for model-based fault detection. Moreover, these systems have far greater proportions of normal operating data than adequate fault examples due to the time for which they have been operated. In this work, a combined hybrid framework for fault detection is developed that enables augmentation of the limited dataset available with HAZOP data, allowing for the utilization of both human expert knowledge and generated pseudo-process data. Additionally, the generation of artificial data is performed for reducing false positives in adversarial training. A semi-supervised distance variant of center loss is used to improve the consistency of deep feature activations from paired and unpaired data. A comparison between the proposed approach and an approach utilizing only process data in the limited data case is presented. Overall, the proposed approach shows 4.1 % and 8.8 % improvements in average detection rate when compared to the state-of-the-art supervised method for the Tennessee Eastman process and subcritical coal-fired power plant case studies, respectively, enabling the use of unlabeled data to supplement labeled process data for fault detection.

查看原文本刊更多论文

有限高置信度数据场景下的混合半监督故障检测框架

在广泛的故障检测领域，利用过程条件的方法是为具有足够数据集的系统建立的。然而，包括最近委托和新流程在内的系统，用于基于模型的故障检测的数据集有限。此外，由于这些系统已经运行了很长时间，因此这些系统的正常运行数据比足够的故障示例的比例要大得多。在这项工作中，开发了一个用于故障检测的组合混合框架，可以增加HAZOP数据可用的有限数据集，从而允许利用人类专家知识和生成的伪过程数据。此外，人工数据的生成是为了减少对抗性训练中的误报。使用中心损失的半监督距离变体来提高来自成对和非成对数据的深度特征激活的一致性。在有限的数据情况下，将所提出的方法与仅利用过程数据的方法进行了比较。总的来说，与田纳西伊士曼过程和亚临界燃煤电厂案例研究中最先进的监督方法相比，所提出的方法在平均检出率方面分别提高了4.1%和8.8%，从而能够使用未标记数据来补充标记过程数据进行故障检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers & Chemical Engineering 工程技术-工程：化工

CiteScore

8.70

自引率

14.00%

发文量

374

审稿时长

70 days

期刊介绍： Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.