{"title":"Domain adaptation via gamma, Weibull, and lognormal distributions for fault detection in chemical and energy processes","authors":"Lingkai Yang, Jian Cheng, Yi Luo, Tianbai Zhou, Xiaoyu Zhang, Linsong Shi, Yuan Xu","doi":"10.1002/cjce.25373","DOIUrl":null,"url":null,"abstract":"<p>The burgeoning development of supervised machine learning (ML) has led to its widespread applications in chemical and energy processes, such as fault detection. However, in some scenarios, collecting labelled data can be costly, hazardous, or impossible. Moreover, data of the same process can follow varying distributions due to changes in, for example, devices and environment, causing ML models to be ineffective. These challenges pose a domain adaptation task, necessitating the refinement of existing ML models to tackle issues from related applications. This study proposes a domain adaptation approach to address label scarcity and data distribution variation. The method has three stages: data distribution modelling (knowledge discovery), adaptation of target domain samples to source domains (knowledge transformation), and classifier ensemble for fault detection (knowledge fusion). Gamma, Weibull, and lognormal distributions are applied for data modelling and domain adaptation. The effectiveness of the method is validated on synthetic datasets and then applied to identify anomalies in coal mine pressure data and detect faults in the Tennessee Eastman (TE) process.</p>","PeriodicalId":9400,"journal":{"name":"Canadian Journal of Chemical Engineering","volume":"103 1","pages":"359-372"},"PeriodicalIF":1.6000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Journal of Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjce.25373","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The burgeoning development of supervised machine learning (ML) has led to its widespread applications in chemical and energy processes, such as fault detection. However, in some scenarios, collecting labelled data can be costly, hazardous, or impossible. Moreover, data of the same process can follow varying distributions due to changes in, for example, devices and environment, causing ML models to be ineffective. These challenges pose a domain adaptation task, necessitating the refinement of existing ML models to tackle issues from related applications. This study proposes a domain adaptation approach to address label scarcity and data distribution variation. The method has three stages: data distribution modelling (knowledge discovery), adaptation of target domain samples to source domains (knowledge transformation), and classifier ensemble for fault detection (knowledge fusion). Gamma, Weibull, and lognormal distributions are applied for data modelling and domain adaptation. The effectiveness of the method is validated on synthetic datasets and then applied to identify anomalies in coal mine pressure data and detect faults in the Tennessee Eastman (TE) process.
有监督机器学习(ML)的蓬勃发展使其广泛应用于化学和能源过程,如故障检测。然而,在某些情况下,收集标记数据可能成本高昂、危险或不可能。此外,由于设备和环境等方面的变化,同一流程的数据可能会有不同的分布,从而导致 ML 模型失效。这些挑战提出了一个领域适应任务,要求对现有的 ML 模型进行改进,以解决相关应用中的问题。本研究提出了一种领域适应方法来解决标签稀缺和数据分布变化的问题。该方法分为三个阶段:数据分布建模(知识发现)、目标域样本适应源域(知识转换)和故障检测分类器组合(知识融合)。数据建模和域适应应用了伽马分布、威布尔分布和对数正态分布。在合成数据集上验证了该方法的有效性,然后将其应用于识别煤矿压力数据中的异常,并检测田纳西伊士曼(TE)工艺中的故障。
期刊介绍:
The Canadian Journal of Chemical Engineering (CJChE) publishes original research articles, new theoretical interpretation or experimental findings and critical reviews in the science or industrial practice of chemical and biochemical processes. Preference is given to papers having a clearly indicated scope and applicability in any of the following areas: Fluid mechanics, heat and mass transfer, multiphase flows, separations processes, thermodynamics, process systems engineering, reactors and reaction kinetics, catalysis, interfacial phenomena, electrochemical phenomena, bioengineering, minerals processing and natural products and environmental and energy engineering. Papers that merely describe or present a conventional or routine analysis of existing processes will not be considered.