基于鲁棒Mahalanobis距离的惰性学习方法在高维过程中的故障检测

IF 1.6 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
Jungwon Yu, Kwang-Ju Kim, In-Su Jang
{"title":"基于鲁棒Mahalanobis距离的惰性学习方法在高维过程中的故障检测","authors":"Jungwon Yu,&nbsp;Kwang-Ju Kim,&nbsp;In-Su Jang","doi":"10.4218/etrij.2024-0253","DOIUrl":null,"url":null,"abstract":"<p>When using lazy learners based on the Mahalanobis distance (MD) function for process fault detection (FD), due to the curse of dimensionality, type I errors can increase significantly as the number of process variables increases. In high-dimensional data spaces, certain regions exist in which data samples are sparsely distributed. From the perspective of dense regions, the outlierness (i.e., degree of being statistical outliers) of samples in sparse regions increases as the data dimensions increase, leading to unstable estimations of classical covariance matrices for calculating MD function values. To solve this problem, a lazy learning method is proposed based on a robust MD function, where robust covariance matrices are estimated using a minimum covariance determinant method. Here, <i>k</i>-nearest neighbors and local outlier factor are employed as baseline learners. The proposed method can be applied to all types of lazy learning techniques. To verify FD performance, the proposed method is applied to two benchmark processes. The experimental results show that the proposed method can perform FD on very high-dimensional processes successfully without rapid increases in type I errors.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 5","pages":"865-880"},"PeriodicalIF":1.6000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0253","citationCount":"0","resultStr":"{\"title\":\"Robust Mahalanobis distance-based lazy learning method for fault detection in high-dimensional processes\",\"authors\":\"Jungwon Yu,&nbsp;Kwang-Ju Kim,&nbsp;In-Su Jang\",\"doi\":\"10.4218/etrij.2024-0253\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>When using lazy learners based on the Mahalanobis distance (MD) function for process fault detection (FD), due to the curse of dimensionality, type I errors can increase significantly as the number of process variables increases. In high-dimensional data spaces, certain regions exist in which data samples are sparsely distributed. From the perspective of dense regions, the outlierness (i.e., degree of being statistical outliers) of samples in sparse regions increases as the data dimensions increase, leading to unstable estimations of classical covariance matrices for calculating MD function values. To solve this problem, a lazy learning method is proposed based on a robust MD function, where robust covariance matrices are estimated using a minimum covariance determinant method. Here, <i>k</i>-nearest neighbors and local outlier factor are employed as baseline learners. The proposed method can be applied to all types of lazy learning techniques. To verify FD performance, the proposed method is applied to two benchmark processes. The experimental results show that the proposed method can perform FD on very high-dimensional processes successfully without rapid increases in type I errors.</p>\",\"PeriodicalId\":11901,\"journal\":{\"name\":\"ETRI Journal\",\"volume\":\"47 5\",\"pages\":\"865-880\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0253\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ETRI Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.4218/etrij.2024-0253\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ETRI Journal","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.4218/etrij.2024-0253","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

在使用基于马氏距离(MD)函数的惰性学习器进行过程故障检测(FD)时,由于维数的限制,I型误差会随着过程变量数量的增加而显著增加。在高维数据空间中,存在数据样本稀疏分布的特定区域。从密集区域来看,稀疏区域样本的离群值(即统计离群值的程度)随着数据维数的增加而增加,导致用于计算MD函数值的经典协方差矩阵估计不稳定。为了解决这一问题,提出了一种基于鲁棒MD函数的惰性学习方法,其中鲁棒协方差矩阵的估计采用最小协方差行列式方法。在这里,使用k近邻和局部离群因子作为基线学习器。该方法适用于所有类型的懒惰学习方法。为了验证FD的性能,将该方法应用于两个基准过程。实验结果表明,该方法可以成功地在非常高维的过程中执行FD,而不会导致I型误差的快速增加。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Robust Mahalanobis distance-based lazy learning method for fault detection in high-dimensional processes

Robust Mahalanobis distance-based lazy learning method for fault detection in high-dimensional processes

When using lazy learners based on the Mahalanobis distance (MD) function for process fault detection (FD), due to the curse of dimensionality, type I errors can increase significantly as the number of process variables increases. In high-dimensional data spaces, certain regions exist in which data samples are sparsely distributed. From the perspective of dense regions, the outlierness (i.e., degree of being statistical outliers) of samples in sparse regions increases as the data dimensions increase, leading to unstable estimations of classical covariance matrices for calculating MD function values. To solve this problem, a lazy learning method is proposed based on a robust MD function, where robust covariance matrices are estimated using a minimum covariance determinant method. Here, k-nearest neighbors and local outlier factor are employed as baseline learners. The proposed method can be applied to all types of lazy learning techniques. To verify FD performance, the proposed method is applied to two benchmark processes. The experimental results show that the proposed method can perform FD on very high-dimensional processes successfully without rapid increases in type I errors.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ETRI Journal
ETRI Journal 工程技术-电信学
CiteScore
4.00
自引率
7.10%
发文量
98
审稿时长
6.9 months
期刊介绍: ETRI Journal is an international, peer-reviewed multidisciplinary journal published bimonthly in English. The main focus of the journal is to provide an open forum to exchange innovative ideas and technology in the fields of information, telecommunications, and electronics. Key topics of interest include high-performance computing, big data analytics, cloud computing, multimedia technology, communication networks and services, wireless communications and mobile computing, material and component technology, as well as security. With an international editorial committee and experts from around the world as reviewers, ETRI Journal publishes high-quality research papers on the latest and best developments from the global community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信