Domain Knowledge Informed Unsupervised Fault Detection for Rolling Element Bearings

Douw Marx, K. Gryllias
{"title":"Domain Knowledge Informed Unsupervised Fault Detection for Rolling Element Bearings","authors":"Douw Marx, K. Gryllias","doi":"10.36001/phme.2022.v7i1.3348","DOIUrl":null,"url":null,"abstract":"Early and accurate detection of rolling element bearing faults in rotating machinery is important for minimizing production downtime and reducing unnecessary preventative maintenance. Several fault detection methods based on signal processing and machine learning methods have been proposed. Particularly, supervised, data-driven approaches have proved to be very effective for fault detection and diagnostics of rolling element bearings. However, supervised methods rely heavily on the availability of failure data with volume, variety and veracity, which is mostly unavailable in industry. As an alternative data-driven strategy, unsupervised methods are trained on healthy data only and do not require any failure data.\nIn contrast to supervised and un-supervised data-driven models, physics-based and phenomenological models are based on domain knowledge and not on historical data. Although these models are useful for studying the way in which damage is expected to manifest in a measured signal, they are difficult to calibrate and often lack the fidelity required to model reality. In this paper, an unsupervised data-driven anomaly detection method that exploits informative domain knowledge is proposed. Hereby, the versatility of unsupervised data-driven methods are combined with domain knowledge.\nIn this approach, supplementary training data is generated by augmenting healthy data towards its possible future faulty state based on the characteristic bearing fault frequencies. Both healthy and augmented squared envelope spectrum data is used to train an autoencoder model that includes regularisation designed to constrain the latent features at the autoencoder bottleneck. Regularisation in the autoencoder loss enforces that the expected deviation of the healthy latent representation towards the augmented latent representation at dam aged conditions, is constrained to be maximally different for different fault modes. Consequently, the likelihood of a new test sample being healthy can be evaluated based on the projection of the sample onto an expected failure direction in the latent representation.\nA phenomenological and experimental dataset is used to demonstrate that the addition of augmented training data and a specialized autoencoder loss function can create a separable latent representation that can be used to generate interpretable health indicators.","PeriodicalId":422825,"journal":{"name":"PHM Society European Conference","volume":"160 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PHM Society European Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36001/phme.2022.v7i1.3348","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Early and accurate detection of rolling element bearing faults in rotating machinery is important for minimizing production downtime and reducing unnecessary preventative maintenance. Several fault detection methods based on signal processing and machine learning methods have been proposed. Particularly, supervised, data-driven approaches have proved to be very effective for fault detection and diagnostics of rolling element bearings. However, supervised methods rely heavily on the availability of failure data with volume, variety and veracity, which is mostly unavailable in industry. As an alternative data-driven strategy, unsupervised methods are trained on healthy data only and do not require any failure data. In contrast to supervised and un-supervised data-driven models, physics-based and phenomenological models are based on domain knowledge and not on historical data. Although these models are useful for studying the way in which damage is expected to manifest in a measured signal, they are difficult to calibrate and often lack the fidelity required to model reality. In this paper, an unsupervised data-driven anomaly detection method that exploits informative domain knowledge is proposed. Hereby, the versatility of unsupervised data-driven methods are combined with domain knowledge. In this approach, supplementary training data is generated by augmenting healthy data towards its possible future faulty state based on the characteristic bearing fault frequencies. Both healthy and augmented squared envelope spectrum data is used to train an autoencoder model that includes regularisation designed to constrain the latent features at the autoencoder bottleneck. Regularisation in the autoencoder loss enforces that the expected deviation of the healthy latent representation towards the augmented latent representation at dam aged conditions, is constrained to be maximally different for different fault modes. Consequently, the likelihood of a new test sample being healthy can be evaluated based on the projection of the sample onto an expected failure direction in the latent representation. A phenomenological and experimental dataset is used to demonstrate that the addition of augmented training data and a specialized autoencoder loss function can create a separable latent representation that can be used to generate interpretable health indicators.
基于领域知识的滚动轴承无监督故障检测
旋转机械中滚动轴承故障的早期和准确检测对于最大限度地减少生产停机时间和减少不必要的预防性维护至关重要。提出了几种基于信号处理和机器学习方法的故障检测方法。特别是,有监督的、数据驱动的方法已被证明对滚动轴承的故障检测和诊断非常有效。然而,监督方法在很大程度上依赖于故障数据的数量、种类和准确性,这在工业中大多是不可用的。作为一种替代的数据驱动策略,无监督方法仅在健康数据上进行训练,不需要任何故障数据。与有监督和无监督的数据驱动模型相比,基于物理和现象学的模型是基于领域知识而不是历史数据的。尽管这些模型对于研究被测信号中预期的损伤表现方式很有用,但它们很难校准,而且往往缺乏模拟现实所需的保真度。本文提出了一种利用信息性领域知识的无监督数据驱动异常检测方法。因此,将无监督数据驱动方法的通用性与领域知识相结合。在这种方法中,根据轴承的特征故障频率,将健康数据增强到未来可能出现的故障状态,从而生成补充训练数据。健康和增强的平方包络谱数据都用于训练一个自编码器模型,该模型包括正则化,旨在约束自编码器瓶颈处的潜在特征。自编码器损耗的正则化使得在坝龄条件下健康潜在表示对增强潜在表示的期望偏差,在不同的故障模式下被约束为最大的不同。因此,新测试样本健康的可能性可以根据样本在潜在表示中的预期失效方向上的投影来评估。使用现象学和实验数据集来证明添加增强训练数据和专门的自编码器损失函数可以创建可分离的潜在表示,可用于生成可解释的健康指标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信