A heterogeneous ensemble approach for the prediction of the remaining useful life of packaging industry machinery

F. Cannarile, P. Baraldi, M. Compare, D. Borghi, Luca Capelli, E. Zio
{"title":"A heterogeneous ensemble approach for the prediction of the remaining useful life of packaging industry machinery","authors":"F. Cannarile, P. Baraldi, M. Compare, D. Borghi, Luca Capelli, E. Zio","doi":"10.1201/9781351174664-11","DOIUrl":null,"url":null,"abstract":"We present a method based on heterogeneous ensemble learning for the prediction of the Remaining Useful Life (RUL) of cutting tools (knives) used in the packaging industry. Ensemble diversity is achieved by training multiple prognostic models using different learning algorithms. The combination of the outcomes of the models in the ensemble is based on a weighted averaging strategy, which assigns weights proportional to the individual model performances on patterns of a validation set. The proposed heterogeneous ensemble has been applied to real condition monitoring knife data. It has provided more accurate RUL predictions compared to those of each individual base model. radation beyond which the equipment fails performing its function (failure threshold). Examples of modeling techniques used in degradationbased approaches are Auto-Regressive models (Gorjian et al., 2009), Relevance Vector Machines (Di Maio et al., 2012) and Semi-Markov Models (Cannarile et al., 2017a) (Cannarile et al., 2018). Direct RUL predictions approaches, instead, typically resort to machine learning techniques that directly map the relation between the observable parameters and the equipment RUL, without the need of predicting the equipment degradation state evolution towards a failure threshold (Schwabacher et al., 2007). Techniques used in direct RUL prediction approaches are, for example, Artifical Neural Networks (Wang & Vachtsenavos, 2001), Extreme Learning Machines (ELM) (Yang et al., 2017), Gaussian Processes (GP) (Baraldi et al., 2015b), etc. When few run-to-failure degradation trajectories are available, direct RUL approaches may overfit, i.e., these algorithms customize themselves too much to learn the relationship between the observable parameters and the corresponding RUL in the training set. Therefore, these methods tend to lose their generalization power, which leads to poor performance on new data. To overcome this, ensemble approaches, based on the aggregation of multiple model outcomes, have been introduced (Baraldi et al., 2013a). The basic idea is that the diverse models in the ensemble complement each other by leveraging their strengths and overcoming their drawbacks. Thus, the combination of the outcomes of the individual models in the ensemble improves the accuracy of the predictions compared to the performance of a single model (Brown et al., 2005) (Baraldi et al., 2013a). Different methods, such as ANN (Baraldi et al., 2013b), Support Vector Machine (SVM) (Liu et al., 2006) and kernel learning (Liu et al., 2015), have been used with success to build the individual models. For example, an ensemble of feedforward Artificial Neural Networks (ANN) has been embedded into a Particle Filter (PF) for the prediction of crack length evolution (Baraldi et al., 2013b) and an ensemble of datadriven regression models has been exploited for the RUL prediction of lithium-ion batteries (Xing et al., 2013). In (Rigamonti et al., 2017) a local ensemble of Echo State Networks (ESN) has been proposed to improve the RUL prediction accuracy of turbofan engines. The objective of this work is to predict the RUL of knives installed on Tetra Pak® A3/Flex filling machines used to cut package material. The prognostic task is complicated by the fact that few run-to-failure degradation trajectories are available, and a failure threshold is not available. To cope with these issues, this work proposes an ensemble formed by multiple datadriven direct RUL prediction models, capable of aggregating the RUL predictions for good performance throughout the entire degradation trajectory of a knife. Ensemble diversity is achieved by heterogeneous ensemble generation, i.e., by training the models using different prognostics algorithms. Aggregation is obtained by averaging the output of the individual base models with weights proportional to the inverse of their Empirical Generalization Error (EGE) on retrieved patterns in a validation set. The application of the proposed heterogeneous ensemble method to real condition monitoring knife data has shown to provide more accurate RUL prediction compared to that of each individual base learner in the ensemble. The paper is organized as follows: in Section 2, the objectives of this work and the assumptions are discussed; in Section 3, ensemble learning main concepts for data-driven direct RUL prediction are illustrated; in Section 4, performance metrics to compare different prognostic models are discussed. The application of the methodology to Tetra Pak® A3/Flex filling data is described in Section 5, whereas Section 6 draws the work conclusions. 2 ASSUMPTIONS AND OBJECTIVES We assume to have available run-to-failure degradation trajectories of N pieces of equipment similar to the one currently monitored (test equipment). Let xi(τi) ∈ R , i = 1, . . , N; τ = 1, ... , ni be the vector of m features extracted from signal measurements performed at time τi on the i equipment, with ni indicating the total number of data acquisitions performed on the i equipment before its failure. The ground truth RUL of the i piece pf equipment at time τi will be referred to as yi(τi), i = 1, ... , N; τi = 1, ... , ni . We consider a case in which the failure thresholds for the extracted features are not known. In this setting, fault prognostics is framed as a regression problem: given the historical dataset U formed by N realizations (degradation trajectories) {xi(τi), yi(τi), τi = 1, ... , ni}, i = 1, ... , N, of a stochastic process (X(τ), Y(τ)) ∈ Rx (0, +∞), our task is to find a function f: R → (0, +∞) such that it associates to a test pattern xtest(τtest) ∈ R , the corresponding output ytest(τtest). In what follows, we refer to f as base model or base learner (Zhou, 2012). 3 ENSEMBLE LEARNING FOR FAULT PROGNOSTICS In contrast to ordinary learning approaches which try to construct one base learner from training data, ensemble methods try to construct a set of learners f1̃, ... , f?̃? and combine them to obtain an ensemble learner fen?̃?. In this work, we consider combination of base learners based on weighted averaging (Zhou, 2012), i.e., the combined output fen?̃? is obtained by averaging the output of the individual learners with different weights αh, which implies that the different learners have different importance fen?̃?(x(τ)) = ∑ αh H","PeriodicalId":278087,"journal":{"name":"Safety and Reliability – Safe Societies in a Changing World","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Safety and Reliability – Safe Societies in a Changing World","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1201/9781351174664-11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

We present a method based on heterogeneous ensemble learning for the prediction of the Remaining Useful Life (RUL) of cutting tools (knives) used in the packaging industry. Ensemble diversity is achieved by training multiple prognostic models using different learning algorithms. The combination of the outcomes of the models in the ensemble is based on a weighted averaging strategy, which assigns weights proportional to the individual model performances on patterns of a validation set. The proposed heterogeneous ensemble has been applied to real condition monitoring knife data. It has provided more accurate RUL predictions compared to those of each individual base model. radation beyond which the equipment fails performing its function (failure threshold). Examples of modeling techniques used in degradationbased approaches are Auto-Regressive models (Gorjian et al., 2009), Relevance Vector Machines (Di Maio et al., 2012) and Semi-Markov Models (Cannarile et al., 2017a) (Cannarile et al., 2018). Direct RUL predictions approaches, instead, typically resort to machine learning techniques that directly map the relation between the observable parameters and the equipment RUL, without the need of predicting the equipment degradation state evolution towards a failure threshold (Schwabacher et al., 2007). Techniques used in direct RUL prediction approaches are, for example, Artifical Neural Networks (Wang & Vachtsenavos, 2001), Extreme Learning Machines (ELM) (Yang et al., 2017), Gaussian Processes (GP) (Baraldi et al., 2015b), etc. When few run-to-failure degradation trajectories are available, direct RUL approaches may overfit, i.e., these algorithms customize themselves too much to learn the relationship between the observable parameters and the corresponding RUL in the training set. Therefore, these methods tend to lose their generalization power, which leads to poor performance on new data. To overcome this, ensemble approaches, based on the aggregation of multiple model outcomes, have been introduced (Baraldi et al., 2013a). The basic idea is that the diverse models in the ensemble complement each other by leveraging their strengths and overcoming their drawbacks. Thus, the combination of the outcomes of the individual models in the ensemble improves the accuracy of the predictions compared to the performance of a single model (Brown et al., 2005) (Baraldi et al., 2013a). Different methods, such as ANN (Baraldi et al., 2013b), Support Vector Machine (SVM) (Liu et al., 2006) and kernel learning (Liu et al., 2015), have been used with success to build the individual models. For example, an ensemble of feedforward Artificial Neural Networks (ANN) has been embedded into a Particle Filter (PF) for the prediction of crack length evolution (Baraldi et al., 2013b) and an ensemble of datadriven regression models has been exploited for the RUL prediction of lithium-ion batteries (Xing et al., 2013). In (Rigamonti et al., 2017) a local ensemble of Echo State Networks (ESN) has been proposed to improve the RUL prediction accuracy of turbofan engines. The objective of this work is to predict the RUL of knives installed on Tetra Pak® A3/Flex filling machines used to cut package material. The prognostic task is complicated by the fact that few run-to-failure degradation trajectories are available, and a failure threshold is not available. To cope with these issues, this work proposes an ensemble formed by multiple datadriven direct RUL prediction models, capable of aggregating the RUL predictions for good performance throughout the entire degradation trajectory of a knife. Ensemble diversity is achieved by heterogeneous ensemble generation, i.e., by training the models using different prognostics algorithms. Aggregation is obtained by averaging the output of the individual base models with weights proportional to the inverse of their Empirical Generalization Error (EGE) on retrieved patterns in a validation set. The application of the proposed heterogeneous ensemble method to real condition monitoring knife data has shown to provide more accurate RUL prediction compared to that of each individual base learner in the ensemble. The paper is organized as follows: in Section 2, the objectives of this work and the assumptions are discussed; in Section 3, ensemble learning main concepts for data-driven direct RUL prediction are illustrated; in Section 4, performance metrics to compare different prognostic models are discussed. The application of the methodology to Tetra Pak® A3/Flex filling data is described in Section 5, whereas Section 6 draws the work conclusions. 2 ASSUMPTIONS AND OBJECTIVES We assume to have available run-to-failure degradation trajectories of N pieces of equipment similar to the one currently monitored (test equipment). Let xi(τi) ∈ R , i = 1, . . , N; τ = 1, ... , ni be the vector of m features extracted from signal measurements performed at time τi on the i equipment, with ni indicating the total number of data acquisitions performed on the i equipment before its failure. The ground truth RUL of the i piece pf equipment at time τi will be referred to as yi(τi), i = 1, ... , N; τi = 1, ... , ni . We consider a case in which the failure thresholds for the extracted features are not known. In this setting, fault prognostics is framed as a regression problem: given the historical dataset U formed by N realizations (degradation trajectories) {xi(τi), yi(τi), τi = 1, ... , ni}, i = 1, ... , N, of a stochastic process (X(τ), Y(τ)) ∈ Rx (0, +∞), our task is to find a function f: R → (0, +∞) such that it associates to a test pattern xtest(τtest) ∈ R , the corresponding output ytest(τtest). In what follows, we refer to f as base model or base learner (Zhou, 2012). 3 ENSEMBLE LEARNING FOR FAULT PROGNOSTICS In contrast to ordinary learning approaches which try to construct one base learner from training data, ensemble methods try to construct a set of learners f1̃, ... , f?̃? and combine them to obtain an ensemble learner fen?̃?. In this work, we consider combination of base learners based on weighted averaging (Zhou, 2012), i.e., the combined output fen?̃? is obtained by averaging the output of the individual learners with different weights αh, which implies that the different learners have different importance fen?̃?(x(τ)) = ∑ αh H
包装工业机械剩余使用寿命预测的异质集成方法
我们提出了一种基于异构集成学习的方法,用于预测包装工业中使用的刀具的剩余使用寿命(RUL)。集成多样性是通过使用不同的学习算法训练多个预测模型来实现的。集成中模型结果的组合基于加权平均策略,该策略根据验证集模式上的单个模型性能分配成比例的权重。该方法已应用于实际状态监测刀具数据。与每个单独的基本模型相比,它提供了更准确的RUL预测。超过设备不能执行其功能的辐射(失效阈值)。在基于退化的方法中使用的建模技术的例子有自回归模型(Gorjian等人,2009)、相关向量机(Di Maio等人,2012)和半马尔可夫模型(Cannarile等人,2017a) (Cannarile等人,2018)。相反,直接RUL预测方法通常采用机器学习技术,直接映射可观察参数与设备RUL之间的关系,而无需预测设备退化状态向故障阈值的演变(Schwabacher等人,2007)。直接RUL预测方法中使用的技术有人工神经网络(Wang & Vachtsenavos, 2001)、极限学习机(ELM) (Yang等人,2017)、高斯过程(GP) (Baraldi等人,2015b)等。当可用的运行到失效退化轨迹很少时,直接RUL方法可能会过拟合,即这些算法自定义过多,无法学习可观察参数与训练集中相应RUL之间的关系。因此,这些方法往往会失去泛化能力,从而导致在处理新数据时性能不佳。为了克服这个问题,引入了基于多个模型结果聚合的集成方法(Baraldi et al., 2013)。其基本思想是,集成中的各种模型通过利用各自的优势和克服各自的缺点来相互补充。因此,与单一模型相比,集成中各个模型结果的组合提高了预测的准确性(Brown et al., 2005) (Baraldi et al., 2013)。不同的方法,如人工神经网络(Baraldi et al., 2013)、支持向量机(SVM) (Liu et al., 2006)和核学习(Liu et al., 2015),已经成功地用于构建单个模型。例如,前馈人工神经网络(ANN)集成已被嵌入到粒子滤波器(PF)中,用于预测裂纹长度演变(Baraldi等人,2013),数据驱动回归模型集成已被用于锂离子电池的RUL预测(Xing等人,2013)。在(Rigamonti et al., 2017)中,提出了回声状态网络(ESN)的局部集成,以提高涡扇发动机的RUL预测精度。这项工作的目的是预测安装在利乐®A3/Flex灌装机上用于切割包装材料的刀具的RUL。由于很少有运行到故障的退化轨迹可用,并且没有可用的故障阈值,因此预测任务变得复杂。为了解决这些问题,这项工作提出了一个由多个数据驱动的直接RUL预测模型组成的集成,能够在刀具的整个降解轨迹中聚合RUL预测以获得良好的性能。集成多样性是通过异构集成生成来实现的,即通过使用不同的预测算法来训练模型。聚合是通过对单个基本模型的输出进行平均,其权重与验证集中检索模式的经验泛化误差(Empirical Generalization Error, EGE)的倒数成正比而获得的。本文提出的异构集成方法在实际状态监测刀具数据中的应用表明,与集成中的每个单独的基础学习器相比,该方法提供了更准确的RUL预测。本文的组织如下:在第2节中,讨论了本工作的目标和假设;在第3节中,说明了数据驱动的直接规则预测的集成学习主要概念;在第4节中,讨论了比较不同预后模型的性能指标。第5节描述了该方法在利乐®A3/Flex填充数据中的应用,而第6节则得出了工作结论。2假设和目标我们假设有N个设备的运行到失效退化轨迹,与当前监测的设备(测试设备)相似。令xi(τi)∈R, i = 1,。N;τ = 1,…
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信