Applicability of Algorithm Evaluation Metrics for Predictive Maintenance in Production Systems
Hendrik Engbers, A. Alla, Markus Kreutz, M. Freitag
2020 6th IEEE Congress on Information Science and Technology (CiSt)
DOI: 10.1109/CiSt49399.2021.9357277
Publication date: 2020-06-05
Citations: 1
Abstract
Algorithm evaluation metrics are used to measure and compare the performance of diagnostic and prognostic algorithms. However, there is no consensus on which of the various metrics is most suitable for evaluating classifier performance. This paper examines the applicability of common evaluation metrics for predictive maintenance applications in production systems. It is intended to clarify (1) which metrics are well suited for this use case, and (2) whether they are sufficient as a sole decision criterion for algorithm selection. Further, (3) the significance of evaluation metrics with respect to their practical impact on the production system's performance is investigated. Moreover, we analyze (4) how increasing the production system's complexity affects the correlation between algorithm evaluation metrics and performance. We conducted 960 simulation runs of a flexible flow shop production system to examine the impact of machine breakdowns and different failure distributions with varying confusion matrices. In addition, two configurations of the production system, which differ in system complexity, were investigated. We then determined the product-moment correlation coefficient between common evaluation metrics and the production system's performance. The simulation results reveal that metrics that are sensitive to false negatives (FN), such as the False Omission Rate (FOR), are very well suited. Nevertheless, a detailed analysis shows that even the FOR can provide reliable statements only when combined with other metrics. We found that as the system becomes more complex, the informative value of the metrics in terms of their impact on the production system's performance decreases. Finally, we propose parameters that could be relevant for developing new metrics that consider the current system configuration.
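The two quantities at the heart of the abstract can be sketched in a few lines: the False Omission Rate computed from a binary confusion matrix, and the Pearson product-moment correlation between a metric and a production-performance measure. This is an illustrative sketch, not code from the paper; the function names and the example values are hypothetical.

```python
def false_omission_rate(fn, tn):
    """FOR = FN / (FN + TN): the share of predicted negatives that are
    actually positive (missed failures in a predictive-maintenance setting)."""
    return fn / (fn + tn)

def pearson_r(xs, ys):
    """Product-moment correlation coefficient between two samples."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

# Hypothetical data: FOR values of four classifiers and a corresponding
# production-performance measure (e.g. throughput in parts per shift).
for_values = [0.02, 0.05, 0.10, 0.20]
throughput = [980, 950, 900, 800]

print(round(false_omission_rate(fn=5, tn=95), 3))   # 0.05
print(round(pearson_r(for_values, throughput), 3))  # -1.0 (perfectly linear toy data)
```

In the toy data the throughput is an exact linear function of the FOR, so the coefficient is -1.0; the paper's point is precisely that in realistic, more complex production systems this correlation weakens.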