Real-Time Integrated Learning and Decision Making for Cumulative Shock Degradation

Manuf. Serv. Oper. Manag. Pub Date : 2022-09-16 DOI:10.1287/msom.2022.1149

Collin Drent, M. Drent, J. Arts, S. Kapodistria

{"title":"Real-Time Integrated Learning and Decision Making for Cumulative Shock Degradation","authors":"Collin Drent, M. Drent, J. Arts, S. Kapodistria","doi":"10.1287/msom.2022.1149","DOIUrl":null,"url":null,"abstract":"Problem definition: Unexpected failures of equipment can have severe consequences and costs. Such unexpected failures can be prevented by performing preventive replacement based on real-time degradation data. We study a component that degrades according to a compound Poisson process and fails when the degradation exceeds the failure threshold. An online sensor measures the degradation in real time, but interventions are only possible during planned downtime. Academic/practical relevance: We characterize the optimal replacement policy that integrates real-time learning from the online sensor. We demonstrate the effectiveness in practice with a case study on interventional x-ray machines. The data set of this case study is available in the online companion. As such, it can serve as a benchmark data set for future studies on stochastically deteriorating systems. Methodology: The degradation parameters vary from one component to the next but cannot be observed directly; the component population is heterogeneous. These parameters must therefore be inferred by observing the real-time degradation signal. We model this situation as a partially observable Markov decision process (POMDP) so that decision making and learning are integrated. We collapse the information state space of this POMDP to three dimensions so that optimal policies can be analyzed and computed tractably. Results: The optimal policy is a state dependent control limit. The control limit increases with age but may decrease as a result of other information in the degradation signal. Numerical case study analyses reveal that integration of learning and decision making leads to cost reductions of 10.50% relative to approaches that do not learn from the real-time signal and 4.28% relative to approaches that separate learning and decision making. Managerial implications: Real-time sensor information can reduce the cost of maintenance and unplanned downtime by a considerable amount. The integration of learning and decision making is tractably possible for industrial systems with our state space collapse. Finally, the benefit of our model increases with the amount of data available for initial model calibration, whereas additional data are much less valuable for approaches that ignore population heterogeneity.","PeriodicalId":18108,"journal":{"name":"Manuf. Serv. Oper. Manag.","volume":"5 1","pages":"235-253"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Manuf. Serv. Oper. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/msom.2022.1149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Problem definition: Unexpected failures of equipment can have severe consequences and costs. Such unexpected failures can be prevented by performing preventive replacement based on real-time degradation data. We study a component that degrades according to a compound Poisson process and fails when the degradation exceeds the failure threshold. An online sensor measures the degradation in real time, but interventions are only possible during planned downtime. Academic/practical relevance: We characterize the optimal replacement policy that integrates real-time learning from the online sensor. We demonstrate the effectiveness in practice with a case study on interventional x-ray machines. The data set of this case study is available in the online companion. As such, it can serve as a benchmark data set for future studies on stochastically deteriorating systems. Methodology: The degradation parameters vary from one component to the next but cannot be observed directly; the component population is heterogeneous. These parameters must therefore be inferred by observing the real-time degradation signal. We model this situation as a partially observable Markov decision process (POMDP) so that decision making and learning are integrated. We collapse the information state space of this POMDP to three dimensions so that optimal policies can be analyzed and computed tractably. Results: The optimal policy is a state dependent control limit. The control limit increases with age but may decrease as a result of other information in the degradation signal. Numerical case study analyses reveal that integration of learning and decision making leads to cost reductions of 10.50% relative to approaches that do not learn from the real-time signal and 4.28% relative to approaches that separate learning and decision making. Managerial implications: Real-time sensor information can reduce the cost of maintenance and unplanned downtime by a considerable amount. The integration of learning and decision making is tractably possible for industrial systems with our state space collapse. Finally, the benefit of our model increases with the amount of data available for initial model calibration, whereas additional data are much less valuable for approaches that ignore population heterogeneity.

查看原文本刊更多论文

累积冲击退化的实时集成学习与决策

问题定义:设备的意外故障可能会产生严重的后果和成本。通过基于实时退化数据执行预防性更换，可以防止此类意外故障。我们研究了一种根据复合泊松过程退化的部件，当退化超过失效阈值时失效。在线传感器可以实时测量退化情况，但只有在计划停机期间才能进行干预。学术/实践相关性:我们描述了集成在线传感器实时学习的最佳替换策略。我们以介入x光机为例，证明了该方法的有效性。本案例研究的数据集可在在线同伴中获得。因此，它可以作为未来随机退化系统研究的基准数据集。方法:降解参数因组分而异，但不能直接观察;组件群是异构的。因此，这些参数必须通过观察实时退化信号来推断。我们将这种情况建模为部分可观察的马尔可夫决策过程(POMDP)，以便将决策和学习集成在一起。我们将该POMDP的信息状态空间分解为三维空间，从而可以跟踪地分析和计算最优策略。结果:最优策略为状态依赖控制极限。控制极限随年龄增长而增加，但可能由于退化信号中的其他信息而降低。数值案例研究分析表明，与不从实时信号中学习的方法相比，学习和决策相结合的方法成本降低了10.50%，与将学习和决策分离的方法相比，成本降低了4.28%。管理意义:实时传感器信息可以大大减少维护成本和计划外停机时间。对于状态空间崩溃的工业系统，学习与决策的集成是可行的。最后，我们的模型的好处随着初始模型校准可用数据量的增加而增加，而对于忽略群体异质性的方法来说，额外的数据价值要低得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Manuf. Serv. Oper. Manag.

自引率

0.00%

发文量