多状态系统的最优任务中止和选择性替换策略

IF 11 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL
Xian Zhao , Zuheng Lv , Qingan Qiu , Yaguang Wu
{"title":"多状态系统的最优任务中止和选择性替换策略","authors":"Xian Zhao ,&nbsp;Zuheng Lv ,&nbsp;Qingan Qiu ,&nbsp;Yaguang Wu","doi":"10.1016/j.ress.2025.111366","DOIUrl":null,"url":null,"abstract":"<div><div>To mitigate the failure risk in safety-critical systems, it is beneficial to implement mission abort and rescue procedures when specific malfunction conditions are identified. Existing mission abort models predominantly focus on multi-state systems with binary-state components, often operating under the assumption that all failed components will be completely replaced after each rescue operation. However, many real-world engineering systems employ multi-state components, where replacing all failed components may not be the optimal approach due to constraints on replacement resources. Therefore, the design of effective mission abort and selective replacement policies for systems with multi-state components becomes imperative. Additionally, existing models for selective replacement primarily focus on the condition of system degradation, often overlooking the progress of missions, which can lead to suboptimal maintenance decisions, as it does not account for how mission progress and system performance interact with the demand for component replacement. This paper introduces dynamic condition-based mission abort and selective replacement policies for <em>k</em>-out-of-n: <em>F</em> systems with multi-state components, which dynamically assess the condition of system components’ state and mission execution. Mission success probability and system survivability are derived by employing recursive and discretization algorithms. We develop optimization models aimed at maximizing these probabilities while minimizing expected costs associated with maintenance and replacement actions. A case study involving a cloud computing system illustrates the advantages of the proposed policies, demonstrating their effectiveness in comparison to existing alternatives.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"264 ","pages":"Article 111366"},"PeriodicalIF":11.0000,"publicationDate":"2025-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal mission abort and selective replacement policies for multi-state systems\",\"authors\":\"Xian Zhao ,&nbsp;Zuheng Lv ,&nbsp;Qingan Qiu ,&nbsp;Yaguang Wu\",\"doi\":\"10.1016/j.ress.2025.111366\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>To mitigate the failure risk in safety-critical systems, it is beneficial to implement mission abort and rescue procedures when specific malfunction conditions are identified. Existing mission abort models predominantly focus on multi-state systems with binary-state components, often operating under the assumption that all failed components will be completely replaced after each rescue operation. However, many real-world engineering systems employ multi-state components, where replacing all failed components may not be the optimal approach due to constraints on replacement resources. Therefore, the design of effective mission abort and selective replacement policies for systems with multi-state components becomes imperative. Additionally, existing models for selective replacement primarily focus on the condition of system degradation, often overlooking the progress of missions, which can lead to suboptimal maintenance decisions, as it does not account for how mission progress and system performance interact with the demand for component replacement. This paper introduces dynamic condition-based mission abort and selective replacement policies for <em>k</em>-out-of-n: <em>F</em> systems with multi-state components, which dynamically assess the condition of system components’ state and mission execution. Mission success probability and system survivability are derived by employing recursive and discretization algorithms. We develop optimization models aimed at maximizing these probabilities while minimizing expected costs associated with maintenance and replacement actions. A case study involving a cloud computing system illustrates the advantages of the proposed policies, demonstrating their effectiveness in comparison to existing alternatives.</div></div>\",\"PeriodicalId\":54500,\"journal\":{\"name\":\"Reliability Engineering & System Safety\",\"volume\":\"264 \",\"pages\":\"Article 111366\"},\"PeriodicalIF\":11.0000,\"publicationDate\":\"2025-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Reliability Engineering & System Safety\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0951832025005678\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832025005678","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

摘要

为了降低安全关键系统的故障风险,在确定特定故障条件时实施任务中止和救援程序是有益的。现有的任务中止模型主要关注具有二元状态组件的多状态系统,通常在每次救援行动后所有失效组件将被完全替换的假设下运行。然而,许多现实世界的工程系统采用多状态组件,由于更换资源的限制,更换所有失效组件可能不是最佳方法。因此,针对多状态部件系统设计有效的任务中止和选择性替换策略势在必行。此外,现有的选择性更换模型主要关注系统退化的情况,经常忽略任务的进度,这可能导致次优维护决策,因为它没有考虑任务进度和系统性能如何与部件更换需求相互作用。针对具有多状态组件的k- of-n: F系统,引入了基于动态条件的任务中止和选择性替换策略,动态评估系统组件的状态和任务执行情况。采用递归和离散化算法推导任务成功概率和系统生存能力。我们开发了优化模型,旨在最大化这些概率,同时最小化与维护和更换行动相关的预期成本。一个涉及云计算系统的案例研究说明了所建议的策略的优点,并证明了与现有替代方案相比它们的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimal mission abort and selective replacement policies for multi-state systems
To mitigate the failure risk in safety-critical systems, it is beneficial to implement mission abort and rescue procedures when specific malfunction conditions are identified. Existing mission abort models predominantly focus on multi-state systems with binary-state components, often operating under the assumption that all failed components will be completely replaced after each rescue operation. However, many real-world engineering systems employ multi-state components, where replacing all failed components may not be the optimal approach due to constraints on replacement resources. Therefore, the design of effective mission abort and selective replacement policies for systems with multi-state components becomes imperative. Additionally, existing models for selective replacement primarily focus on the condition of system degradation, often overlooking the progress of missions, which can lead to suboptimal maintenance decisions, as it does not account for how mission progress and system performance interact with the demand for component replacement. This paper introduces dynamic condition-based mission abort and selective replacement policies for k-out-of-n: F systems with multi-state components, which dynamically assess the condition of system components’ state and mission execution. Mission success probability and system survivability are derived by employing recursive and discretization algorithms. We develop optimization models aimed at maximizing these probabilities while minimizing expected costs associated with maintenance and replacement actions. A case study involving a cloud computing system illustrates the advantages of the proposed policies, demonstrating their effectiveness in comparison to existing alternatives.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Reliability Engineering & System Safety
Reliability Engineering & System Safety 管理科学-工程:工业
CiteScore
15.20
自引率
39.50%
发文量
621
审稿时长
67 days
期刊介绍: Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信