S. Garg, A. Puliafito, M. Telek, Kishor S. Trivedi
{"title":"Analysis of software rejuvenation using Markov Regenerative Stochastic Petri Net","authors":"S. Garg, A. Puliafito, M. Telek, Kishor S. Trivedi","doi":"10.1109/ISSRE.1995.497656","DOIUrl":null,"url":null,"abstract":"In a client-server type system, the server software is required to run continuously for very long periods. Due to repeated and potentially faulty usage by many clients, such software \"ages\" with time and eventually fails. (Huang et al., 1995) proposed a technique called \"software rejuvenation\" in which the software is periodically stopped and then restarted in a \"robust\" state after proper maintenance. This \"renewal\" of software prevents (or at least postpones) the crash failure. As the time lost (or the cost incurred) due to the software failure is typically more than the time lost (or the cost incurred) due to rejuvenation, the technique reduces the expected unavailability of the software. We present a quantitative analysis of software rejuvenation. The behavior of the system is represented through a Markov Regenerative Stochastic Petri Net (MRSPN) model which is solved both for steady state as well as transient conditions. We provide a closed-form analytical solution for the steady state expected down time (and the expected cost incurred) due to system unavailability. We also evaluate the optimal rejuvenation interval which minimizes the expected unavailability of the software.","PeriodicalId":408394,"journal":{"name":"Proceedings of Sixth International Symposium on Software Reliability Engineering. ISSRE'95","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"232","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of Sixth International Symposium on Software Reliability Engineering. ISSRE'95","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSRE.1995.497656","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 232
Abstract
In a client-server type system, the server software is required to run continuously for very long periods. Due to repeated and potentially faulty usage by many clients, such software "ages" with time and eventually fails. (Huang et al., 1995) proposed a technique called "software rejuvenation" in which the software is periodically stopped and then restarted in a "robust" state after proper maintenance. This "renewal" of software prevents (or at least postpones) the crash failure. As the time lost (or the cost incurred) due to the software failure is typically more than the time lost (or the cost incurred) due to rejuvenation, the technique reduces the expected unavailability of the software. We present a quantitative analysis of software rejuvenation. The behavior of the system is represented through a Markov Regenerative Stochastic Petri Net (MRSPN) model which is solved both for steady state as well as transient conditions. We provide a closed-form analytical solution for the steady state expected down time (and the expected cost incurred) due to system unavailability. We also evaluate the optimal rejuvenation interval which minimizes the expected unavailability of the software.
在客户机-服务器类型的系统中,服务器软件需要长时间连续运行。由于许多客户端的重复使用和潜在的错误,这样的软件会随着时间“老化”并最终失败。(Huang et al., 1995)提出了一种称为“软件复兴”的技术,即定期停止软件,然后在适当维护后以“健壮”状态重新启动软件。这种软件的“更新”防止(或至少推迟)崩溃失败。由于由于软件故障而损失的时间(或产生的成本)通常比由于恢复而损失的时间(或产生的成本)要多,因此该技术减少了软件的预期不可用性。我们提出了软件复兴的定量分析。系统的行为通过马尔可夫再生随机Petri网(MRSPN)模型来表示,该模型同时求解稳态和瞬态条件。我们为由于系统不可用而导致的稳态预期停机时间(和预期成本)提供了一个封闭形式的分析解决方案。我们还评估了使软件的预期不可用性最小化的最佳恢复间隔。