A new methodology for calculating distributions of reward accumulated during a finite interval

Proceedings of Annual Symposium on Fault Tolerant Computing Pub Date : 1996-06-25 DOI:10.1109/FTCS.1996.534600

M. Qureshi, W. Sanders

{"title":"A new methodology for calculating distributions of reward accumulated during a finite interval","authors":"M. Qureshi, W. Sanders","doi":"10.1109/FTCS.1996.534600","DOIUrl":null,"url":null,"abstract":"Markov reward models are an important formalism by which to obtain dependability and performability measures of computer systems and networks. In this context, it is particularly important to determine the probability distribution function of the reward accumulated during a finite interval. The interval may correspond to the mission period in a mission-critical system, the time between scheduled maintenances, or a warranty period. In such models, changes in state correspond to changes in system structure (due to faults and repairs), and the reward structure depends on the measure of interest. For example, the reward rates may represent a productivity rate while in that state, if performability is considered, or the binary values zero and one, if interval availability is of interest. We present a new methodology to calculate the distribution of reward accumulated over a finite interval. In particular, we derive recursive expressions for the distribution of reward accumulated given that a particular sequence of state changes occurs during the interval, and we explore paths one at a time. The expressions for conditional accumulated reward are new and are numerically stable. In addition, by exploring paths individually, we avoid the memory growth problems experienced when applying previous approaches to large models. The utility of the methodology is illustrated via application to a realistic fault-tolerant multiprocessor model with over half a million states.","PeriodicalId":191163,"journal":{"name":"Proceedings of Annual Symposium on Fault Tolerant Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of Annual Symposium on Fault Tolerant Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FTCS.1996.534600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 32

Abstract

Markov reward models are an important formalism by which to obtain dependability and performability measures of computer systems and networks. In this context, it is particularly important to determine the probability distribution function of the reward accumulated during a finite interval. The interval may correspond to the mission period in a mission-critical system, the time between scheduled maintenances, or a warranty period. In such models, changes in state correspond to changes in system structure (due to faults and repairs), and the reward structure depends on the measure of interest. For example, the reward rates may represent a productivity rate while in that state, if performability is considered, or the binary values zero and one, if interval availability is of interest. We present a new methodology to calculate the distribution of reward accumulated over a finite interval. In particular, we derive recursive expressions for the distribution of reward accumulated given that a particular sequence of state changes occurs during the interval, and we explore paths one at a time. The expressions for conditional accumulated reward are new and are numerically stable. In addition, by exploring paths individually, we avoid the memory growth problems experienced when applying previous approaches to large models. The utility of the methodology is illustrated via application to a realistic fault-tolerant multiprocessor model with over half a million states.

查看原文本刊更多论文

一种计算有限时间内累积奖励分配的新方法

马尔可夫奖励模型是衡量计算机系统和网络可靠性和可执行性的一种重要形式。在这种情况下，确定有限时间内累积奖励的概率分布函数就显得尤为重要。这个时间间隔可以对应于关键任务系统的任务周期、计划维护之间的时间间隔或保修期。在这样的模型中，状态的变化对应于系统结构的变化(由于故障和修复)，奖励结构取决于兴趣的度量。例如，如果考虑可执行性，奖励率可能表示在该状态下的生产率，或者如果关心间隔可用性，奖励率可能表示二进制值0和1。提出了一种计算有限区间内累积奖励分布的新方法。特别地，我们推导出了在给定区间内发生的特定状态变化序列的累积奖励分布的递归表达式，并一次探索一条路径。条件累积奖励的表达式是新的，在数值上是稳定的。此外，通过单独探索路径，我们避免了将以前的方法应用于大型模型时遇到的内存增长问题。通过一个实际的超过50万个状态的容错多处理器模型，说明了该方法的实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of Annual Symposium on Fault Tolerant Computing

自引率

0.00%

发文量