容错系统的优化设计配置

S. Amari
{"title":"容错系统的优化设计配置","authors":"S. Amari","doi":"10.1109/RAMS.2013.6517667","DOIUrl":null,"url":null,"abstract":"Fault tolerance is an essential architectural attribute for achieving high reliability in many critical applications of digital systems. Automatic recovery and reconfiguration mechanisms play a crucial role in implementing fault tolerance because an uncovered fault may lead to a system or subsystem failure even when adequate redundancy exists. An excessive level of redundancy may even reduce the system reliability in addition to consuming system resources. Therefore, an accurate reliability analysis must account for not only the system structure but also the system fault and error handling behavior. The models that capture the fault and error handling behavior are called coverage models. The appropriate coverage modeling approach depends on the type of fault-tolerant techniques used. This paper describes and demonstrates a solution methodology that determines optimal design configurations that maximize the reliability of fault-tolerant systems subject to imperfect fault coverage and resource constraints. It is assumed that the system consists of several subsystems in series where each subsystem contains multiple redundant components. The problem formulation considers the generic type of fault-tolerant mechanisms and associated coverage models for each subsystem. The objective of the optimal design is to select the design configuration, type of components, and fault-tolerant mechanism for each subsystem from the applicable/available choices. Optimal solutions are determined based on an equivalent problem formulation and integer programming. The methodology presented here is flexible and can accurately model a wide range of faulttolerant systems used in safety-critical applications. The methodology is successfully demonstrated on a large problem with 14 subsystems and 4 component choices for each subsystem.","PeriodicalId":189714,"journal":{"name":"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Optimal design configurations of fault-tolerant systems\",\"authors\":\"S. Amari\",\"doi\":\"10.1109/RAMS.2013.6517667\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fault tolerance is an essential architectural attribute for achieving high reliability in many critical applications of digital systems. Automatic recovery and reconfiguration mechanisms play a crucial role in implementing fault tolerance because an uncovered fault may lead to a system or subsystem failure even when adequate redundancy exists. An excessive level of redundancy may even reduce the system reliability in addition to consuming system resources. Therefore, an accurate reliability analysis must account for not only the system structure but also the system fault and error handling behavior. The models that capture the fault and error handling behavior are called coverage models. The appropriate coverage modeling approach depends on the type of fault-tolerant techniques used. This paper describes and demonstrates a solution methodology that determines optimal design configurations that maximize the reliability of fault-tolerant systems subject to imperfect fault coverage and resource constraints. It is assumed that the system consists of several subsystems in series where each subsystem contains multiple redundant components. The problem formulation considers the generic type of fault-tolerant mechanisms and associated coverage models for each subsystem. The objective of the optimal design is to select the design configuration, type of components, and fault-tolerant mechanism for each subsystem from the applicable/available choices. Optimal solutions are determined based on an equivalent problem formulation and integer programming. The methodology presented here is flexible and can accurately model a wide range of faulttolerant systems used in safety-critical applications. The methodology is successfully demonstrated on a large problem with 14 subsystems and 4 component choices for each subsystem.\",\"PeriodicalId\":189714,\"journal\":{\"name\":\"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAMS.2013.6517667\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAMS.2013.6517667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

在数字系统的许多关键应用中,容错是实现高可靠性的基本架构属性。自动恢复和重新配置机制在实现容错方面起着至关重要的作用,因为即使存在足够的冗余,未发现的故障也可能导致系统或子系统故障。过多的冗余不仅会消耗系统资源,还会降低系统的可靠性。因此,准确的可靠性分析不仅要考虑系统结构,还要考虑系统故障和错误处理行为。捕获故障和错误处理行为的模型称为覆盖模型。适当的覆盖率建模方法取决于所使用的容错技术的类型。本文描述并演示了一种解决方案方法,该方法确定了在不完全故障覆盖和资源约束下使容错系统可靠性最大化的最佳设计配置。假设系统由多个串联子系统组成,每个子系统包含多个冗余组件。问题的表述考虑了容错机制的一般类型和每个子系统的相关覆盖模型。优化设计的目标是从适用/可用的选择中为每个子系统选择设计配置、组件类型和容错机制。基于等效问题公式和整数规划确定最优解。这里提出的方法是灵活的,可以准确地为安全关键应用中使用的各种容错系统建模。该方法在一个有14个子系统的大问题上得到了成功的验证,每个子系统有4个组件选择。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimal design configurations of fault-tolerant systems
Fault tolerance is an essential architectural attribute for achieving high reliability in many critical applications of digital systems. Automatic recovery and reconfiguration mechanisms play a crucial role in implementing fault tolerance because an uncovered fault may lead to a system or subsystem failure even when adequate redundancy exists. An excessive level of redundancy may even reduce the system reliability in addition to consuming system resources. Therefore, an accurate reliability analysis must account for not only the system structure but also the system fault and error handling behavior. The models that capture the fault and error handling behavior are called coverage models. The appropriate coverage modeling approach depends on the type of fault-tolerant techniques used. This paper describes and demonstrates a solution methodology that determines optimal design configurations that maximize the reliability of fault-tolerant systems subject to imperfect fault coverage and resource constraints. It is assumed that the system consists of several subsystems in series where each subsystem contains multiple redundant components. The problem formulation considers the generic type of fault-tolerant mechanisms and associated coverage models for each subsystem. The objective of the optimal design is to select the design configuration, type of components, and fault-tolerant mechanism for each subsystem from the applicable/available choices. Optimal solutions are determined based on an equivalent problem formulation and integer programming. The methodology presented here is flexible and can accurately model a wide range of faulttolerant systems used in safety-critical applications. The methodology is successfully demonstrated on a large problem with 14 subsystems and 4 component choices for each subsystem.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信