{"title":"基于点对点通信的实时系统的高覆盖容错","authors":"K. Kim, C. Subbaraman, E. Shokri","doi":"10.1109/HASE.1997.648053","DOIUrl":null,"url":null,"abstract":"The distributed recovery block (DRB) scheme is a widely applicable approach for realizing both hardware and software fault tolerance in real-time distributed and parallel computer systems. One of the most important extensions of the DRB scheme which has been outlined in recent years (but not developed fully) is the integration of the DRB scheme and a network surveillance (NS) scheme. We have developed an NS scheme that is effective in a variety of point-to-point networks, called the supervisor-based NS (SNS) scheme. In this paper, we present an integration of the DRB scheme with the SNS scheme, called the DRB/SNS scheme. This scheme is a significant improvement over the previous versions of the DRB scheme with respect to the fault coverage and recovery time bound achieved in those systems that are based on point-to-point networks. The execution support for the integrated scheme has been implemented as a part of the DREAM kernel prototype, a timeliness-guaranteed operating system kernel developed at the University of California, Irvine. The recovery time bound of the DRB/SNS scheme is analyzed on the basis of the prototype implementation.","PeriodicalId":319609,"journal":{"name":"Proceedings 1997 High-Assurance Engineering Workshop","volume":"197 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High-coverage fault tolerance in real-time systems based on point-to-point communication\",\"authors\":\"K. Kim, C. Subbaraman, E. Shokri\",\"doi\":\"10.1109/HASE.1997.648053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The distributed recovery block (DRB) scheme is a widely applicable approach for realizing both hardware and software fault tolerance in real-time distributed and parallel computer systems. One of the most important extensions of the DRB scheme which has been outlined in recent years (but not developed fully) is the integration of the DRB scheme and a network surveillance (NS) scheme. We have developed an NS scheme that is effective in a variety of point-to-point networks, called the supervisor-based NS (SNS) scheme. In this paper, we present an integration of the DRB scheme with the SNS scheme, called the DRB/SNS scheme. This scheme is a significant improvement over the previous versions of the DRB scheme with respect to the fault coverage and recovery time bound achieved in those systems that are based on point-to-point networks. The execution support for the integrated scheme has been implemented as a part of the DREAM kernel prototype, a timeliness-guaranteed operating system kernel developed at the University of California, Irvine. The recovery time bound of the DRB/SNS scheme is analyzed on the basis of the prototype implementation.\",\"PeriodicalId\":319609,\"journal\":{\"name\":\"Proceedings 1997 High-Assurance Engineering Workshop\",\"volume\":\"197 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1997-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 1997 High-Assurance Engineering Workshop\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HASE.1997.648053\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 1997 High-Assurance Engineering Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HASE.1997.648053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
High-coverage fault tolerance in real-time systems based on point-to-point communication
The distributed recovery block (DRB) scheme is a widely applicable approach for realizing both hardware and software fault tolerance in real-time distributed and parallel computer systems. One of the most important extensions of the DRB scheme which has been outlined in recent years (but not developed fully) is the integration of the DRB scheme and a network surveillance (NS) scheme. We have developed an NS scheme that is effective in a variety of point-to-point networks, called the supervisor-based NS (SNS) scheme. In this paper, we present an integration of the DRB scheme with the SNS scheme, called the DRB/SNS scheme. This scheme is a significant improvement over the previous versions of the DRB scheme with respect to the fault coverage and recovery time bound achieved in those systems that are based on point-to-point networks. The execution support for the integrated scheme has been implemented as a part of the DREAM kernel prototype, a timeliness-guaranteed operating system kernel developed at the University of California, Irvine. The recovery time bound of the DRB/SNS scheme is analyzed on the basis of the prototype implementation.