{"title":"Distributed reconfiguration and recovery in the advanced architecture on-board processor","authors":"M. Iacoponi, S. McDonald","doi":"10.1109/FTCS.1991.146698","DOIUrl":null,"url":null,"abstract":"The reconfiguration and recovery approach employed in the advanced architecture on-board processor (AAOP), a fault-tolerant multiprocessor for space applications, is presented. The AAOP is designed to accommodate large numbers of processing elements organized in a distributed fault-tolerant system. The operation of distributed reconfiguration is discussed, and the recovery time is analyzed. Performance of the reconfiguration algorithm under inconsistent observer conditions and against multiple faults is considered. the chordal skiplink ring topology employed in the AAOP is analyzed with respect to its node-pair distance distribution as a function of the number of faults injected. Extensions of this topology are also considered. These would reduce the network diameter and increase fault robustness but also increase the network management overhead.<<ETX>>","PeriodicalId":300397,"journal":{"name":"[1991] Digest of Papers. Fault-Tolerant Computing: The Twenty-First International Symposium","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1991] Digest of Papers. Fault-Tolerant Computing: The Twenty-First International Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FTCS.1991.146698","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
The reconfiguration and recovery approach employed in the advanced architecture on-board processor (AAOP), a fault-tolerant multiprocessor for space applications, is presented. The AAOP is designed to accommodate large numbers of processing elements organized in a distributed fault-tolerant system. The operation of distributed reconfiguration is discussed, and the recovery time is analyzed. Performance of the reconfiguration algorithm under inconsistent observer conditions and against multiple faults is considered. the chordal skiplink ring topology employed in the AAOP is analyzed with respect to its node-pair distance distribution as a function of the number of faults injected. Extensions of this topology are also considered. These would reduce the network diameter and increase fault robustness but also increase the network management overhead.<>