{"title":"Probabilistic diagnosis in wafer-scale systems","authors":"Arun Kumar Somani, J. Wang","doi":"10.1109/DFTVS.1992.224378","DOIUrl":null,"url":null,"abstract":"Studies fault diagnosis based on a realistic probabilistic model for wafer-scale multiprocessor systems. In this model, an individual processor fails independently with probability p. The authors use a comparison testing approach. The testing is performed in multiple stages by the processors. They assume that different testing tasks are executed in different stages, and the coverage of each testing task is imperfect and is represented by a parameter c/sub v/. Imperfect coverage can be used to model intermittent faults where individual test may be incapable of detecting a fault. The authors present an efficient distributed self-diagnosis algorithm that probabilistically identifies faulty and fault free units. They show that our algorithm achieves very high accuracy even when the system is sparsely interconnected, and a large number of faulty units are present in the system.<<ETX>>","PeriodicalId":319218,"journal":{"name":"Proceedings 1992 IEEE International Workshop on Defect and Fault Tolerance in VLSI Systems","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 1992 IEEE International Workshop on Defect and Fault Tolerance in VLSI Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DFTVS.1992.224378","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Studies fault diagnosis based on a realistic probabilistic model for wafer-scale multiprocessor systems. In this model, an individual processor fails independently with probability p. The authors use a comparison testing approach. The testing is performed in multiple stages by the processors. They assume that different testing tasks are executed in different stages, and the coverage of each testing task is imperfect and is represented by a parameter c/sub v/. Imperfect coverage can be used to model intermittent faults where individual test may be incapable of detecting a fault. The authors present an efficient distributed self-diagnosis algorithm that probabilistically identifies faulty and fault free units. They show that our algorithm achieves very high accuracy even when the system is sparsely interconnected, and a large number of faulty units are present in the system.<>