{"title":"Increasing the Efficiency and Efficacy of Selective-Hardening for Parallel Applications","authors":"Daniel Oliveira, P. Navaux, P. Rech","doi":"10.1109/DFT.2019.8875300","DOIUrl":null,"url":null,"abstract":"Selective hardening is a promising solution to efficiently improve the reliability of high performance and safety-critical real-time applications. One of the most significant challenges of selective hardening is to choose how many resources or code portions to protect to avoid unnecessary system performances degradation (at least 2x overhead for the naive duplication). In this paper, we propose a selective hardening strategy for parallel algorithms. We first identify through extensive fault-injection campaigns the code portions whose protection significantly increases the algorithm reliability. Then, we select the code portions that, once protected, maximize the reliability/overhead ratio. We can achieve fault coverage as high as 60% with a 3% overhead. We show that the hardening efficiency can be higher than 90% when compared to naive full duplication.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DFT.2019.8875300","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Selective hardening is a promising solution to efficiently improve the reliability of high performance and safety-critical real-time applications. One of the most significant challenges of selective hardening is to choose how many resources or code portions to protect to avoid unnecessary system performances degradation (at least 2x overhead for the naive duplication). In this paper, we propose a selective hardening strategy for parallel algorithms. We first identify through extensive fault-injection campaigns the code portions whose protection significantly increases the algorithm reliability. Then, we select the code portions that, once protected, maximize the reliability/overhead ratio. We can achieve fault coverage as high as 60% with a 3% overhead. We show that the hardening efficiency can be higher than 90% when compared to naive full duplication.