Jit fault detection: increasing availability in 1oo2 systems just-in-time

L. Botler, N. Kajtazovic, K. Diwold, K. Römer
{"title":"Jit fault detection: increasing availability in 1oo2 systems just-in-time","authors":"L. Botler, N. Kajtazovic, K. Diwold, K. Römer","doi":"10.1145/3407023.3407054","DOIUrl":null,"url":null,"abstract":"With silicon technology decreasing in size, memories get more susceptible to external influences, which can lead to soft errors. Although temporary, these errors constitute a challenge for safety-critical systems. Redundancy-based error detection is commonly used in industry to increase safety and mitigate these errors. When an error is detected, safety-critical systems are usually switched to a safe state. While this prevents failures, it negatively affects the system's availability. In this work, we propose Just-in-Time fault detection, a novel method which enables a system to be switched to the safe state only in case a detected error would affect the system's behavior. A software tool enabling the deployment of this method on an off-the-shelf processor is implemented, and the method is validated and compared with a state-of-the-art alternative approach using mixed-critical memories. Our results show an availability gain between 25.2% and 100% compared with the state-of-the-art approach while executing two different standard algorithms.","PeriodicalId":121225,"journal":{"name":"Proceedings of the 15th International Conference on Availability, Reliability and Security","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 15th International Conference on Availability, Reliability and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3407023.3407054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With silicon technology decreasing in size, memories get more susceptible to external influences, which can lead to soft errors. Although temporary, these errors constitute a challenge for safety-critical systems. Redundancy-based error detection is commonly used in industry to increase safety and mitigate these errors. When an error is detected, safety-critical systems are usually switched to a safe state. While this prevents failures, it negatively affects the system's availability. In this work, we propose Just-in-Time fault detection, a novel method which enables a system to be switched to the safe state only in case a detected error would affect the system's behavior. A software tool enabling the deployment of this method on an off-the-shelf processor is implemented, and the method is validated and compared with a state-of-the-art alternative approach using mixed-critical memories. Our results show an availability gain between 25.2% and 100% compared with the state-of-the-art approach while executing two different standard algorithms.
Jit故障检测:实时提高102个系统的可用性
随着硅技术尺寸的减小,存储器更容易受到外部影响,这可能导致软错误。虽然是暂时的,但这些错误对安全关键系统构成了挑战。基于冗余的错误检测通常用于工业中,以提高安全性并减轻这些错误。当检测到错误时,对安全至关重要的系统通常会切换到安全状态。虽然这可以防止故障,但它会对系统的可用性产生负面影响。在这项工作中,我们提出了即时故障检测,这是一种新颖的方法,只有在检测到的错误会影响系统行为的情况下,系统才能切换到安全状态。实现了一个软件工具,可以在现成的处理器上部署该方法,并对该方法进行了验证,并与使用混合关键存储器的最先进的替代方法进行了比较。我们的结果表明,在执行两种不同的标准算法时,与最先进的方法相比,可用性增益在25.2%到100%之间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信