Boosting Microprocessor Efficiency: Circuit- and Workload-Aware Assessment of Timing Errors

Ioannis Tsiokanos, G. Papadimitriou, D. Gizopoulos, G. Karakonstantis
{"title":"Boosting Microprocessor Efficiency: Circuit- and Workload-Aware Assessment of Timing Errors","authors":"Ioannis Tsiokanos, G. Papadimitriou, D. Gizopoulos, G. Karakonstantis","doi":"10.1109/IISWC53511.2021.00022","DOIUrl":null,"url":null,"abstract":"Aggressive technology scaling and increased static and dynamic variability caused by process, temperature, voltage, and aging effects make nanometer circuits prone to timing errors which threaten system functionality. Accurately evaluating the impact of those circuit-level errors on the resilience of a CPU and the executed applications remains a first-class design issue. However, existing error assessment frameworks fail to accurately model the effects of timing errors because they neglect microarchitecture- and workload-dependent parameters that critically affect the error manifestation and propagation. This paper provides a novel, cross-layer framework that addresses the lack of a holistic methodology for the understanding of the full system impact of hardware timing errors as they propagate from the circuit-level through the microarchitecture up to the application software. The proposed microarchitecture-aware tool is able to realistically inject timing errors considering circuit and workload features, accurately assessing timing error effects on any application binary. We estimate the location (bit position and instruction) and the time (cycle) of the injected errors via a workload-aware error model which relies on post place-and-route dynamic timing analysis. We also leverage microarchitectural error injection to access the timing error reliability of a widely deployed pipelined processor under several workloads and voltage reduction levels. To evaluate the proposed tool, our fully automated toolflow is also configured to support timing error injection based on existing workload-agnostic error models. Evaluation results for various workloads and voltage reduction levels, show that our circuit- and workload-aware error injection model improves the accuracy of the error injection ratio by ~ 250× on average compared to workload-agnostic models. Finally, we quantify the degree to which various applications are prone to timing errors using an application vulnerability metric that can be used early in the design cycle to guide the adoption of energy-efficient error mitigation strategies.","PeriodicalId":203713,"journal":{"name":"2021 IEEE International Symposium on Workload Characterization (IISWC)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Symposium on Workload Characterization (IISWC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISWC53511.2021.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Aggressive technology scaling and increased static and dynamic variability caused by process, temperature, voltage, and aging effects make nanometer circuits prone to timing errors which threaten system functionality. Accurately evaluating the impact of those circuit-level errors on the resilience of a CPU and the executed applications remains a first-class design issue. However, existing error assessment frameworks fail to accurately model the effects of timing errors because they neglect microarchitecture- and workload-dependent parameters that critically affect the error manifestation and propagation. This paper provides a novel, cross-layer framework that addresses the lack of a holistic methodology for the understanding of the full system impact of hardware timing errors as they propagate from the circuit-level through the microarchitecture up to the application software. The proposed microarchitecture-aware tool is able to realistically inject timing errors considering circuit and workload features, accurately assessing timing error effects on any application binary. We estimate the location (bit position and instruction) and the time (cycle) of the injected errors via a workload-aware error model which relies on post place-and-route dynamic timing analysis. We also leverage microarchitectural error injection to access the timing error reliability of a widely deployed pipelined processor under several workloads and voltage reduction levels. To evaluate the proposed tool, our fully automated toolflow is also configured to support timing error injection based on existing workload-agnostic error models. Evaluation results for various workloads and voltage reduction levels, show that our circuit- and workload-aware error injection model improves the accuracy of the error injection ratio by ~ 250× on average compared to workload-agnostic models. Finally, we quantify the degree to which various applications are prone to timing errors using an application vulnerability metric that can be used early in the design cycle to guide the adoption of energy-efficient error mitigation strategies.
提高微处理器效率:电路和工作负载感知的时序误差评估
由于工艺、温度、电压和老化效应导致的侵略性技术缩放和增加的静态和动态可变性,使得纳米电路容易出现威胁系统功能的定时误差。准确评估这些电路级错误对CPU弹性和执行的应用程序的影响仍然是头等设计问题。然而,现有的错误评估框架无法准确地模拟计时错误的影响,因为它们忽略了与微架构和工作负载相关的参数,这些参数严重影响错误的表现和传播。本文提供了一个新颖的跨层框架,解决了缺乏整体方法来理解硬件时序错误从电路级通过微体系结构传播到应用软件时的完整系统影响的问题。所提出的微体系结构感知工具能够考虑电路和工作负载特征,真实地注入时序误差,准确地评估时序误差对任何应用程序二进制文件的影响。我们通过工作负载感知误差模型来估计注入误差的位置(位和指令)和时间(周期),该模型依赖于位置和路径动态时序分析。我们还利用微架构错误注入来访问广泛部署的流水线处理器在多个工作负载和电压降低水平下的定时错误可靠性。为了评估建议的工具,我们的全自动工具流也被配置为支持基于现有工作负载无关错误模型的定时错误注入。对各种工作负载和电压降低水平的评估结果表明,与工作负载无关的模型相比,我们的电路和工作负载感知错误注入模型的错误注入比精度平均提高了约250倍。最后,我们使用应用程序漏洞度量来量化各种应用程序容易出现时间错误的程度,该度量可以在设计周期的早期使用,以指导采用节能的错误缓解策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信