{"title":"Tolerating Soft Errors in Deep Learning Accelerators with Reliable On-Chip Memory Designs","authors":"Arash AziziMazreah, Yongbin Gu, Xiang Gu, Lizhong Chen","doi":"10.1109/NAS.2018.8515692","DOIUrl":null,"url":null,"abstract":"Deep learning neural network (DNN) accelerators have been increasingly deployed in many fields recently, including safety-critical applications such as autonomous vehicles and unmanned aircrafts. Meanwhile, the vulnerability of DNN accelerators to soft errors (e.g., caused by high-energy particle strikes) rapidly increases as manufacturing technology continues to scale down. A failure in the operation of DNN accelerators may lead to catastrophic consequences. Among the existing reliability techniques that can be applied to DNN accelerators, fully-hardened SRAM cells are more attractive due to their low overhead in terms of area, power and delay. However, current fully-hardened SRAM cells can only tolerate soft errors produced by single-node-upsets (SNUs), and cannot fully resist the soft errors caused by multiple-node-upsets (MNUs). In this paper, a Zero-Biased MNU-Aware SRAM Cell (ZBMA) is proposed for DNN accelerators based on two observations: first, the data (feature maps, weights) in DNNs has a strong bias towards zero; second, data flipping from zero to one is more likely to cause a failure of DNN outputs. The proposed memory cell provides a robust immunity against node upsets, and reduces the leakage current dramatically when zero is stored in the cell. Evaluation results show that when the proposed memory cell is integrated in a DNN accelerator, the total static power of the accelerator is reduced by 2.6X and 1.79X compared with the one based on the conventional and on state-of-the-art full-hardened memory cells, respectively. In terms of reliability, the DNN accelerator based on the proposed memory cell can reduce 99.99% of false outputs caused by soft errors across different DNNs.","PeriodicalId":115970,"journal":{"name":"2018 IEEE International Conference on Networking, Architecture and Storage (NAS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Networking, Architecture and Storage (NAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAS.2018.8515692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Tolerating Soft Errors in Deep Learning Accelerators with Reliable On-Chip Memory Designs
Deep neural network (DNN) accelerators have been increasingly deployed in many fields, including safety-critical applications such as autonomous vehicles and unmanned aircraft. Meanwhile, the vulnerability of DNN accelerators to soft errors (e.g., those caused by high-energy particle strikes) rises rapidly as manufacturing technology continues to scale down. A failure in the operation of a DNN accelerator may lead to catastrophic consequences. Among the existing reliability techniques that can be applied to DNN accelerators, fully-hardened SRAM cells are particularly attractive due to their low overhead in area, power, and delay. However, current fully-hardened SRAM cells can only tolerate soft errors produced by single-node upsets (SNUs) and cannot fully resist soft errors caused by multiple-node upsets (MNUs). In this paper, a Zero-Biased MNU-Aware SRAM cell (ZBMA) is proposed for DNN accelerators based on two observations: first, the data (feature maps and weights) in DNNs are strongly biased towards zero; second, a bit flipping from zero to one is more likely to cause a failure at the DNN output. The proposed memory cell provides robust immunity against node upsets and dramatically reduces leakage current when a zero is stored in the cell. Evaluation results show that when the proposed memory cell is integrated into a DNN accelerator, the total static power of the accelerator is reduced by 2.6X and 1.79X compared with accelerators based on conventional and state-of-the-art fully-hardened memory cells, respectively. In terms of reliability, a DNN accelerator based on the proposed memory cell eliminates 99.99% of the false outputs caused by soft errors across different DNNs.
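The two observations behind ZBMA can be illustrated with a small fault-injection experiment. The sketch below is illustrative only and is not the paper's evaluation setup: it assumes a toy fully-connected layer with 8-bit unsigned quantized ReLU activations in NumPy, and the helper flip_random_bit is a hypothetical name introduced here. It shows that the quantized activations are dominated by zero bytes, and that single-bit upsets flipping a stored 0 to 1 perturb the layer output far more than 1-to-0 upsets.

```python
# Illustrative fault-injection sketch (not the paper's methodology) for the two
# observations behind ZBMA: (1) DNN data are heavily biased toward zero, and
# (2) 0->1 bit flips disturb outputs more than 1->0 flips.
# Assumes a toy 8-bit quantized fully-connected layer built with NumPy.
import numpy as np

rng = np.random.default_rng(0)

# Observation 1: ReLU-style activations are sparse, so many stored bytes are zero.
acts = np.maximum(rng.normal(0.0, 1.0, size=4096), 0.0)
q_acts = np.clip(np.round(acts / acts.max() * 127), 0, 127).astype(np.uint8)
print("fraction of zero bytes:", np.mean(q_acts == 0))

weights = rng.normal(0.0, 0.05, size=4096)
clean_out = float(weights @ q_acts)

def flip_random_bit(values, direction):
    """Flip one randomly chosen bit whose current value matches `direction` ('0->1' or '1->0')."""
    v = values.copy()
    target = 0 if direction == "0->1" else 1
    while True:
        idx = rng.integers(len(v))
        bit = int(rng.integers(8))
        if (v[idx] >> bit) & 1 == target:
            v[idx] = v[idx] ^ (1 << bit)
            return v

def mean_abs_error(direction, trials=200):
    """Average output perturbation of the toy layer under one random upset per trial."""
    errs = []
    for _ in range(trials):
        faulty = flip_random_bit(q_acts, direction)
        errs.append(abs(float(weights @ faulty) - clean_out))
    return np.mean(errs)

# Observation 2: 0->1 upsets inject large spurious magnitudes, while 1->0 upsets
# mostly shrink values that are already small.
print("mean |output error|, 0->1 flips:", mean_abs_error("0->1"))
print("mean |output error|, 1->0 flips:", mean_abs_error("1->0"))
```

In this toy setting, roughly half of the stored activation bytes are zero and the average output error from 0-to-1 flips is several times larger than from 1-to-0 flips, which is consistent with a cell design that prioritizes keeping stored zeros intact.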