Efficient Functional In-Field Self-Test for Deep Learning Accelerators

2021 IEEE International Test Conference (ITC) Pub Date : 2021-10-01 DOI:10.1109/ITC50571.2021.00017

Yi He, T. Uezono, Yanjing Li

{"title":"Efficient Functional In-Field Self-Test for Deep Learning Accelerators","authors":"Yi He, T. Uezono, Yanjing Li","doi":"10.1109/ITC50571.2021.00017","DOIUrl":null,"url":null,"abstract":"We present a technique that generates high-quality functional in-field self-tests specifically targeting deep learning (DL) accelerators. These functional tests can be applied in the field during normal operation of a DL accelerator, which is crucial to ensure that the safety and/or reliability requirements are met for any given application, including safety-critical applications such as self-driving cars, robotics, and more.Our technique takes advantage of special architectural characteristics and application properties to achieve high functional test coverage while incurring minimal system-level costs. Moreover, we devise different strategies for the compute units (which support computation operations) and the control units (which control data movement) because these two types of units exhibit different properties. For the compute units of a DL accelerator, we first use combinational ATPG to generate test patterns with high test coverage, which is possible because these units do not contain complex sequential logic. Next, we map the ATPG patterns to one or more equivalent deep neural networks (DNNs) that can be directly executed on the accelerator, which is possible given the well-defined dataflow/reuse algorithm of a DL accelerator. For the control units, we leverage the property that typically only one or a few fixed DNNs are deployed at a time in many application domains (e.g., self-driving cars). Thus, it is sufficient to target only the faults that can directly affect the correctness of the DNNs that are currently deployed. This is done by executing different layers of each target DNN using carefully-crafted input and weight values to maximize test coverage while minimizing test time.We apply our technique using Nvidia’s open-source accelerator as a case study to demonstrate its efficacy. Our results show that our technique achieves high test coverage. For the compute units, 99.9% single stuck-at functional test coverage is achieved. For the control units, we are able to prove that, given any target DNN, 100% coverage can be achieved for a large class of single and multiple fault models. The in-field functional self-test time is also very low, < 17 ms for various representative DNNs. These functional tests can be applied during boot-up, reset, and even concurrently with normal operation by executing DNN test programs directly on the accelerator, without requiring any test support in the hardware.","PeriodicalId":147006,"journal":{"name":"2021 IEEE International Test Conference (ITC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Test Conference (ITC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC50571.2021.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

We present a technique that generates high-quality functional in-field self-tests specifically targeting deep learning (DL) accelerators. These functional tests can be applied in the field during normal operation of a DL accelerator, which is crucial to ensure that the safety and/or reliability requirements are met for any given application, including safety-critical applications such as self-driving cars, robotics, and more.Our technique takes advantage of special architectural characteristics and application properties to achieve high functional test coverage while incurring minimal system-level costs. Moreover, we devise different strategies for the compute units (which support computation operations) and the control units (which control data movement) because these two types of units exhibit different properties. For the compute units of a DL accelerator, we first use combinational ATPG to generate test patterns with high test coverage, which is possible because these units do not contain complex sequential logic. Next, we map the ATPG patterns to one or more equivalent deep neural networks (DNNs) that can be directly executed on the accelerator, which is possible given the well-defined dataflow/reuse algorithm of a DL accelerator. For the control units, we leverage the property that typically only one or a few fixed DNNs are deployed at a time in many application domains (e.g., self-driving cars). Thus, it is sufficient to target only the faults that can directly affect the correctness of the DNNs that are currently deployed. This is done by executing different layers of each target DNN using carefully-crafted input and weight values to maximize test coverage while minimizing test time.We apply our technique using Nvidia’s open-source accelerator as a case study to demonstrate its efficacy. Our results show that our technique achieves high test coverage. For the compute units, 99.9% single stuck-at functional test coverage is achieved. For the control units, we are able to prove that, given any target DNN, 100% coverage can be achieved for a large class of single and multiple fault models. The in-field functional self-test time is also very low, < 17 ms for various representative DNNs. These functional tests can be applied during boot-up, reset, and even concurrently with normal operation by executing DNN test programs directly on the accelerator, without requiring any test support in the hardware.

查看原文本刊更多论文

我们提出了一种专门针对深度学习(DL)加速器生成高质量功能现场自我测试的技术。这些功能测试可以在DL加速器正常运行期间在现场进行，这对于确保满足任何给定应用(包括自动驾驶汽车、机器人等安全关键应用)的安全性和/或可靠性要求至关重要。我们的技术利用了特殊的体系结构特征和应用程序属性来实现高功能测试覆盖率，同时产生最小的系统级成本。此外，我们为计算单元(支持计算操作)和控制单元(控制数据移动)设计了不同的策略，因为这两种类型的单元表现出不同的属性。对于DL加速器的计算单元，我们首先使用组合ATPG来生成具有高测试覆盖率的测试模式，这是可能的，因为这些单元不包含复杂的顺序逻辑。接下来，我们将ATPG模式映射到一个或多个等效的深度神经网络(dnn)，这些深度神经网络可以直接在加速器上执行，这是可能的，因为DL加速器具有定义良好的数据流/重用算法。对于控制单元，我们利用通常在许多应用领域(例如，自动驾驶汽车)中一次只部署一个或几个固定dnn的属性。因此，仅针对可能直接影响当前部署的dnn的正确性的错误就足够了。这是通过使用精心设计的输入和权重值执行每个目标DNN的不同层来实现的，以最大化测试覆盖率，同时最小化测试时间。我们使用Nvidia的开源加速器作为案例研究来证明我们的技术的有效性。我们的结果表明，我们的技术达到了很高的测试覆盖率。对于计算单元，99.9%的单卡功能测试覆盖率被实现。对于控制单元，我们能够证明，给定任何目标DNN，对于大类别的单故障和多故障模型可以实现100%的覆盖率。现场功能自检时间也很低，各代表性dnn均< 17 ms。通过直接在加速器上执行DNN测试程序，这些功能测试可以在启动、复位甚至与正常操作同时进行，而不需要硬件中的任何测试支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Test Conference (ITC)

自引率

0.00%

发文量