Work-in-Progress: Ultra-fast yet Accurate Performance Prediction for Deep Neural Network Accelerators

Konstantin Lübeck, Alexander Louis-Ferdinand Jung, Felix Wedlich, O. Bringmann

2022 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems (CASES), October 2022
DOI: 10.1109/CASES55004.2022.00020
Abstract
We present an automatic methodology that accurately predicts the performance of Deep Neural Network (DNN) accelerators from abstract, highly flexible descriptions of both the accelerator architecture and the DNN. By mapping partially unrolled neural network layers onto the accelerator architecture, we automatically construct an analytical performance model. Because the dataflow of a DNN layer is highly regular, this model needs to evaluate only a few loop iterations to determine the performance of the entire layer.
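The central idea of the abstract — that dataflow regularity lets a few steady-state loop iterations stand in for a whole layer — can be illustrated with a minimal sketch. This is not the authors' implementation; all architecture parameters (the unroll factor and the MAC throughput of a hypothetical accelerator array) are invented assumptions for illustration only.

```python
def conv_layer_iterations(out_ch, in_ch, out_h, out_w, k):
    """Total scalar MAC operations in a dense convolution layer."""
    return out_ch * in_ch * out_h * out_w * k * k


def cycles_per_iteration(unroll_factor, macs_per_cycle):
    """Cycles one partially unrolled iteration takes on a MAC array
    that retires `macs_per_cycle` MACs per cycle (ceiling division)."""
    return -(-unroll_factor // macs_per_cycle)


def predict_layer_cycles(out_ch, in_ch, out_h, out_w, k,
                         unroll_factor=16, macs_per_cycle=16):
    """Analytical latency estimate for one layer.

    Because every unrolled steady-state iteration behaves identically on
    a regular dataflow accelerator, it suffices to cost ONE iteration and
    multiply by the iteration count, instead of simulating the full loop
    nest. `unroll_factor` and `macs_per_cycle` are hypothetical values.
    """
    total_macs = conv_layer_iterations(out_ch, in_ch, out_h, out_w, k)
    n_unrolled = -(-total_macs // unroll_factor)  # ceil(total / unroll)
    return n_unrolled * cycles_per_iteration(unroll_factor, macs_per_cycle)


# Example: first convolution of a ResNet-style network
# (64 output channels, 3 input channels, 112x112 output, 7x7 kernel).
print(predict_layer_cycles(64, 3, 112, 112, 7))
```

The cost of a whole layer is obtained in constant time from a single per-iteration cost, which is why such analytical models can be orders of magnitude faster than cycle-accurate simulation while remaining accurate for regular workloads.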