Special Session: When Dataflows Converge: Reconfigurable and Approximate Computing for Emerging Neural Networks

2021 IEEE 39th International Conference on Computer Design (ICCD) Pub Date : 2021-10-01 DOI:10.1109/ICCD53106.2021.00014

Di Wu, Joshua San Miguel

引用次数: 4

Abstract

Deep Neural Networks (DNNs) have gained significant attention in both academia and industry due to the superior application-level accuracy. As DNNs rely on compute- or memory-intensive general matrix multiply (GEMM) operations, approximate computing has been widely explored across the computing stack to mitigate the hardware overheads. However, better-performing DNNs are emerging with growing complexity in their use of nonlinear operations, which incurs even more hardware cost. In this work, we address this challenge by proposing a reconfigurable systolic array to execute both GEMM and nonlinear operations via approximation with distinguished dataflows. Experiments demonstrate that such converging of dataflows significantly saves the hardware cost of emerging DNN inference.

查看原文本刊更多论文

专题会议:当数据流收敛:新兴神经网络的可重构和近似计算

深度神经网络(Deep Neural Networks, dnn)由于其优越的应用级精度，在学术界和工业界都受到了极大的关注。由于深度神经网络依赖于计算或内存密集型的一般矩阵乘法(GEMM)操作，因此在计算堆栈上广泛探索近似计算以减少硬件开销。然而，性能更好的深度神经网络正在出现，其使用的非线性运算越来越复杂，这导致了更多的硬件成本。在这项工作中，我们提出了一个可重构的收缩阵列来执行GEMM和非线性操作，通过近似不同的数据流来解决这一挑战。实验表明，这种数据流的收敛大大节省了新兴深度神经网络推理的硬件成本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 39th International Conference on Computer Design (ICCD)

自引率

0.00%

发文量