Exploring deep reuse in winograd CNN inference

Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming Pub Date : 2021-02-17 DOI:10.1145/3437801.3441588

Ruofan Wu, Feng Zhang, Zhen Zheng, Xiaoyong Du, Xipeng Shen

引用次数: 7

Abstract

Convolutional neural networks (CNNs), as representatives of deep learning, are one of the most commonly used neural networks in applications such as graphic image analysis. However, CNN has heavy computation patterns; network training processes could take several hours even with modern processors. Different from the training process, the inference process is more often executed on devices with low computing power, such as CPUs. Fortunately, a minimal filtering algorithm, Winograd, can reduce the convolution computations by reducing the number of multiplication operations. We find that the Winograd convolution can be further accelerated by reusing the similar data and computation patterns, which is called deep reuse.

查看原文本刊更多论文

winograd CNN推理中的深度重用探索

卷积神经网络(cnn)作为深度学习的代表，是图形图像分析等应用中最常用的神经网络之一。然而，CNN有大量的计算模式;即使使用现代处理器，网络训练过程也可能需要几个小时。与训练过程不同，推理过程更多地在cpu等计算能力较低的设备上执行。幸运的是，最小过滤算法Winograd可以通过减少乘法操作的次数来减少卷积计算。我们发现通过重用相似的数据和计算模式可以进一步加速Winograd卷积，这被称为深度重用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

自引率

0.00%

发文量