Accelerating Matrix Multiplication in Deep Learning by Using Low-Rank Approximation

2017 International Conference on High Performance Computing & Simulation (HPCS) Pub Date : 2017-07-17 DOI:10.1109/HPCS.2017.37

Kazuki Osawa, Akira Sekiya, Hiroki Naganuma, Rio Yokota

引用次数: 12

Abstract

The open source frameworks of deep learning including TensorFlow, Caffe, Torch, etc. are widely used all over the world and its acceleration have great meaning. In these frameworks, a lot of computation time is spent on convolution, and highly tuned libraries such as cuDNN play important role on accelerating convolution. In these libraries, however, a convolution computation is performed without approximating a dense matrices. In this research, we propose a method to introduce the low-rank approximation method, widely used in the field of scientific and technical computation, into the convolution computation. As a result of investigating the influence on the recognition accuracy of the existing model, it is possible to reduce up to about 90% of rank of data matrices while keeping recognition accuracy −2% of baseline.

查看原文本刊更多论文

利用低秩逼近加速深度学习中的矩阵乘法

包括TensorFlow、Caffe、Torch等在内的深度学习开源框架在世界范围内被广泛使用，其加速具有重要意义。在这些框架中，大量的计算时间花在卷积上，而高度调优的库(如cuDNN)在加速卷积方面发挥了重要作用。然而，在这些库中，不需要逼近密集矩阵就可以执行卷积计算。在本研究中，我们提出了一种将在科技计算领域广泛应用的低秩近似方法引入卷积计算的方法。通过研究现有模型对识别精度的影响，可以将数据矩阵的秩降低约90%，同时保持识别精度为基线的- 2%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on High Performance Computing & Simulation (HPCS)

自引率

0.00%

发文量