Low-Rank Compression of Neural Nets: Learning the Rank of Each Layer

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2020-06-01 DOI:10.1109/cvpr42600.2020.00807

Yerlan Idelbayev, M. A. Carreira-Perpiñán

引用次数: 91

Abstract

Neural net compression can be achieved by approximating each layer's weight matrix by a low-rank matrix. The real difficulty in doing this is not in training the resulting neural net (made up of one low-rank matrix per layer), but in determining what the optimal rank of each layer is—effectively, an architecture search problem with one hyperparameter per layer. We show that, with a suitable formulation, this problem is amenable to a mixed discrete-continuous optimization jointly over the ranks and over the matrix elements, and give a corresponding algorithm. We show that this indeed can select ranks much better than existing approaches, making low-rank compression much more attractive than previously thought. For example, we can make a VGG network faster than a ResNet and with nearly the same classification error.

查看原文本刊更多论文

神经网络的低秩压缩:学习每层的秩

神经网络压缩可以通过用低秩矩阵逼近每层的权值矩阵来实现。这样做的真正困难不在于训练得到的神经网络(每层由一个低秩矩阵组成)，而在于确定每层的最优秩是什么——实际上，这是一个每层有一个超参数的架构搜索问题。我们用合适的公式证明了该问题可适用于对秩和对矩阵元素的混合离散-连续优化，并给出了相应的算法。我们表明，这确实可以比现有的方法更好地选择秩，使低秩压缩比以前想象的更有吸引力。例如，我们可以使VGG网络比ResNet更快，并且具有几乎相同的分类误差。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量