Efficient deep convolutional model compression with an active stepwise pruning approach

Int. J. Comput. Sci. Eng. Pub Date : 2020-08-26 DOI:10.1504/ijcse.2020.10031600

Sheng-sheng Wang, Chunshang Xing, Dong Liu

引用次数: 3

Abstract

Deep models are structurally tremendous and complex, thus making it hard to deploy on the embedded hardware with restricted memory and computing power. Although, the existing compression methods have pruned the deep models effectively, some issues exist in those methods, such as multiple iterations needed in fine-tuning phase, difficulty in pruning granularity control and numerous hyperparameters needed to set. In this paper, we propose an active stepwise pruning method of a logarithmic function which only needs to set three hyperparameters and a few epochs. We also propose a recovery strategy to repair the incorrect pruning thus ensuring the prediction accuracy of model. Pruning and repairing alternately constitute cyclic process along with updating the weights in layers. Our method can prune the parameters of MobileNet, AlexNet, VGG-16 and ZFNet by a factor of 5.6×, 11.7×, 16.6× and 15× respectively without any accuracy loss, which precedes the existing methods.

查看原文本刊更多论文

有效的深度卷积模型压缩与主动逐步修剪方法

深度模型结构庞大且复杂，难以在内存和计算能力有限的嵌入式硬件上部署。现有的压缩方法虽然对深度模型进行了有效的剪枝，但存在微调阶段需要多次迭代、剪枝粒度控制困难以及需要设置大量超参数等问题。本文提出了一种对数函数的主动逐步剪枝方法，该方法只需要设置三个超参数和几个epoch。我们还提出了一种修复错误剪枝的恢复策略，从而保证了模型的预测精度。随着层间权值的更新，剪枝和修复交替构成循环过程。我们的方法可以对MobileNet、AlexNet、VGG-16和ZFNet的参数分别进行5.6倍、11.7倍、16.6倍和15倍的裁剪，而没有任何精度损失，优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Comput. Sci. Eng.

自引率

0.00%

发文量