Automatic model redundancy reduction for fast back-propagation for deep neural networks in speech recognition

2015 International Joint Conference on Neural Networks (IJCNN) Pub Date : 2015-07-12 DOI:10.1109/IJCNN.2015.7280335

Y. Qian, Tianxing He, Wei Deng, Kai Yu

引用次数: 6

Abstract

Although deep neural networks (DNNs) have achieved great performance gain, the immense computational cost of DNN model training has become a major block to utilize massive speech data for DNN training. Previous research on DNN training acceleration mostly focussed on hardware-based parallelization. In this paper, node pruning and arc restructuring are proposed to explore model redundancy after a novel lightly discriminative pretraining process. With some measures of node/arc importance, model redundancies are automatically removed to form a much more compact DNN. This significantly accelerates the subsequent back-propagation (BP) training process. Model redundancy reduction can be combined with multiple GPU parallelization to achieve further acceleration. Experiments showed that the combined acceleration framework can achieve about 85% model size reduction and over 4.2 times speed-up factor for BP training on 2 GPUs, at no loss of recognition accuracy.

查看原文本刊更多论文

语音识别中深度神经网络快速反向传播的自动模型冗余削减

尽管深度神经网络(deep neural networks, DNN)已经取得了很大的性能提升，但DNN模型训练的巨大计算成本已经成为阻碍利用海量语音数据进行DNN训练的主要障碍。以往关于深度神经网络训练加速的研究主要集中在基于硬件的并行化上。本文提出了一种新的轻判别预训练过程，通过节点修剪和圆弧重构来探索模型冗余。通过一些节点/弧重要性的度量，模型冗余被自动去除，形成一个更紧凑的DNN。这大大加快了随后的反向传播(BP)训练过程。模型冗余减少可以与多个GPU并行化相结合，以实现进一步的加速。实验表明，在不影响识别精度的情况下，该组合加速框架在2个gpu上进行BP训练，可以实现约85%的模型尺寸缩减和超过4.2倍的加速系数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 International Joint Conference on Neural Networks (IJCNN)

自引率

0.00%

发文量