Self-stabilized deep neural network

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2016-12-20 DOI:10.1109/ICASSP.2016.7472719

Pegah Ghahremani, J. Droppo

引用次数: 7

Abstract

Deep neural network models have been successfully applied to many tasks such as image labeling and speech recognition. Mini-batch stochastic gradient descent is the most prevalent method for training these models. A critical part of successfully applying this method is choosing appropriate initial values, as well as local and global learning rate scheduling algorithms. In this paper, we present a method which is less sensitive to choice of initial values, works better than popular learning rate adjustment algorithms, and speeds convergence on model parameters. We show that using the Self-stabilized DNN method, we no longer require initial learning rate tuning and training converges quickly with a fixed global learning rate. The proposed method provides promising results over conventional DNN structure with better convergence rate.

查看原文本刊更多论文

自稳定深度神经网络

深度神经网络模型已经成功地应用于许多任务，如图像标记和语音识别。小批量随机梯度下降法是训练这些模型最常用的方法。成功应用该方法的关键是选择合适的初始值，以及局部和全局学习率调度算法。在本文中，我们提出了一种对初始值的选择不太敏感的方法，比常用的学习率调整算法效果更好，并且加快了模型参数的收敛速度。我们表明，使用自稳定DNN方法，我们不再需要初始学习率调整，并且训练以固定的全局学习率快速收敛。与传统的深度神经网络结构相比，该方法具有更好的收敛速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

自引率

0.00%

发文量