An improved recurrent neural network language model with context vector features

2014 IEEE 5th International Conference on Software Engineering and Service Science Pub Date : 2014-06-27 DOI:10.1109/ICSESS.2014.6933694

Jian Zhang, Dan Qu, Zhen Li

引用次数: 4

Abstract

Recurrent neural network language models have solved the problems of data sparseness and dimensionality disaster which exist in traditional N-gram models. RNNLMs have recently demonstrated state-of-the-art performance in speech recognition, machine translation and other tasks. In this paper, we improve the model performance by providing contextual word vectors in association with RNNLMs. This method can reinforce the ability of learning long-distance information using vectors training from Skip-gram model. The experimental results show that the proposed method can improve the perplexity performance significantly on Penn Treebank data. And we further apply the models to speech recognition task on the Wall Street Journal corpora, where we achieve obvious improvements in word-error-rate.

查看原文本刊更多论文

基于上下文向量特征的改进递归神经网络语言模型

递归神经网络语言模型解决了传统n图模型存在的数据稀疏性和维数灾难问题。rnnlm最近在语音识别、机器翻译和其他任务中表现出了最先进的性能。在本文中，我们通过提供与rnnlm相关联的上下文词向量来提高模型性能。该方法利用Skip-gram模型的向量训练增强了远程信息的学习能力。实验结果表明，该方法可以显著提高Penn Treebank数据的perplexity性能。我们进一步将模型应用于华尔街日报语料库的语音识别任务，在错误率方面取得了明显的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE 5th International Conference on Software Engineering and Service Science

自引率

0.00%

发文量