A hybrid continuous speech recognition system using segmental neural nets with hidden Markov models

Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop Pub Date : 1991-09-30 DOI:10.1109/NNSP.1991.239507

G. Zavaliagkos, S. Austin, J. Makhoul, R. Schwartz

引用次数: 21

Abstract

The authors present the concept of a 'segmental neural net' (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how than can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, they have developed a system that combines the SNN with a hidden Markov model (HMM) system. They believe that this is the first system incorporating a neural network for which the performance has exceeded the state of the art in large-vocabulary, continuous speech recognition. To take advantage of the training and decoding speed of HMMs, the authors have developed a novel hybrid SNN/HMM system that combines the advantages of both types of approaches. In this hybrid system, use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN. The HMM and SNN scores are then combined to optimize performance.<>

查看原文本刊更多论文

基于隐马尔可夫模型的分段神经网络混合连续语音识别系统

作者提出了用于连续语音识别(CSR)语音建模的“分段神经网络”(SNN)的概念，并演示了如何将其与多假设(或N-Best)范例一起使用，以组合不同的CSR系统。特别是，他们开发了一种将SNN与隐马尔可夫模型(HMM)系统相结合的系统。他们认为，这是第一个结合了神经网络的系统，其性能在大词汇量、连续语音识别方面超过了目前的水平。为了利用HMM的训练和解码速度，作者开发了一种新的混合SNN/HMM系统，该系统结合了两种方法的优点。在这个混合系统中，使用N-best范式来生成可能的语音切分，然后由SNN进行评分。然后结合HMM和SNN分数来优化性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop

自引率

0.00%

发文量