基于隐马尔可夫模型的分段神经网络混合连续语音识别系统

Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop Pub Date : 1991-09-30 DOI:10.1109/NNSP.1991.239507

G. Zavaliagkos, S. Austin, J. Makhoul, R. Schwartz

{"title":"基于隐马尔可夫模型的分段神经网络混合连续语音识别系统","authors":"G. Zavaliagkos, S. Austin, J. Makhoul, R. Schwartz","doi":"10.1109/NNSP.1991.239507","DOIUrl":null,"url":null,"abstract":"The authors present the concept of a 'segmental neural net' (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how than can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, they have developed a system that combines the SNN with a hidden Markov model (HMM) system. They believe that this is the first system incorporating a neural network for which the performance has exceeded the state of the art in large-vocabulary, continuous speech recognition. To take advantage of the training and decoding speed of HMMs, the authors have developed a novel hybrid SNN/HMM system that combines the advantages of both types of approaches. In this hybrid system, use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN. The HMM and SNN scores are then combined to optimize performance.<<ETX>>","PeriodicalId":354832,"journal":{"name":"Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"A hybrid continuous speech recognition system using segmental neural nets with hidden Markov models\",\"authors\":\"G. Zavaliagkos, S. Austin, J. Makhoul, R. Schwartz\",\"doi\":\"10.1109/NNSP.1991.239507\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The authors present the concept of a 'segmental neural net' (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how than can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, they have developed a system that combines the SNN with a hidden Markov model (HMM) system. They believe that this is the first system incorporating a neural network for which the performance has exceeded the state of the art in large-vocabulary, continuous speech recognition. To take advantage of the training and decoding speed of HMMs, the authors have developed a novel hybrid SNN/HMM system that combines the advantages of both types of approaches. In this hybrid system, use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN. The HMM and SNN scores are then combined to optimize performance.<<ETX>>\",\"PeriodicalId\":354832,\"journal\":{\"name\":\"Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1991-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NNSP.1991.239507\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NNSP.1991.239507","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

摘要

作者提出了用于连续语音识别(CSR)语音建模的“分段神经网络”(SNN)的概念，并演示了如何将其与多假设(或N-Best)范例一起使用，以组合不同的CSR系统。特别是，他们开发了一种将SNN与隐马尔可夫模型(HMM)系统相结合的系统。他们认为，这是第一个结合了神经网络的系统，其性能在大词汇量、连续语音识别方面超过了目前的水平。为了利用HMM的训练和解码速度，作者开发了一种新的混合SNN/HMM系统，该系统结合了两种方法的优点。在这个混合系统中，使用N-best范式来生成可能的语音切分，然后由SNN进行评分。然后结合HMM和SNN分数来优化性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A hybrid continuous speech recognition system using segmental neural nets with hidden Markov models

The authors present the concept of a 'segmental neural net' (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how than can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, they have developed a system that combines the SNN with a hidden Markov model (HMM) system. They believe that this is the first system incorporating a neural network for which the performance has exceeded the state of the art in large-vocabulary, continuous speech recognition. To take advantage of the training and decoding speed of HMMs, the authors have developed a novel hybrid SNN/HMM system that combines the advantages of both types of approaches. In this hybrid system, use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN. The HMM and SNN scores are then combined to optimize performance.<>

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop

自引率

0.00%

发文量