Methodologies for language modeling and search in continuous speech recognition

Proceedings IEEE Southeastcon '95. Visualize the Future Pub Date : 1995-03-26 DOI:10.1109/SECON.1995.513083

N. Deshmukh, J. Picone

引用次数: 4

Abstract

Automatic speech recognition has made significant strides from the days of recognizing isolated words. State-of-the-art systems are capable of recognizing tens of thousands of words in complex domains such as newspaper correspondence and travel planning. A major part of this success is due to advances in language modeling and search techniques that support efficient, sub-optimal decoding over large search spaces. The benefit from focusing a recognition system on a particular domain has motivated a steady progression from static language models towards more adaptive models that consist of mixtures of bigrams, trigrams and long-distance n-grams. Similarly, the availability of multiple sources of information about the correct word hypothesis has led to the advent of efficient multi-pass search strategies. The result is a powerful pattern-matching paradigm that has applications to a wide range of signal detection problems. Future research in large vocabulary continuous speech recognition will be directed towards developing more efficient means of dynamically integrating such information.

查看原文本刊更多论文

连续语音识别中的语言建模和搜索方法

自动语音识别已经从识别孤立单词的时代取得了重大进展。最先进的系统能够在报纸通信和旅行计划等复杂领域识别成千上万个单词。这种成功的主要原因是语言建模和搜索技术的进步，这些技术支持在大型搜索空间上进行高效的次优解码。专注于特定领域的识别系统的好处促使了从静态语言模型向由双元、三元和远距离n元混合组成的更具适应性的模型的稳步发展。同样，关于正确单词假设的多种信息来源的可用性导致了高效的多遍搜索策略的出现。其结果是一个强大的模式匹配范例，可应用于广泛的信号检测问题。未来在大词汇量连续语音识别方面的研究将朝着开发更有效的动态整合这些信息的方法方向发展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings IEEE Southeastcon '95. Visualize the Future

自引率

0.00%

发文量