Methodologies for language modeling and search in continuous speech recognition

N. Deshmukh, J. Picone
{"title":"Methodologies for language modeling and search in continuous speech recognition","authors":"N. Deshmukh, J. Picone","doi":"10.1109/SECON.1995.513083","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition has made significant strides from the days of recognizing isolated words. State-of-the-art systems are capable of recognizing tens of thousands of words in complex domains such as newspaper correspondence and travel planning. A major part of this success is due to advances in language modeling and search techniques that support efficient, sub-optimal decoding over large search spaces. The benefit from focusing a recognition system on a particular domain has motivated a steady progression from static language models towards more adaptive models that consist of mixtures of bigrams, trigrams and long-distance n-grams. Similarly, the availability of multiple sources of information about the correct word hypothesis has led to the advent of efficient multi-pass search strategies. The result is a powerful pattern-matching paradigm that has applications to a wide range of signal detection problems. Future research in large vocabulary continuous speech recognition will be directed towards developing more efficient means of dynamically integrating such information.","PeriodicalId":334874,"journal":{"name":"Proceedings IEEE Southeastcon '95. Visualize the Future","volume":"188 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE Southeastcon '95. Visualize the Future","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SECON.1995.513083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Automatic speech recognition has made significant strides from the days of recognizing isolated words. State-of-the-art systems are capable of recognizing tens of thousands of words in complex domains such as newspaper correspondence and travel planning. A major part of this success is due to advances in language modeling and search techniques that support efficient, sub-optimal decoding over large search spaces. The benefit from focusing a recognition system on a particular domain has motivated a steady progression from static language models towards more adaptive models that consist of mixtures of bigrams, trigrams and long-distance n-grams. Similarly, the availability of multiple sources of information about the correct word hypothesis has led to the advent of efficient multi-pass search strategies. The result is a powerful pattern-matching paradigm that has applications to a wide range of signal detection problems. Future research in large vocabulary continuous speech recognition will be directed towards developing more efficient means of dynamically integrating such information.
连续语音识别中的语言建模和搜索方法
自动语音识别已经从识别孤立单词的时代取得了重大进展。最先进的系统能够在报纸通信和旅行计划等复杂领域识别成千上万个单词。这种成功的主要原因是语言建模和搜索技术的进步,这些技术支持在大型搜索空间上进行高效的次优解码。专注于特定领域的识别系统的好处促使了从静态语言模型向由双元、三元和远距离n元混合组成的更具适应性的模型的稳步发展。同样,关于正确单词假设的多种信息来源的可用性导致了高效的多遍搜索策略的出现。其结果是一个强大的模式匹配范例,可应用于广泛的信号检测问题。未来在大词汇量连续语音识别方面的研究将朝着开发更有效的动态整合这些信息的方法方向发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信