中文语音识别中一种有效的非关键字拒绝方案

Wei-Chih Hsu, Shih-Chang Hsia
{"title":"中文语音识别中一种有效的非关键字拒绝方案","authors":"Wei-Chih Hsu, Shih-Chang Hsia","doi":"10.1109/TENCON.1999.818356","DOIUrl":null,"url":null,"abstract":"During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.","PeriodicalId":121142,"journal":{"name":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An efficient nonkeyword rejection scheme for Mandarin speech recognition\",\"authors\":\"Wei-Chih Hsu, Shih-Chang Hsia\",\"doi\":\"10.1109/TENCON.1999.818356\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.\",\"PeriodicalId\":121142,\"journal\":{\"name\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.1999.818356\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.1999.818356","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在20世纪90年代,许多研究者尝试将自动语音识别(ASR)系统应用于现实世界,并且已经有一些系统在运行。但是,在语音自动输入方面还存在一些问题需要解决。一个典型的问题是拒绝不包含任何有效关键字的话语或验证嵌入在输入话语中的关键字。通常,拒绝或接受一个话语作为关键字的决定是将非标准化分数与阈值进行比较。近年来,人们提出了一些基于归一化分数的方案来解决这一问题。此外,研究表明,基于归一化分数的方法比基于非归一化分数的方法提供了更好的性能。然而,这种方法的关键是确定从识别系统及其对应的反模型的输出中间接获得的两个似然之比。在本文中,我们将提出一个有效的方法来处理这个问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An efficient nonkeyword rejection scheme for Mandarin speech recognition
During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信