中文语音识别中一种有效的非关键字拒绝方案

Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030) Pub Date : 1999-09-15 DOI:10.1109/TENCON.1999.818356

Wei-Chih Hsu, Shih-Chang Hsia

{"title":"中文语音识别中一种有效的非关键字拒绝方案","authors":"Wei-Chih Hsu, Shih-Chang Hsia","doi":"10.1109/TENCON.1999.818356","DOIUrl":null,"url":null,"abstract":"During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.","PeriodicalId":121142,"journal":{"name":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An efficient nonkeyword rejection scheme for Mandarin speech recognition\",\"authors\":\"Wei-Chih Hsu, Shih-Chang Hsia\",\"doi\":\"10.1109/TENCON.1999.818356\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.\",\"PeriodicalId\":121142,\"journal\":{\"name\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.1999.818356\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.1999.818356","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在20世纪90年代，许多研究者尝试将自动语音识别(ASR)系统应用于现实世界，并且已经有一些系统在运行。但是，在语音自动输入方面还存在一些问题需要解决。一个典型的问题是拒绝不包含任何有效关键字的话语或验证嵌入在输入话语中的关键字。通常，拒绝或接受一个话语作为关键字的决定是将非标准化分数与阈值进行比较。近年来，人们提出了一些基于归一化分数的方案来解决这一问题。此外，研究表明，基于归一化分数的方法比基于非归一化分数的方法提供了更好的性能。然而，这种方法的关键是确定从识别系统及其对应的反模型的输出中间接获得的两个似然之比。在本文中，我们将提出一个有效的方法来处理这个问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An efficient nonkeyword rejection scheme for Mandarin speech recognition

During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)

自引率

0.00%

发文量