{"title":"中文语音识别中一种有效的非关键字拒绝方案","authors":"Wei-Chih Hsu, Shih-Chang Hsia","doi":"10.1109/TENCON.1999.818356","DOIUrl":null,"url":null,"abstract":"During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.","PeriodicalId":121142,"journal":{"name":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An efficient nonkeyword rejection scheme for Mandarin speech recognition\",\"authors\":\"Wei-Chih Hsu, Shih-Chang Hsia\",\"doi\":\"10.1109/TENCON.1999.818356\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.\",\"PeriodicalId\":121142,\"journal\":{\"name\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.1999.818356\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.1999.818356","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An efficient nonkeyword rejection scheme for Mandarin speech recognition
During the decade of 1990's, many researchers have tried to make automatic speech recognition (ASR) systems available in the real world and there are some systems operating already. However there are still some problems which need to be solved, especially for spontaneous speech input. One of the typical problems is to reject utterance that does not include any valid keyword or to verify the keyword embedded in the input utterance. Conventionally, the decision to reject or accept an utterance as keyword is to compare an unnormalized score with a threshold. Recently, some schemes based on normalized scores are proposed to cope with this problem. Furthermore, if has been shown that the methods based on normalized score many provide better performance than unnormalized ones. However the key point of this kind of method is to determine the ratio of two likelihoods which are obtained indirectly from the output of a recognition system and its corresponding antimodel, respectively. In this paper, we will propose an efficient approach to deal with this problem.