随机显式语音识别系统中的新词添加与自适应

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing Pub Date : 1993-04-27 DOI:10.1109/ICASSP.1993.319894

A. Asadi, H. Leung

{"title":"随机显式语音识别系统中的新词添加与自适应","authors":"A. Asadi, H. Leung","doi":"10.1109/ICASSP.1993.319894","DOIUrl":null,"url":null,"abstract":"The authors extend on automatic procedure for the addition of new words to a speech recognition system to include alternative pronunciations for the new words. They investigate methods for adaptation to new words after these are added to the system. For adaptation, the goal was the improvement of the accuracy of the system on the new words, using only a limited amount of speech data. All the experiments are performed within the stochastic explicit-segment speech recognition system. The authors evaluated 25 isolated city names from a speech corpus, CITRON, collected from real users over the telephone network. For this task, improvement in accuracy is shown from a 34% error rate, when trained on the NTIMIT database alone, to 8% after adapting to 30 tokens, on average, from each new word.<<ETX>>","PeriodicalId":428449,"journal":{"name":"1993 IEEE International Conference on Acoustics, Speech, and Signal Processing","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"New-word addition and adaptation in a stochastic explicit-segment speech recognition system\",\"authors\":\"A. Asadi, H. Leung\",\"doi\":\"10.1109/ICASSP.1993.319894\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The authors extend on automatic procedure for the addition of new words to a speech recognition system to include alternative pronunciations for the new words. They investigate methods for adaptation to new words after these are added to the system. For adaptation, the goal was the improvement of the accuracy of the system on the new words, using only a limited amount of speech data. All the experiments are performed within the stochastic explicit-segment speech recognition system. The authors evaluated 25 isolated city names from a speech corpus, CITRON, collected from real users over the telephone network. For this task, improvement in accuracy is shown from a 34% error rate, when trained on the NTIMIT database alone, to 8% after adapting to 30 tokens, on average, from each new word.<<ETX>>\",\"PeriodicalId\":428449,\"journal\":{\"name\":\"1993 IEEE International Conference on Acoustics, Speech, and Signal Processing\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1993-04-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"1993 IEEE International Conference on Acoustics, Speech, and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1993.319894\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"1993 IEEE International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1993.319894","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

作者扩展了语音识别系统中添加新词的自动程序，以包括新词的替代发音。他们研究新词加入系统后的适应方法。对于适应，目标是提高系统对新词的准确性，只使用有限数量的语音数据。所有实验都是在随机显式语音识别系统中进行的。作者评估了语音语料库CITRON中25个孤立的城市名称，这些语料库是通过电话网络从真实用户那里收集的。对于这个任务，准确率从仅在NTIMIT数据库上训练时的34%错误率提高到平均每个新词适应30个标记后的8%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

New-word addition and adaptation in a stochastic explicit-segment speech recognition system

The authors extend on automatic procedure for the addition of new words to a speech recognition system to include alternative pronunciations for the new words. They investigate methods for adaptation to new words after these are added to the system. For adaptation, the goal was the improvement of the accuracy of the system on the new words, using only a limited amount of speech data. All the experiments are performed within the stochastic explicit-segment speech recognition system. The authors evaluated 25 isolated city names from a speech corpus, CITRON, collected from real users over the telephone network. For this task, improvement in accuracy is shown from a 34% error rate, when trained on the NTIMIT database alone, to 8% after adapting to 30 tokens, on average, from each new word.<>

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing

自引率

0.00%

发文量