一种鲁棒的语音自动识别特征归一化算法

2009 International Joint Conference on Artificial Intelligence Pub Date : 2009-04-25 DOI:10.1109/JCAI.2009.208

Jianjun Lei, Zhendi Yang, Jian Wang

{"title":"一种鲁棒的语音自动识别特征归一化算法","authors":"Jianjun Lei, Zhendi Yang, Jian Wang","doi":"10.1109/JCAI.2009.208","DOIUrl":null,"url":null,"abstract":"In this paper, we present an effective feature normalization algorithm to improve the robustness of automatic speech recognition systems. At front-end, minimum mean square error log-spectral amplitude estimation speech enhancement is adopted to suppress noise from noisy speech. Then, at back-end, the histogram equalization feature normalization is used to deal with the residual mismatch between enhanced speech and clean speech. We have evaluated recognition performance under noisy environments using NOISEX-92 database and recorded speech signals in continuous speech recognition task. Experimental results show that our approach exhibits considerable improvements in the degraded environment.","PeriodicalId":154425,"journal":{"name":"2009 International Joint Conference on Artificial Intelligence","volume":"40 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Robust Feature Normalization Algorithm for Automatic Speech Recognition\",\"authors\":\"Jianjun Lei, Zhendi Yang, Jian Wang\",\"doi\":\"10.1109/JCAI.2009.208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present an effective feature normalization algorithm to improve the robustness of automatic speech recognition systems. At front-end, minimum mean square error log-spectral amplitude estimation speech enhancement is adopted to suppress noise from noisy speech. Then, at back-end, the histogram equalization feature normalization is used to deal with the residual mismatch between enhanced speech and clean speech. We have evaluated recognition performance under noisy environments using NOISEX-92 database and recorded speech signals in continuous speech recognition task. Experimental results show that our approach exhibits considerable improvements in the degraded environment.\",\"PeriodicalId\":154425,\"journal\":{\"name\":\"2009 International Joint Conference on Artificial Intelligence\",\"volume\":\"40 11\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Joint Conference on Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JCAI.2009.208\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Joint Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCAI.2009.208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种有效的特征归一化算法来提高自动语音识别系统的鲁棒性。前端采用最小均方误差对数谱幅度估计语音增强来抑制噪声语音。然后在后端使用直方图均衡化特征归一化处理增强语音与干净语音之间的残差不匹配。我们使用NOISEX-92数据库评估了噪声环境下的识别性能，并在连续语音识别任务中记录了语音信号。实验结果表明，我们的方法在退化环境中表现出相当大的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Robust Feature Normalization Algorithm for Automatic Speech Recognition

In this paper, we present an effective feature normalization algorithm to improve the robustness of automatic speech recognition systems. At front-end, minimum mean square error log-spectral amplitude estimation speech enhancement is adopted to suppress noise from noisy speech. Then, at back-end, the histogram equalization feature normalization is used to deal with the residual mismatch between enhanced speech and clean speech. We have evaluated recognition performance under noisy environments using NOISEX-92 database and recorded speech signals in continuous speech recognition task. Experimental results show that our approach exhibits considerable improvements in the degraded environment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2009 International Joint Conference on Artificial Intelligence

自引率

0.00%

发文量