Yi-Long Wan, Tian-qi Zhang, Zhi-Chao Wang, Jing Jin
{"title":"基于多波段谱减法的鲁棒语音识别","authors":"Yi-Long Wan, Tian-qi Zhang, Zhi-Chao Wang, Jing Jin","doi":"10.1109/CISP.2013.6744019","DOIUrl":null,"url":null,"abstract":"In order to reduce the degradation of the speech recognition accuracy while the testing condition are mismatched with the training condition around noisy environment, a kind of multi-band spectral subtraction has been proposed. The estimated noise signals were extracted from the first few frames of the noisy speech. The noisy speech and estimation of noise signals by the frequency were divided into non-overlapping M frequency bands. According to the SNR (signal-to-noise ratio) of noise speech in each frequency band, the band noise spectral subtraction parameters can be determined. The front-end speech enhancement module and the speech recognizer constitute a robust speech recognition system. The results of simulation experiments indicate that the recognition accuracy of multi-band spectral subtraction robust speech recognition system is obviously superior to the basic spectral subtraction in different signal-to-noise ratios and different noise's types.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Robust speech recognition based on multi-band spectral subtraction\",\"authors\":\"Yi-Long Wan, Tian-qi Zhang, Zhi-Chao Wang, Jing Jin\",\"doi\":\"10.1109/CISP.2013.6744019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to reduce the degradation of the speech recognition accuracy while the testing condition are mismatched with the training condition around noisy environment, a kind of multi-band spectral subtraction has been proposed. The estimated noise signals were extracted from the first few frames of the noisy speech. The noisy speech and estimation of noise signals by the frequency were divided into non-overlapping M frequency bands. According to the SNR (signal-to-noise ratio) of noise speech in each frequency band, the band noise spectral subtraction parameters can be determined. The front-end speech enhancement module and the speech recognizer constitute a robust speech recognition system. The results of simulation experiments indicate that the recognition accuracy of multi-band spectral subtraction robust speech recognition system is obviously superior to the basic spectral subtraction in different signal-to-noise ratios and different noise's types.\",\"PeriodicalId\":442320,\"journal\":{\"name\":\"2013 6th International Congress on Image and Signal Processing (CISP)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 6th International Congress on Image and Signal Processing (CISP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISP.2013.6744019\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 6th International Congress on Image and Signal Processing (CISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP.2013.6744019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Robust speech recognition based on multi-band spectral subtraction
In order to reduce the degradation of the speech recognition accuracy while the testing condition are mismatched with the training condition around noisy environment, a kind of multi-band spectral subtraction has been proposed. The estimated noise signals were extracted from the first few frames of the noisy speech. The noisy speech and estimation of noise signals by the frequency were divided into non-overlapping M frequency bands. According to the SNR (signal-to-noise ratio) of noise speech in each frequency band, the band noise spectral subtraction parameters can be determined. The front-end speech enhancement module and the speech recognizer constitute a robust speech recognition system. The results of simulation experiments indicate that the recognition accuracy of multi-band spectral subtraction robust speech recognition system is obviously superior to the basic spectral subtraction in different signal-to-noise ratios and different noise's types.