{"title":"基于MVA处理的阿拉伯语语音识别鲁棒前端","authors":"Elhem Techini, Z. Sakka, M. Bouhlel","doi":"10.1109/ICEMIS.2017.8273064","DOIUrl":null,"url":null,"abstract":"This paper presents a noise robust technique for arabic automatic speech recognition engine. The technique is based on Cepstral Mean and Variance Normalization (CMVN) plus Auto Regressive Moving Average (ARMA) filtering which is called MVA. MVA used as a post-processing module to Mel Frequency Cepstral Coefficients (MFCC), Relative Spectral-Perceptual Linear Prediction (RASTA-PLP) and Power Normalized Cepstral Coefficients (PNCC) features to improve the recognition accuracy. While an isolated Arabic word engine was designed and developed using the Hidden Markov Model (HMM) to perform the recognition process at the back-end. Experimental results on the Arabic database demonstrate that our method provides substantial improvements in recognition accuracy for all features. The results also demonstrate that RASTA-PLP outperforms PNCC and MFCC features for word correction and word accuracy.","PeriodicalId":117908,"journal":{"name":"2017 International Conference on Engineering & MIS (ICEMIS)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Robust front-end based on MVA processing for Arabic speech recognition\",\"authors\":\"Elhem Techini, Z. Sakka, M. Bouhlel\",\"doi\":\"10.1109/ICEMIS.2017.8273064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a noise robust technique for arabic automatic speech recognition engine. The technique is based on Cepstral Mean and Variance Normalization (CMVN) plus Auto Regressive Moving Average (ARMA) filtering which is called MVA. MVA used as a post-processing module to Mel Frequency Cepstral Coefficients (MFCC), Relative Spectral-Perceptual Linear Prediction (RASTA-PLP) and Power Normalized Cepstral Coefficients (PNCC) features to improve the recognition accuracy. While an isolated Arabic word engine was designed and developed using the Hidden Markov Model (HMM) to perform the recognition process at the back-end. Experimental results on the Arabic database demonstrate that our method provides substantial improvements in recognition accuracy for all features. The results also demonstrate that RASTA-PLP outperforms PNCC and MFCC features for word correction and word accuracy.\",\"PeriodicalId\":117908,\"journal\":{\"name\":\"2017 International Conference on Engineering & MIS (ICEMIS)\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Engineering & MIS (ICEMIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEMIS.2017.8273064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Engineering & MIS (ICEMIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEMIS.2017.8273064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Robust front-end based on MVA processing for Arabic speech recognition
This paper presents a noise robust technique for arabic automatic speech recognition engine. The technique is based on Cepstral Mean and Variance Normalization (CMVN) plus Auto Regressive Moving Average (ARMA) filtering which is called MVA. MVA used as a post-processing module to Mel Frequency Cepstral Coefficients (MFCC), Relative Spectral-Perceptual Linear Prediction (RASTA-PLP) and Power Normalized Cepstral Coefficients (PNCC) features to improve the recognition accuracy. While an isolated Arabic word engine was designed and developed using the Hidden Markov Model (HMM) to perform the recognition process at the back-end. Experimental results on the Arabic database demonstrate that our method provides substantial improvements in recognition accuracy for all features. The results also demonstrate that RASTA-PLP outperforms PNCC and MFCC features for word correction and word accuracy.