{"title":"Robust front-end based on MVA processing for Arabic speech recognition","authors":"Elhem Techini, Z. Sakka, M. Bouhlel","doi":"10.1109/ICEMIS.2017.8273064","DOIUrl":null,"url":null,"abstract":"This paper presents a noise robust technique for arabic automatic speech recognition engine. The technique is based on Cepstral Mean and Variance Normalization (CMVN) plus Auto Regressive Moving Average (ARMA) filtering which is called MVA. MVA used as a post-processing module to Mel Frequency Cepstral Coefficients (MFCC), Relative Spectral-Perceptual Linear Prediction (RASTA-PLP) and Power Normalized Cepstral Coefficients (PNCC) features to improve the recognition accuracy. While an isolated Arabic word engine was designed and developed using the Hidden Markov Model (HMM) to perform the recognition process at the back-end. Experimental results on the Arabic database demonstrate that our method provides substantial improvements in recognition accuracy for all features. The results also demonstrate that RASTA-PLP outperforms PNCC and MFCC features for word correction and word accuracy.","PeriodicalId":117908,"journal":{"name":"2017 International Conference on Engineering & MIS (ICEMIS)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Engineering & MIS (ICEMIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEMIS.2017.8273064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a noise robust technique for arabic automatic speech recognition engine. The technique is based on Cepstral Mean and Variance Normalization (CMVN) plus Auto Regressive Moving Average (ARMA) filtering which is called MVA. MVA used as a post-processing module to Mel Frequency Cepstral Coefficients (MFCC), Relative Spectral-Perceptual Linear Prediction (RASTA-PLP) and Power Normalized Cepstral Coefficients (PNCC) features to improve the recognition accuracy. While an isolated Arabic word engine was designed and developed using the Hidden Markov Model (HMM) to perform the recognition process at the back-end. Experimental results on the Arabic database demonstrate that our method provides substantial improvements in recognition accuracy for all features. The results also demonstrate that RASTA-PLP outperforms PNCC and MFCC features for word correction and word accuracy.