{"title":"A statistical analysis on the impact of noise on MFCC features for speech recognition","authors":"U. Bhattacharjee, Swapnanil Gogoi, Rushali Sharma","doi":"10.1109/ICRAIE.2016.7939548","DOIUrl":null,"url":null,"abstract":"Noise is omnipresent in almost all acoustical environments. The investigation presents here seeks to quantify the impact of noise on mel-frequency cepstral coefficients (MFCC) of speech signal. MFCC is one of the most commonly used features for speech recognition systems. However, it has been observed that performance of MFCC based system degrades drastically with changing noise levels and noise types. In the present study, different noise types at different levels have been added to the clean speech signal and the changes in statistical distribution pattern of the signal has been investigated. Further, performance of two commonly used noise normalization techniques Cepstral Mean and Variance Normalization (CMVN) and Spectral Subtraction (SS) have also been evaluated.","PeriodicalId":400935,"journal":{"name":"2016 International Conference on Recent Advances and Innovations in Engineering (ICRAIE)","volume":"87 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Recent Advances and Innovations in Engineering (ICRAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAIE.2016.7939548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
Noise is omnipresent in almost all acoustical environments. The investigation presents here seeks to quantify the impact of noise on mel-frequency cepstral coefficients (MFCC) of speech signal. MFCC is one of the most commonly used features for speech recognition systems. However, it has been observed that performance of MFCC based system degrades drastically with changing noise levels and noise types. In the present study, different noise types at different levels have been added to the clean speech signal and the changes in statistical distribution pattern of the signal has been investigated. Further, performance of two commonly used noise normalization techniques Cepstral Mean and Variance Normalization (CMVN) and Spectral Subtraction (SS) have also been evaluated.