{"title":"Feature extraction based on DCT and MVDR spectral estimation for robust speech recognition","authors":"S. Seyedin, M. Ahadi","doi":"10.1109/ICOSP.2008.4697205","DOIUrl":null,"url":null,"abstract":"This paper proposes a new noise robust feature extraction method for speech recognition. It is based on the discrete cosine transform and minimum variance distortionless response (MVDR) methods of spectrum estimation and differential power spectrum technique. The large bias drawback of the periodogram method can be solved by using DCT instead of FFT. The MVDR method can also increase the robustness of the features by reducing the variance of the estimated power spectrum. The above method, when evaluated on Test set A of Aurora 2 task, gave a relative improvement of up to 63.3% in recognition accuracy in comparison with MFCC as the baseline.","PeriodicalId":445699,"journal":{"name":"2008 9th International Conference on Signal Processing","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 9th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2008.4697205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
This paper proposes a new noise robust feature extraction method for speech recognition. It is based on the discrete cosine transform and minimum variance distortionless response (MVDR) methods of spectrum estimation and differential power spectrum technique. The large bias drawback of the periodogram method can be solved by using DCT instead of FFT. The MVDR method can also increase the robustness of the features by reducing the variance of the estimated power spectrum. The above method, when evaluated on Test set A of Aurora 2 task, gave a relative improvement of up to 63.3% in recognition accuracy in comparison with MFCC as the baseline.