J. Dai, V. Vijayarajan, Xuan Peng, Li Tan, Jean Jiang
Speech Recognition Using Sparse Discrete Wavelet Decomposition Feature Extraction
2018 IEEE International Conference on Electro/Information Technology (EIT), 2018-05-03
DOI: 10.1109/EIT.2018.8500254
Citations: 7
Abstract
This paper proposes a new feature extraction algorithm for speech recognition based on sparse discrete wavelet decomposition (SDWD). The recognition system consists of the following stages: speech data acquisition and preprocessing, speech signal decomposition using the SDWD, feature extraction, and classification with an artificial neural network (ANN). The SDWD decomposes the speech signal into band signals according to the Mel filter bank frequency specifications. As in the Mel frequency cepstral coefficient (MFCC) method, the logarithms of the filter bank energies are computed, and a discrete cosine transform (DCT) is then applied to these logarithmic values to extract the features. Experimental results using the ANN classifier show that the proposed SDWD feature extraction algorithm outperforms the MFCC and discrete wavelet packet transform (DWPT) algorithms.
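The last feature extraction step described in the abstract (log of the band energies followed by a DCT) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the SDWD itself is not specified in the abstract, so the band signals are assumed to be already available as a list of arrays. The names `dct_ii` and `band_features`, the `num_coeffs` parameter, the `eps` offset, and the unnormalized DCT-II are all assumptions made for illustration.

```python
import numpy as np


def dct_ii(x):
    """Unnormalized DCT-II of a 1-D array (scaling convention is an assumption)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    k = np.arange(n)[:, None]  # output coefficient index
    m = np.arange(n)[None, :]  # input sample index
    basis = np.cos(np.pi * (m + 0.5) * k / n)
    return basis @ x


def band_features(band_signals, num_coeffs=None, eps=1e-12):
    """Log band energies followed by a DCT, as in the MFCC-style step.

    band_signals: hypothetical list of 1-D arrays, one per frequency band,
                  standing in for the output of the decomposition stage.
    num_coeffs:   optional number of leading DCT coefficients to keep.
    eps:          small offset so the log of a zero-energy band stays finite.
    """
    energies = np.array(
        [np.sum(np.square(np.asarray(b, dtype=float))) for b in band_signals]
    )
    log_energies = np.log(energies + eps)
    coeffs = dct_ii(log_energies)
    return coeffs if num_coeffs is None else coeffs[:num_coeffs]
```

A call such as `band_features(bands, num_coeffs=13)` would return one feature vector per frame of band signals; how many coefficients the paper keeps is not stated in the abstract, so 13 is only a placeholder.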