J. Dai, V. Vijayarajan, Xuan Peng, Li Tan, Jean Jiang
{"title":"基于稀疏离散小波分解的语音识别特征提取","authors":"J. Dai, V. Vijayarajan, Xuan Peng, Li Tan, Jean Jiang","doi":"10.1109/EIT.2018.8500254","DOIUrl":null,"url":null,"abstract":"In this paper, a new feature extraction algorithm for speech recognition using sparse discrete wavelet decomposition (SDWD) is proposed. The recognition system contains the following stages: speech data acquisition and preprocessing, speech signal decomposition using the SDWD, feature extraction, and artificial neural network (ANN) classifier. The task of the developed SDWD is to decompose speech signal into band signals based on on the Mel filter bank frequency specifications. Similar to the Mel frequency cepstral coefficient (MFCC) method, the logarithmic values of the filter bank energies are computed and then a discrete cosine transform (DCT) is applied to these logarithmic values to extract the feature. Our experimental results using the ANN classifier demonstrate that our proposed SDWD feature extraction algorithm outperforms over the MFCC and discrete wavelet packet transform (DWPT) algorithms.","PeriodicalId":188414,"journal":{"name":"2018 IEEE International Conference on Electro/Information Technology (EIT)","volume":"433 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Speech Recognition Using Sparse Discrete Wavelet Decomposition Feature Extraction\",\"authors\":\"J. Dai, V. Vijayarajan, Xuan Peng, Li Tan, Jean Jiang\",\"doi\":\"10.1109/EIT.2018.8500254\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a new feature extraction algorithm for speech recognition using sparse discrete wavelet decomposition (SDWD) is proposed. The recognition system contains the following stages: speech data acquisition and preprocessing, speech signal decomposition using the SDWD, feature extraction, and artificial neural network (ANN) classifier. The task of the developed SDWD is to decompose speech signal into band signals based on on the Mel filter bank frequency specifications. Similar to the Mel frequency cepstral coefficient (MFCC) method, the logarithmic values of the filter bank energies are computed and then a discrete cosine transform (DCT) is applied to these logarithmic values to extract the feature. Our experimental results using the ANN classifier demonstrate that our proposed SDWD feature extraction algorithm outperforms over the MFCC and discrete wavelet packet transform (DWPT) algorithms.\",\"PeriodicalId\":188414,\"journal\":{\"name\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"volume\":\"433 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT.2018.8500254\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Electro/Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT.2018.8500254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Recognition Using Sparse Discrete Wavelet Decomposition Feature Extraction
In this paper, a new feature extraction algorithm for speech recognition using sparse discrete wavelet decomposition (SDWD) is proposed. The recognition system contains the following stages: speech data acquisition and preprocessing, speech signal decomposition using the SDWD, feature extraction, and artificial neural network (ANN) classifier. The task of the developed SDWD is to decompose speech signal into band signals based on on the Mel filter bank frequency specifications. Similar to the Mel frequency cepstral coefficient (MFCC) method, the logarithmic values of the filter bank energies are computed and then a discrete cosine transform (DCT) is applied to these logarithmic values to extract the feature. Our experimental results using the ANN classifier demonstrate that our proposed SDWD feature extraction algorithm outperforms over the MFCC and discrete wavelet packet transform (DWPT) algorithms.