Comparative performance analysis for speech digit recognition based on MFCC and vector quantization
Authors: Datta Rakshith KS, Rudresh MD, Shashibhushsan G
Journal: Global Transitions Proceedings, Volume 2, Issue 2, Pages 513-519 (published 2021-11-01)
DOI: 10.1016/j.gltp.2021.08.013
URL: https://www.sciencedirect.com/science/article/pii/S2666285X21000418
The main goal of this work is to experimentally verify the importance of spoken digit signals for person authentication in control applications. It is motivated by earlier work demonstrating the feasibility of using spoken digit utterances for personal security and control applications. The paper also presents a comparative analysis of cepstral features and mel-frequency cepstral coefficients (MFCC), using vector quantization (VQ) as the feature matching technique. Utterances of the digits zero through nine were collected from 15 subjects in three different sessions. Cepstral and MFCC feature extraction were then applied to the collected data. In the next stage, VQ was used for feature matching on both the cepstral and the MFCC features, and performance was recorded for data from two different sessions. Comparing cepstral plus VQ with MFCC plus VQ, we conclude that MFCC features give better performance than cepstral features for spoken digit utterances.
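The pipeline described in the abstract (frame-level feature extraction followed by VQ feature matching) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no frame length, codebook size, or training algorithm, so the 13 coefficients, the 8-entry codebook, the plain k-means training, and all function names below are assumptions chosen for illustration. The features used for matching are synthetic random vectors standing in for real cepstral or MFCC frames; an MFCC front end would additionally apply a mel filterbank and a discrete cosine transform, which is omitted here.

```python
import numpy as np

def real_cepstrum(frame, n_coeffs=13):
    """Real cepstrum of one frame: inverse FFT of the log magnitude spectrum."""
    windowed = frame * np.hamming(len(frame))
    spectrum = np.abs(np.fft.rfft(windowed))
    log_spectrum = np.log(spectrum + 1e-10)  # small offset avoids log(0)
    return np.fft.irfft(log_spectrum)[:n_coeffs]

def train_codebook(features, size=8, iters=20, seed=0):
    """Plain k-means codebook over feature vectors (rows of `features`).

    Assumption: the paper's VQ training procedure is not stated in the abstract.
    """
    rng = np.random.default_rng(seed)
    codebook = features[rng.choice(len(features), size, replace=False)].copy()
    for _ in range(iters):
        dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(size):
            members = features[nearest == k]
            if len(members):  # keep the old codeword if a cell is empty
                codebook[k] = members.mean(axis=0)
    return codebook

def distortion(features, codebook):
    """Average distance from each feature vector to its nearest codeword."""
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.min(axis=1).mean()

def classify(features, codebooks):
    """Return the label whose codebook gives the lowest average distortion."""
    return min(codebooks, key=lambda label: distortion(features, codebooks[label]))

# Hypothetical data: synthetic 13-dimensional "feature frames" for two classes.
rng = np.random.default_rng(1)
train = {
    "zero": rng.normal(0.0, 1.0, (200, 13)),
    "one": rng.normal(5.0, 1.0, (200, 13)),
}
codebooks = {label: train_codebook(x) for label, x in train.items()}
test_utterance = rng.normal(5.0, 1.0, (50, 13))
predicted = classify(test_utterance, codebooks)
```

In this sketch one codebook is trained per class label, and a test utterance is assigned to the class whose codebook quantizes its frames with the lowest average distortion. The labels could equally be speaker identities in an authentication setting.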