{"title":"Speech Mel Frequency Cepstral Coefficient feature classification using multi level support vector machine","authors":"Abhay Kumar, Sidhartha Sankar Rout, Varun Goel","doi":"10.1109/UPCON.2017.8251036","DOIUrl":null,"url":null,"abstract":"This paper presents the combined application of machine learning algorithm MLSVM (multi level support vector machine) and feature vector MFCC (Mel Frequency Cepstral Coefficient) to improve the result of speech recognition in comparison to other algorithm which involve formant as feature vector and KNN (k-nearest neighbour), Tree, LDA (linear discriminant analysis) and QDA (quadrature discriminant analysis) as machine learning algorithms. Problem with a machine learning algorithm like KNN, Tree, LDA and QDA is that the accuracy of recognition is limited to increase in the database and training set percentage with respect to total a database available. In this paper analysis and comparison is done between feature vector inspired by speech generation and hearing model to improve the result of recognition for larger database of twenty words. The twenty words are used to generate forty commands which are sufficient to control the on and off state of twenty homes or office appliances. The paper also presents the brief analysis of the effect of training set percentage on the accuracy.","PeriodicalId":422673,"journal":{"name":"2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON.2017.8251036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
This paper presents the combined application of machine learning algorithm MLSVM (multi level support vector machine) and feature vector MFCC (Mel Frequency Cepstral Coefficient) to improve the result of speech recognition in comparison to other algorithm which involve formant as feature vector and KNN (k-nearest neighbour), Tree, LDA (linear discriminant analysis) and QDA (quadrature discriminant analysis) as machine learning algorithms. Problem with a machine learning algorithm like KNN, Tree, LDA and QDA is that the accuracy of recognition is limited to increase in the database and training set percentage with respect to total a database available. In this paper analysis and comparison is done between feature vector inspired by speech generation and hearing model to improve the result of recognition for larger database of twenty words. The twenty words are used to generate forty commands which are sufficient to control the on and off state of twenty homes or office appliances. The paper also presents the brief analysis of the effect of training set percentage on the accuracy.