Speech Mel Frequency Cepstral Coefficient feature classification using multi level support vector machine

Abhay Kumar, Sidhartha Sankar Rout, Varun Goel
{"title":"Speech Mel Frequency Cepstral Coefficient feature classification using multi level support vector machine","authors":"Abhay Kumar, Sidhartha Sankar Rout, Varun Goel","doi":"10.1109/UPCON.2017.8251036","DOIUrl":null,"url":null,"abstract":"This paper presents the combined application of machine learning algorithm MLSVM (multi level support vector machine) and feature vector MFCC (Mel Frequency Cepstral Coefficient) to improve the result of speech recognition in comparison to other algorithm which involve formant as feature vector and KNN (k-nearest neighbour), Tree, LDA (linear discriminant analysis) and QDA (quadrature discriminant analysis) as machine learning algorithms. Problem with a machine learning algorithm like KNN, Tree, LDA and QDA is that the accuracy of recognition is limited to increase in the database and training set percentage with respect to total a database available. In this paper analysis and comparison is done between feature vector inspired by speech generation and hearing model to improve the result of recognition for larger database of twenty words. The twenty words are used to generate forty commands which are sufficient to control the on and off state of twenty homes or office appliances. The paper also presents the brief analysis of the effect of training set percentage on the accuracy.","PeriodicalId":422673,"journal":{"name":"2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON.2017.8251036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

This paper presents the combined application of machine learning algorithm MLSVM (multi level support vector machine) and feature vector MFCC (Mel Frequency Cepstral Coefficient) to improve the result of speech recognition in comparison to other algorithm which involve formant as feature vector and KNN (k-nearest neighbour), Tree, LDA (linear discriminant analysis) and QDA (quadrature discriminant analysis) as machine learning algorithms. Problem with a machine learning algorithm like KNN, Tree, LDA and QDA is that the accuracy of recognition is limited to increase in the database and training set percentage with respect to total a database available. In this paper analysis and comparison is done between feature vector inspired by speech generation and hearing model to improve the result of recognition for larger database of twenty words. The twenty words are used to generate forty commands which are sufficient to control the on and off state of twenty homes or office appliances. The paper also presents the brief analysis of the effect of training set percentage on the accuracy.
基于多级支持向量机的语音梅尔频率倒谱系数特征分类
本文提出了机器学习算法MLSVM (multi - level support vector machine)和特征向量MFCC (Mel Frequency Cepstral Coefficient)的联合应用,与其他以形成峰为特征向量,以KNN (k-nearest neighbour)、Tree、LDA (linear discriminant analysis)和QDA (quadrature discriminant analysis)为机器学习算法的算法相比,提高了语音识别的结果。像KNN, Tree, LDA和QDA这样的机器学习算法的问题是,识别的准确性受到数据库和训练集相对于数据库可用总量的百分比的限制。本文对基于语音生成的特征向量和基于听觉模型的特征向量进行了分析和比较,以改善20个词的大型数据库的识别结果。这20个单词被用来生成40个命令,这些命令足以控制20个家庭或办公室电器的开关状态。本文还简要分析了训练集百分比对准确率的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信