Machine Learning based human recognition via robust Features from audio signals

U. Sadique, Muhammad Suleman Khan, S. Anwar, Mehran Ahmad
{"title":"Machine Learning based human recognition via robust Features from audio signals","authors":"U. Sadique, Muhammad Suleman Khan, S. Anwar, Mehran Ahmad","doi":"10.1109/ICAI58407.2023.10136683","DOIUrl":null,"url":null,"abstract":"Biometric verification techniques are commissioned throughout the world in different applications. Human voice recognition is one of the biometric techniques. This technique consists of identifying a human from their voice characteristic. This popular and beneficial biometric technique could be employed for identity human, security purposes, and many different applications. Human audio signal recognition consists of two phases i.e., Features Extraction and Classification. The proposed work consists of extracting features through the Mel Frequency Cepstral-Coefficient (MFCC) from the human audio signal, selecting robust features through Principal Component Analysis PCA, and classifying the selected features by comparing seven Machine Learning and proposed deep learning algorithms. Finally, compare the performance of different algorithms with different percentages of selected features to evaluate the acceptance rate of the correlated features. Support Vector Machine SVM shows the best performance with an Accuracy of 99.27% with F1-score and ROC values of 1.00. In the comparison with other methods, the Random Forest and CNN-ANN are 2ndtop robust models with an accuracy of 98.7%. Some of the algorithm's accuracy decreased with fewer features, including Naïve Bayes accuracy suddenly decreases to 60% on 20% of total features. The experiment concludes the acceptance rate of correlated features in different ML and DL algorithms are different in speech processing data.","PeriodicalId":161809,"journal":{"name":"2023 3rd International Conference on Artificial Intelligence (ICAI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Artificial Intelligence (ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAI58407.2023.10136683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Biometric verification techniques are commissioned throughout the world in different applications. Human voice recognition is one of the biometric techniques. This technique consists of identifying a human from their voice characteristic. This popular and beneficial biometric technique could be employed for identity human, security purposes, and many different applications. Human audio signal recognition consists of two phases i.e., Features Extraction and Classification. The proposed work consists of extracting features through the Mel Frequency Cepstral-Coefficient (MFCC) from the human audio signal, selecting robust features through Principal Component Analysis PCA, and classifying the selected features by comparing seven Machine Learning and proposed deep learning algorithms. Finally, compare the performance of different algorithms with different percentages of selected features to evaluate the acceptance rate of the correlated features. Support Vector Machine SVM shows the best performance with an Accuracy of 99.27% with F1-score and ROC values of 1.00. In the comparison with other methods, the Random Forest and CNN-ANN are 2ndtop robust models with an accuracy of 98.7%. Some of the algorithm's accuracy decreased with fewer features, including Naïve Bayes accuracy suddenly decreases to 60% on 20% of total features. The experiment concludes the acceptance rate of correlated features in different ML and DL algorithms are different in speech processing data.
基于机器学习的人类识别,通过音频信号的鲁棒特征
生物识别验证技术在世界各地的不同应用中得到委托。人的声音识别是生物识别技术的一种。这项技术包括通过声音特征来识别一个人。这种流行且有益的生物识别技术可用于身份识别、安全目的和许多不同的应用程序。人类音频信号识别包括特征提取和分类两个阶段。提出的工作包括通过Mel频率倒频谱系数(MFCC)从人类音频信号中提取特征,通过主成分分析PCA选择鲁棒特征,并通过比较七种机器学习和提出的深度学习算法对所选特征进行分类。最后,比较不同算法在选择特征的不同百分比下的性能,评估相关特征的接受率。支持向量机SVM表现最好,准确率为99.27%,评分为f1, ROC值为1.00。与其他方法相比,Random Forest和CNN-ANN模型的鲁棒性达到了98.7%。一些算法的准确率随着特征的减少而下降,包括Naïve贝叶斯准确率突然下降到20%的总特征的60%。实验表明,在语音处理数据中,不同ML和DL算法对相关特征的接受率不同。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信