Machine Learning based human recognition via robust Features from audio signals

2023 3rd International Conference on Artificial Intelligence (ICAI) Pub Date : 2023-02-22 DOI:10.1109/ICAI58407.2023.10136683

U. Sadique, Muhammad Suleman Khan, S. Anwar, Mehran Ahmad

{"title":"Machine Learning based human recognition via robust Features from audio signals","authors":"U. Sadique, Muhammad Suleman Khan, S. Anwar, Mehran Ahmad","doi":"10.1109/ICAI58407.2023.10136683","DOIUrl":null,"url":null,"abstract":"Biometric verification techniques are commissioned throughout the world in different applications. Human voice recognition is one of the biometric techniques. This technique consists of identifying a human from their voice characteristic. This popular and beneficial biometric technique could be employed for identity human, security purposes, and many different applications. Human audio signal recognition consists of two phases i.e., Features Extraction and Classification. The proposed work consists of extracting features through the Mel Frequency Cepstral-Coefficient (MFCC) from the human audio signal, selecting robust features through Principal Component Analysis PCA, and classifying the selected features by comparing seven Machine Learning and proposed deep learning algorithms. Finally, compare the performance of different algorithms with different percentages of selected features to evaluate the acceptance rate of the correlated features. Support Vector Machine SVM shows the best performance with an Accuracy of 99.27% with F1-score and ROC values of 1.00. In the comparison with other methods, the Random Forest and CNN-ANN are 2ndtop robust models with an accuracy of 98.7%. Some of the algorithm's accuracy decreased with fewer features, including Naïve Bayes accuracy suddenly decreases to 60% on 20% of total features. The experiment concludes the acceptance rate of correlated features in different ML and DL algorithms are different in speech processing data.","PeriodicalId":161809,"journal":{"name":"2023 3rd International Conference on Artificial Intelligence (ICAI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Artificial Intelligence (ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAI58407.2023.10136683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Biometric verification techniques are commissioned throughout the world in different applications. Human voice recognition is one of the biometric techniques. This technique consists of identifying a human from their voice characteristic. This popular and beneficial biometric technique could be employed for identity human, security purposes, and many different applications. Human audio signal recognition consists of two phases i.e., Features Extraction and Classification. The proposed work consists of extracting features through the Mel Frequency Cepstral-Coefficient (MFCC) from the human audio signal, selecting robust features through Principal Component Analysis PCA, and classifying the selected features by comparing seven Machine Learning and proposed deep learning algorithms. Finally, compare the performance of different algorithms with different percentages of selected features to evaluate the acceptance rate of the correlated features. Support Vector Machine SVM shows the best performance with an Accuracy of 99.27% with F1-score and ROC values of 1.00. In the comparison with other methods, the Random Forest and CNN-ANN are 2ndtop robust models with an accuracy of 98.7%. Some of the algorithm's accuracy decreased with fewer features, including Naïve Bayes accuracy suddenly decreases to 60% on 20% of total features. The experiment concludes the acceptance rate of correlated features in different ML and DL algorithms are different in speech processing data.

查看原文本刊更多论文

基于机器学习的人类识别，通过音频信号的鲁棒特征

生物识别验证技术在世界各地的不同应用中得到委托。人的声音识别是生物识别技术的一种。这项技术包括通过声音特征来识别一个人。这种流行且有益的生物识别技术可用于身份识别、安全目的和许多不同的应用程序。人类音频信号识别包括特征提取和分类两个阶段。提出的工作包括通过Mel频率倒频谱系数(MFCC)从人类音频信号中提取特征，通过主成分分析PCA选择鲁棒特征，并通过比较七种机器学习和提出的深度学习算法对所选特征进行分类。最后，比较不同算法在选择特征的不同百分比下的性能，评估相关特征的接受率。支持向量机SVM表现最好，准确率为99.27%，评分为f1, ROC值为1.00。与其他方法相比，Random Forest和CNN-ANN模型的鲁棒性达到了98.7%。一些算法的准确率随着特征的减少而下降，包括Naïve贝叶斯准确率突然下降到20%的总特征的60%。实验表明，在语音处理数据中，不同ML和DL算法对相关特征的接受率不同。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 3rd International Conference on Artificial Intelligence (ICAI)

自引率

0.00%

发文量