Voice-based Gender and Age Recognition System

Vinayak Sudhakar Kone, A. Anagal, Swaroop Anegundi, Pranali Jadhav, Uday Kulkarni, M. M
{"title":"Voice-based Gender and Age Recognition System","authors":"Vinayak Sudhakar Kone, A. Anagal, Swaroop Anegundi, Pranali Jadhav, Uday Kulkarni, M. M","doi":"10.1109/InCACCT57535.2023.10141801","DOIUrl":null,"url":null,"abstract":"The ability to detect gender and age from voice is a valuable tool in a variety of applications, like voice-based biometric identification, natural language processing, and speech recognition. Recent advances in Deep Learning have enabled the development of highly accurate gender and age detection models. In this paper, the discussion is about the Machine Learning based gender and age detection model using voice. The various approaches used to extract features from speech, and the data-set used for model evaluation and classification are obtained using different Machine Learning algorithms. The discussion is about the opportunities and challenges in this area of research. It is concluded by highlighting some of the open challenges and future directions in this field. Age prediction from voice using a grid search pipeline is a Machine Learning technique that uses a range of algorithms to detect the age of a person using their voice. In the proposed model, RobustScalar, Principal component analysis (PCA), and Logistic Regression algorithms are used. The grid search pipeline uses a combination of models to identify the best age prediction algorithm for a given data-set. For Gender prediction sequential model with 5 hidden layers has been used. The results were obtained based on the trained model for the common voice data-set with an accuracy of around 91% for gender and 59% for age.","PeriodicalId":405272,"journal":{"name":"2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/InCACCT57535.2023.10141801","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The ability to detect gender and age from voice is a valuable tool in a variety of applications, like voice-based biometric identification, natural language processing, and speech recognition. Recent advances in Deep Learning have enabled the development of highly accurate gender and age detection models. In this paper, the discussion is about the Machine Learning based gender and age detection model using voice. The various approaches used to extract features from speech, and the data-set used for model evaluation and classification are obtained using different Machine Learning algorithms. The discussion is about the opportunities and challenges in this area of research. It is concluded by highlighting some of the open challenges and future directions in this field. Age prediction from voice using a grid search pipeline is a Machine Learning technique that uses a range of algorithms to detect the age of a person using their voice. In the proposed model, RobustScalar, Principal component analysis (PCA), and Logistic Regression algorithms are used. The grid search pipeline uses a combination of models to identify the best age prediction algorithm for a given data-set. For Gender prediction sequential model with 5 hidden layers has been used. The results were obtained based on the trained model for the common voice data-set with an accuracy of around 91% for gender and 59% for age.
基于语音的性别和年龄识别系统
从语音中检测性别和年龄的能力在各种应用中都是一个有价值的工具,比如基于语音的生物识别、自然语言处理和语音识别。深度学习的最新进展使高度准确的性别和年龄检测模型得以发展。本文讨论了基于机器学习的基于语音的性别和年龄检测模型。从语音中提取特征的各种方法,以及用于模型评估和分类的数据集,都是使用不同的机器学习算法获得的。讨论了这一研究领域的机遇和挑战。最后,强调了该领域的一些开放挑战和未来方向。使用网格搜索管道从语音中预测年龄是一种机器学习技术,它使用一系列算法通过语音来检测人的年龄。在提出的模型中,使用了鲁棒标量,主成分分析(PCA)和逻辑回归算法。网格搜索管道使用模型组合来确定给定数据集的最佳年龄预测算法。性别预测采用5个隐层序列模型。结果是基于公共语音数据集的训练模型获得的,性别的准确率约为91%,年龄的准确率约为59%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信