Information technology for gender recognition by voice

Vìsnik Nacìonalʹnogo unìversitetu "Lʹvìvsʹka polìtehnìka". Serìâ Ìnformacìjnì sistemi ta merežì Pub Date : 2023-07-15 DOI:10.23939/sisn2023.13.350

Diana Koshtura

{"title":"Information technology for gender recognition by voice","authors":"Diana Koshtura","doi":"10.23939/sisn2023.13.350","DOIUrl":null,"url":null,"abstract":"Gender recognition from voice is a challenging problem in speech processing. This task involves extracting meaningful features from speech signals and classifying them into male or female categories. In this article, was implemented a gender recognition system using Python programming. I first recorded voice samples from both male and female speakers and extracted Mel-frequency cepstral coefficients (MFCC) as features. Then trained, a Support VectorMachine (SVM) classifier was on these features and evaluated its performance using accuracy, precision, recall, and F1-score metrics. These experiments demonstrated that proposed system should achieve high accuracy on the test set and will accurately predict the gender of a speaker based on their voice. I also explored using pre-trained models to reduce the need for large amounts of training data and found that they can provide good performance while requiring less computation. This study highlights the potential of using machine learning techniques for gender recognition from voice and can be extended to other speech processing applications.","PeriodicalId":444399,"journal":{"name":"Vìsnik Nacìonalʹnogo unìversitetu \"Lʹvìvsʹka polìtehnìka\". Serìâ Ìnformacìjnì sistemi ta merežì","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vìsnik Nacìonalʹnogo unìversitetu \"Lʹvìvsʹka polìtehnìka\". Serìâ Ìnformacìjnì sistemi ta merežì","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23939/sisn2023.13.350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Gender recognition from voice is a challenging problem in speech processing. This task involves extracting meaningful features from speech signals and classifying them into male or female categories. In this article, was implemented a gender recognition system using Python programming. I first recorded voice samples from both male and female speakers and extracted Mel-frequency cepstral coefficients (MFCC) as features. Then trained, a Support VectorMachine (SVM) classifier was on these features and evaluated its performance using accuracy, precision, recall, and F1-score metrics. These experiments demonstrated that proposed system should achieve high accuracy on the test set and will accurately predict the gender of a speaker based on their voice. I also explored using pre-trained models to reduce the need for large amounts of training data and found that they can provide good performance while requiring less computation. This study highlights the potential of using machine learning techniques for gender recognition from voice and can be extended to other speech processing applications.

查看原文本刊更多论文

语音性别识别的信息技术

语音性别识别是语音处理中的一个难题。这项任务包括从语音信号中提取有意义的特征，并将其分为男性或女性类别。在本文中，使用Python编程实现了一个性别识别系统。我首先记录了男性和女性扬声器的声音样本，并提取了mel频率倒谱系数(MFCC)作为特征。然后训练支持向量机(SVM)分类器对这些特征进行分类，并使用准确性、精密度、召回率和f1得分指标评估其性能。实验结果表明，该系统在测试集上具有较高的准确率，能够根据说话人的声音准确地预测出说话人的性别。我还探索了使用预训练模型来减少对大量训练数据的需求，并发现它们可以在需要较少计算的同时提供良好的性能。这项研究强调了使用机器学习技术从语音中识别性别的潜力，并且可以扩展到其他语音处理应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Vìsnik Nacìonalʹnogo unìversitetu "Lʹvìvsʹka polìtehnìka". Serìâ Ìnformacìjnì sistemi ta merežì

自引率

0.00%

发文量