{"title":"Gender identification from speech signal by examining the speech production characteristics","authors":"Esther Ramdinmawii, V. K. Mittal","doi":"10.1109/ICSPCOM.2016.7980584","DOIUrl":null,"url":null,"abstract":"The term gender identification deals with finding out the gender of a person from his or her voice. Gender identification has been implemented in several Automatic Speaker Recognition (ASR) systems and has proved to be of great significance. The use of gender identification in today's technology makes it easier for user authentication and identification in high security systems. In this paper, we have discussed about the gender identification process for speech signals using three different features namely Pitch using autocorrelation, Signal energy and Mel Frequency Cepstral Coefficients (MFCCs). A linear Support Vector Machine (SVM) classifier was used for classification of features extracted from the speech signal using signal processing methods. Two sets of experiments were performed - in the first experiment, one speech file was tested against one training file as a one-on-one experiment. In the second experiment, one speech file was tested against three training files. The average accuracy of the second experiment was slightly higher than the first experiment. Performance evaluation results are encouraging. The approach can be used in wide range of applications.","PeriodicalId":213713,"journal":{"name":"2016 International Conference on Signal Processing and Communication (ICSC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Signal Processing and Communication (ICSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPCOM.2016.7980584","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20
Abstract
The term gender identification deals with finding out the gender of a person from his or her voice. Gender identification has been implemented in several Automatic Speaker Recognition (ASR) systems and has proved to be of great significance. The use of gender identification in today's technology makes it easier for user authentication and identification in high security systems. In this paper, we have discussed about the gender identification process for speech signals using three different features namely Pitch using autocorrelation, Signal energy and Mel Frequency Cepstral Coefficients (MFCCs). A linear Support Vector Machine (SVM) classifier was used for classification of features extracted from the speech signal using signal processing methods. Two sets of experiments were performed - in the first experiment, one speech file was tested against one training file as a one-on-one experiment. In the second experiment, one speech file was tested against three training files. The average accuracy of the second experiment was slightly higher than the first experiment. Performance evaluation results are encouraging. The approach can be used in wide range of applications.