M. A. N., Swaroop L R, S. Hegde, Sourabh U, Rakshith Gowda G S
{"title":"Gender Identification for Kannada Names","authors":"M. A. N., Swaroop L R, S. Hegde, Sourabh U, Rakshith Gowda G S","doi":"10.1109/ICAIT47043.2019.8987346","DOIUrl":null,"url":null,"abstract":"Gender identification using the name of a person, specifically for Kannada names, is a challenging task. We present a classification approach for gender prediction of Kannada names represented in Kannada Unicode. We have determined various features derived from extensive morphological analysis of the names in Kannada. Some of the features identified are indigenous to Kannada Language. In this work we have developed three different classification models using Support Vector Machine (SVM), Random Forest and Naive Bayes machine learning algorithms. Our system reports a top accuracy of 90.1%, F1 score of 90.1% for male names and 90.0% for female names.","PeriodicalId":221994,"journal":{"name":"2019 1st International Conference on Advances in Information Technology (ICAIT)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 1st International Conference on Advances in Information Technology (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT47043.2019.8987346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Gender identification using the name of a person, specifically for Kannada names, is a challenging task. We present a classification approach for gender prediction of Kannada names represented in Kannada Unicode. We have determined various features derived from extensive morphological analysis of the names in Kannada. Some of the features identified are indigenous to Kannada Language. In this work we have developed three different classification models using Support Vector Machine (SVM), Random Forest and Naive Bayes machine learning algorithms. Our system reports a top accuracy of 90.1%, F1 score of 90.1% for male names and 90.0% for female names.