T. Mokgonyane, T. Sefara, T. Modipa, Mercy Mosibudi Mogale, M. J. Manamela, P. J. Manamela
{"title":"基于机器学习算法的说话人自动识别系统","authors":"T. Mokgonyane, T. Sefara, T. Modipa, Mercy Mosibudi Mogale, M. J. Manamela, P. J. Manamela","doi":"10.1109/ROBOMECH.2019.8704837","DOIUrl":null,"url":null,"abstract":"Speaker recognition is a technique used to automatically recognize a speaker from a recording of their voice or speech utterance. Speaker recognition technology has improved over recent years and has become inexpensive and and reliable method for person identification and verification. Research in the field of speaker recognition has now spanned over five decades and has shown fruitful results, however there is not much work done with regards to South African indigenous languages. This paper presents the development of an automatic speaker recognition system that incorporates classification and recognition of Sepedi home language speakers. Four classifier models, namely, Support Vector Machines, K-Nearest Neighbors, Multilayer Perceptrons (MLP) and Random Forest (RF), are trained using WEKA data mining tool. Auto-WEKA is applied to determine the best classifier model together with its best hyper-parameters. The performance of each model is evaluated in WEKA using 10-fold cross validation. MLP and RF yielded good accuracy surpassing the state-of-the-art with an accuracy of 97% and 99.9% respectively, the RF model is then implemented on a graphical user interface for development testing.","PeriodicalId":344332,"journal":{"name":"2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":"{\"title\":\"Automatic Speaker Recognition System based on Machine Learning Algorithms\",\"authors\":\"T. Mokgonyane, T. Sefara, T. Modipa, Mercy Mosibudi Mogale, M. J. Manamela, P. J. Manamela\",\"doi\":\"10.1109/ROBOMECH.2019.8704837\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speaker recognition is a technique used to automatically recognize a speaker from a recording of their voice or speech utterance. Speaker recognition technology has improved over recent years and has become inexpensive and and reliable method for person identification and verification. Research in the field of speaker recognition has now spanned over five decades and has shown fruitful results, however there is not much work done with regards to South African indigenous languages. This paper presents the development of an automatic speaker recognition system that incorporates classification and recognition of Sepedi home language speakers. Four classifier models, namely, Support Vector Machines, K-Nearest Neighbors, Multilayer Perceptrons (MLP) and Random Forest (RF), are trained using WEKA data mining tool. Auto-WEKA is applied to determine the best classifier model together with its best hyper-parameters. The performance of each model is evaluated in WEKA using 10-fold cross validation. MLP and RF yielded good accuracy surpassing the state-of-the-art with an accuracy of 97% and 99.9% respectively, the RF model is then implemented on a graphical user interface for development testing.\",\"PeriodicalId\":344332,\"journal\":{\"name\":\"2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"26\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROBOMECH.2019.8704837\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBOMECH.2019.8704837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Speaker Recognition System based on Machine Learning Algorithms
Speaker recognition is a technique used to automatically recognize a speaker from a recording of their voice or speech utterance. Speaker recognition technology has improved over recent years and has become inexpensive and and reliable method for person identification and verification. Research in the field of speaker recognition has now spanned over five decades and has shown fruitful results, however there is not much work done with regards to South African indigenous languages. This paper presents the development of an automatic speaker recognition system that incorporates classification and recognition of Sepedi home language speakers. Four classifier models, namely, Support Vector Machines, K-Nearest Neighbors, Multilayer Perceptrons (MLP) and Random Forest (RF), are trained using WEKA data mining tool. Auto-WEKA is applied to determine the best classifier model together with its best hyper-parameters. The performance of each model is evaluated in WEKA using 10-fold cross validation. MLP and RF yielded good accuracy surpassing the state-of-the-art with an accuracy of 97% and 99.9% respectively, the RF model is then implemented on a graphical user interface for development testing.