{"title":"基于KNN和SVM的语音数字识别","authors":"R. R. Porle, Suzanih Embok","doi":"10.1109/IICAIET55139.2022.9936761","DOIUrl":null,"url":null,"abstract":"Speech-Based Number Recognition is a system that recognizes numbers based on the speech of the user. Most of the research makes use of English, Bangla, Tamil, etc., but the Malay language has received little attention. In this paper, the Malay numbers one through ten are recognized and implemented on devices consisting primarily of the Arduino UNO, the ELECHOUSE Voice Recognition Module v3, Microphone, and Light Emitting Diode. This system employs database creation, preprocessing, feature extraction, Mel-frequency cepstral coefficients, and classification utilizing using K-Nearest Neighbour and Support Vector Machine. Two experiments were carried out using 900 samples. In the first experiment, 80 percent of the training samples and 20 percent of the test samples were used. The second experiment utilized 70 percent of the training samples and 30 percent of the testing samples. The results show that the Support Vector Machine outperformed K-Nearest Neighbour with an average accuracy of 91.27 percent.","PeriodicalId":142482,"journal":{"name":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech-Based Number Recognition Using KNN and SVM\",\"authors\":\"R. R. Porle, Suzanih Embok\",\"doi\":\"10.1109/IICAIET55139.2022.9936761\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech-Based Number Recognition is a system that recognizes numbers based on the speech of the user. Most of the research makes use of English, Bangla, Tamil, etc., but the Malay language has received little attention. In this paper, the Malay numbers one through ten are recognized and implemented on devices consisting primarily of the Arduino UNO, the ELECHOUSE Voice Recognition Module v3, Microphone, and Light Emitting Diode. This system employs database creation, preprocessing, feature extraction, Mel-frequency cepstral coefficients, and classification utilizing using K-Nearest Neighbour and Support Vector Machine. Two experiments were carried out using 900 samples. In the first experiment, 80 percent of the training samples and 20 percent of the test samples were used. The second experiment utilized 70 percent of the training samples and 30 percent of the testing samples. The results show that the Support Vector Machine outperformed K-Nearest Neighbour with an average accuracy of 91.27 percent.\",\"PeriodicalId\":142482,\"journal\":{\"name\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICAIET55139.2022.9936761\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET55139.2022.9936761","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech-Based Number Recognition is a system that recognizes numbers based on the speech of the user. Most of the research makes use of English, Bangla, Tamil, etc., but the Malay language has received little attention. In this paper, the Malay numbers one through ten are recognized and implemented on devices consisting primarily of the Arduino UNO, the ELECHOUSE Voice Recognition Module v3, Microphone, and Light Emitting Diode. This system employs database creation, preprocessing, feature extraction, Mel-frequency cepstral coefficients, and classification utilizing using K-Nearest Neighbour and Support Vector Machine. Two experiments were carried out using 900 samples. In the first experiment, 80 percent of the training samples and 20 percent of the test samples were used. The second experiment utilized 70 percent of the training samples and 30 percent of the testing samples. The results show that the Support Vector Machine outperformed K-Nearest Neighbour with an average accuracy of 91.27 percent.