Yoga Handoko Agustin, Fitri Nuraeni, D. Kurniadi, Y. Septiana, A. Mulyani, W. Baswardono
{"title":"Comparison of SMOTE Sampling Based Algorithm on Imbalanced Data for Classification of New Student Admissions","authors":"Yoga Handoko Agustin, Fitri Nuraeni, D. Kurniadi, Y. Septiana, A. Mulyani, W. Baswardono","doi":"10.1109/ICISS53185.2021.9533243","DOIUrl":null,"url":null,"abstract":"One of the efforts to get quality students is through selection. The selection process must be balanced with a strategy so that the selected students are truly qualified. Classification techniques can be used to see the history of new student admissions who are accepted with the student’s lecture history. There are many classification algorithms that can be used, so comparisons need to be made to see the best performance of the algorithm. The classification algorithm used is Decision Tree C4.5, K-Nearest Neighbor, Naïve Bayes and Neural Network. The data used are 546 records in the imbalanced data category. So we need the Smote algorithm to make the data balanced so as not to result in misclassification. The classification results were tested using the Confusion Matrix, ROC and Geometric Mean (G-Mean) as well as a T-Test. The comparison results show that the best performance is on the K-Nearest Neighbor algorithm with an accuracy value of 84.99%, AUC of 0.700, G-Mean 62.95% and the T-test produces a significant different from other algorithms.","PeriodicalId":220371,"journal":{"name":"2021 International Conference on ICT for Smart Society (ICISS)","volume":"228 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on ICT for Smart Society (ICISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISS53185.2021.9533243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
One of the efforts to get quality students is through selection. The selection process must be balanced with a strategy so that the selected students are truly qualified. Classification techniques can be used to see the history of new student admissions who are accepted with the student’s lecture history. There are many classification algorithms that can be used, so comparisons need to be made to see the best performance of the algorithm. The classification algorithm used is Decision Tree C4.5, K-Nearest Neighbor, Naïve Bayes and Neural Network. The data used are 546 records in the imbalanced data category. So we need the Smote algorithm to make the data balanced so as not to result in misclassification. The classification results were tested using the Confusion Matrix, ROC and Geometric Mean (G-Mean) as well as a T-Test. The comparison results show that the best performance is on the K-Nearest Neighbor algorithm with an accuracy value of 84.99%, AUC of 0.700, G-Mean 62.95% and the T-test produces a significant different from other algorithms.