Arham Tariq, Ahmad Amin, Yasir Masood, Muhammad Muzaffar, Junaid Iqbal
{"title":"Predicting Early Withdrawal of University Students: A Comparative Study between KNN and Decision Tree","authors":"Arham Tariq, Ahmad Amin, Yasir Masood, Muhammad Muzaffar, Junaid Iqbal","doi":"10.1109/ICACS55311.2023.10089706","DOIUrl":null,"url":null,"abstract":"“The rising trend of students dropping out of universities without completing their degrees is becoming a concerning issue for institutions. To address this problem, the reasons behind this phenomenon need to be explored. However, most educational data sets have small sample sizes and varying patterns. Currently, there are few machine learning approaches for Pakistani higher education student performance. This study presents a machine learning-based approach to predict student withdrawals and identify the reasons behind them. The proposed approach compares two supervised ML algorithms, K-N earestNeighbors (KNN) and Decision-Tree (DT). The most important attributes affecting student retention are also determined using the ExtraTreesClassifier ensemble learning algorithm. In our experimental evaluation, the accuracy of KNN was 75%, and 70% for DT.”","PeriodicalId":357522,"journal":{"name":"2023 4th International Conference on Advancements in Computational Sciences (ICACS)","volume":"232 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference on Advancements in Computational Sciences (ICACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACS55311.2023.10089706","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
“The rising trend of students dropping out of universities without completing their degrees is becoming a concerning issue for institutions. To address this problem, the reasons behind this phenomenon need to be explored. However, most educational data sets have small sample sizes and varying patterns. Currently, there are few machine learning approaches for Pakistani higher education student performance. This study presents a machine learning-based approach to predict student withdrawals and identify the reasons behind them. The proposed approach compares two supervised ML algorithms, K-N earestNeighbors (KNN) and Decision-Tree (DT). The most important attributes affecting student retention are also determined using the ExtraTreesClassifier ensemble learning algorithm. In our experimental evaluation, the accuracy of KNN was 75%, and 70% for DT.”