{"title":"Student Risk Assessment: Predicting Undergraduate Student Graduation Probability Using Logistic Regression, SVM, and ANN","authors":"Darvy P. Ong, J. Pedrasa","doi":"10.1109/TENCON54134.2021.9707322","DOIUrl":null,"url":null,"abstract":"Understanding how different factors affect the performance of a student in the university setting is important in policy making and providing a better environment for learning. Existing studies on student graduation rates typically employ the use of machine learning methods to correlate a student's profile and their chances of graduation. Building on the success of these methods for Western institutions, we used Logistic Regression, Support Vector Machines, and Neural Networks to build models that use available student data to predict their graduation chances. The results show that all three models are good at predicting graduation outcome, with the logistic regression model yielding slightly higher scores in classification accuracy (80.67 %) and class separation (ROC-AUC score of 83.02%). We also found that including as little as four post-matriculation factors increases the model performances significantly. Hence, the models can be used to perform student risk assessment and develop plans to increase a student's chances of graduation.","PeriodicalId":405859,"journal":{"name":"TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON54134.2021.9707322","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Understanding how different factors affect the performance of a student in the university setting is important in policy making and providing a better environment for learning. Existing studies on student graduation rates typically employ the use of machine learning methods to correlate a student's profile and their chances of graduation. Building on the success of these methods for Western institutions, we used Logistic Regression, Support Vector Machines, and Neural Networks to build models that use available student data to predict their graduation chances. The results show that all three models are good at predicting graduation outcome, with the logistic regression model yielding slightly higher scores in classification accuracy (80.67 %) and class separation (ROC-AUC score of 83.02%). We also found that including as little as four post-matriculation factors increases the model performances significantly. Hence, the models can be used to perform student risk assessment and develop plans to increase a student's chances of graduation.