Algorithmic Prediction of Students On-Time Graduation from the University

TEM Journal Pub Date : 2024-02-27 DOI:10.18421/tem131-72
Ayman Alfahid
{"title":"Algorithmic Prediction of Students On-Time Graduation from the University","authors":"Ayman Alfahid","doi":"10.18421/tem131-72","DOIUrl":null,"url":null,"abstract":"This study develops statistical learning models to assess the probability of undergraduate students graduating within a predetermined period, utilizing admission, performance, and demographic data. The urgency of addressing student attrition is highlighted by recent data from the National Center for Education Statistics (NCES), indicating a 59% completion rate by full-time undergraduates within six years. This research leverages institutional data from a Saudi University, focusing on freshmen enrolled in the 2012-2013 and 2013-2014 academic years, to identify students at risk of dropping out, thereby enabling timely interventions. Ten algorithms, including decision trees, ensemble models, SVM, and ANN, were built and evaluated on a test set representing 33.3% of the entire dataset using precision, recall, accuracy, and Matthews correlation coefficient (MCC). The findings show that SVM and Random Forest models were the most reliable, achieving accuracies of 0.830 and 0.831 respectively, and maintaining balance in precision, recall, and MCC. Conversely, the naïve Bayes model recorded the worst performance. The comparative analysis revealed the superior performance of ensemble models over decision tree models in predicting student attrition, emphasizing the importance of model selection in developing effective early intervention strategies. In addition, our analysis revealed that academic data is a better predictor of on-time graduation than admission data, emphasizing the need for institutions to focus on continuous academic assessment data.","PeriodicalId":515899,"journal":{"name":"TEM Journal","volume":"30 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"TEM Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18421/tem131-72","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study develops statistical learning models to assess the probability of undergraduate students graduating within a predetermined period, utilizing admission, performance, and demographic data. The urgency of addressing student attrition is highlighted by recent data from the National Center for Education Statistics (NCES), indicating a 59% completion rate by full-time undergraduates within six years. This research leverages institutional data from a Saudi University, focusing on freshmen enrolled in the 2012-2013 and 2013-2014 academic years, to identify students at risk of dropping out, thereby enabling timely interventions. Ten algorithms, including decision trees, ensemble models, SVM, and ANN, were built and evaluated on a test set representing 33.3% of the entire dataset using precision, recall, accuracy, and Matthews correlation coefficient (MCC). The findings show that SVM and Random Forest models were the most reliable, achieving accuracies of 0.830 and 0.831 respectively, and maintaining balance in precision, recall, and MCC. Conversely, the naïve Bayes model recorded the worst performance. The comparative analysis revealed the superior performance of ensemble models over decision tree models in predicting student attrition, emphasizing the importance of model selection in developing effective early intervention strategies. In addition, our analysis revealed that academic data is a better predictor of on-time graduation than admission data, emphasizing the need for institutions to focus on continuous academic assessment data.
学生按时从大学毕业的算法预测
本研究开发了统计学习模型,利用录取、成绩和人口统计数据,评估本科生在预定时间内毕业的概率。美国国家教育统计中心(NCES)的最新数据显示,全日制本科生六年内的毕业率为 59%,这凸显了解决学生流失问题的紧迫性。本研究利用沙特一所大学的机构数据,重点关注2012-2013和2013-2014学年入学的新生,以识别有辍学风险的学生,从而及时采取干预措施。在占整个数据集 33.3% 的测试集上,使用精确度、召回率、准确度和马修斯相关系数 (MCC) 建立并评估了十种算法,包括决策树、集合模型、SVM 和 ANN。结果表明,SVM 和随机森林模型最可靠,准确率分别达到 0.830 和 0.831,并在精确度、召回率和 MCC 方面保持平衡。相反,天真贝叶斯模型的性能最差。对比分析表明,在预测学生流失方面,集合模型的表现优于决策树模型,这强调了模型选择在制定有效的早期干预策略中的重要性。此外,我们的分析表明,学业数据比录取数据更能预测学生的按时毕业情况,这强调了各院校关注持续学业评估数据的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信