Latent Factors for Consistently Predicting Student Success

R. Rawatlal, M. Chetty, Andrew Kisten Naicker
{"title":"Latent Factors for Consistently Predicting Student Success","authors":"R. Rawatlal, M. Chetty, Andrew Kisten Naicker","doi":"10.1109/WEEF-GEDC54384.2022.9996209","DOIUrl":null,"url":null,"abstract":"Developing Machine Learning models to predict the likelihood of a student's graduation has received significant interest in recent times. One clear application is to identify students most in need of support before actual failure occurs. There is a growing concern, however, about the range of applicability of such models. Machine Learning models are often limited by the consistency of their performance across years or even by programme in other words, although a model may be developed for a given course/module in a given year, the model accuracy tends to degrade when small differences occur in the time or field of study. In this study, the focus is on the identification of so-called Latent Factors, which are more fundamental characteristics derived from the student and field of study meta-data. Basing Machine Learning models on these more fundamental characteristics tends to produce models which, although reduces in accuracy, tend to preserve the prediction capacity over a broader period of time and scale of study area. The study investigates latent factors that include a student's “credit load capacity”, level of activity in accessing course material (LMS access frequency), overall performance (measured as mean marks), the rate of change of performance (measured as the rate of change of mean) and consistency (measured as standard deviation). In addition, the modelling also considers the matric mean score of the students undertaking the coursework, historical consistency with peer modules (given by the Pearson R-Coefficient), course position in curriculum (given by the academic year of study when undertaken by students) and the mean number of attempts required to pass the course. It is shown that when these characteristics are integrated into a Machine Learning framework, the accuracy improves on the order of 24%","PeriodicalId":206250,"journal":{"name":"2022 IEEE IFEES World Engineering Education Forum - Global Engineering Deans Council (WEEF-GEDC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE IFEES World Engineering Education Forum - Global Engineering Deans Council (WEEF-GEDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WEEF-GEDC54384.2022.9996209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Developing Machine Learning models to predict the likelihood of a student's graduation has received significant interest in recent times. One clear application is to identify students most in need of support before actual failure occurs. There is a growing concern, however, about the range of applicability of such models. Machine Learning models are often limited by the consistency of their performance across years or even by programme in other words, although a model may be developed for a given course/module in a given year, the model accuracy tends to degrade when small differences occur in the time or field of study. In this study, the focus is on the identification of so-called Latent Factors, which are more fundamental characteristics derived from the student and field of study meta-data. Basing Machine Learning models on these more fundamental characteristics tends to produce models which, although reduces in accuracy, tend to preserve the prediction capacity over a broader period of time and scale of study area. The study investigates latent factors that include a student's “credit load capacity”, level of activity in accessing course material (LMS access frequency), overall performance (measured as mean marks), the rate of change of performance (measured as the rate of change of mean) and consistency (measured as standard deviation). In addition, the modelling also considers the matric mean score of the students undertaking the coursework, historical consistency with peer modules (given by the Pearson R-Coefficient), course position in curriculum (given by the academic year of study when undertaken by students) and the mean number of attempts required to pass the course. It is shown that when these characteristics are integrated into a Machine Learning framework, the accuracy improves on the order of 24%
持续预测学生成功的潜在因素
近年来,开发机器学习模型来预测学生毕业的可能性受到了极大的关注。一个明确的应用是在实际失败发生之前确定最需要支持的学生。然而,人们越来越关注这些模型的适用范围。换句话说,机器学习模型通常受到其多年表现一致性的限制,甚至受到程序的限制,尽管可以在给定年份为给定课程/模块开发模型,但当时间或研究领域出现微小差异时,模型的准确性往往会降低。在本研究中,重点是识别所谓的潜在因素,这是来自学生和研究领域元数据的更基本的特征。将机器学习模型建立在这些更基本的特征上,往往会产生尽管准确性降低,但往往在更广泛的时间和研究区域范围内保持预测能力的模型。本研究调查的潜在因素包括学生的“学分负荷能力”、访问课程材料的活动水平(LMS访问频率)、整体表现(以平均分数衡量)、表现变化率(以平均变化率衡量)和一致性(以标准差衡量)。此外,建模还考虑参加课程的学生的矩阵平均分数,与同行模块的历史一致性(由Pearson r系数给出),课程在课程中的位置(由学生学习时的学年给出)以及通过课程所需的平均尝试次数。研究表明,当这些特征集成到机器学习框架中时,准确率提高了24%
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信