Predicting academic performance of students with machine learning

IF 2 4区 管理学 Q2 INFORMATION SCIENCE & LIBRARY SCIENCE
Yavuz Selim Balcıoğlu, Melike Artar
{"title":"Predicting academic performance of students with machine learning","authors":"Yavuz Selim Balcıoğlu, Melike Artar","doi":"10.1177/02666669231213023","DOIUrl":null,"url":null,"abstract":"This study investigates the effectiveness of machine learning and deep learning models for early prediction of student performance in higher education institutions. Using the Open University Learning Analytics (OULA) dataset, various models, including Decision Tree, Support Vector Machine, Neural Network, and Ensemble Model, were employed to predict student performance in three categories: Pass/Fail, Close to Fail, and Close to Pass. The Ensemble Model (EM) consistently outperformed other models, achieving the highest overall F1 measure, precision, recall, and accuracy. These results highlight the potential of data-driven techniques in informing educational stakeholders’ decision-making processes, enabling targeted interventions, and facilitating personalized learning strategies tailored to students’ needs. By identifying at-risk students early in the academic year, institutions can provide additional support to improve academic outcomes and retention rates. The study also discusses practical implications, including the development of pedagogical policies and guidelines based on early predictions, which can help educational institutions maintain strong academic outcomes and enhance their reputation for academic excellence. Future research aims to investigate the impact of individual activities on student performance and explore day-to-day student behaviors, enabling the creation of tailored pedagogical policies and guidelines.","PeriodicalId":47137,"journal":{"name":"Information Development","volume":"13 2","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Development","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1177/02666669231213023","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

This study investigates the effectiveness of machine learning and deep learning models for early prediction of student performance in higher education institutions. Using the Open University Learning Analytics (OULA) dataset, various models, including Decision Tree, Support Vector Machine, Neural Network, and Ensemble Model, were employed to predict student performance in three categories: Pass/Fail, Close to Fail, and Close to Pass. The Ensemble Model (EM) consistently outperformed other models, achieving the highest overall F1 measure, precision, recall, and accuracy. These results highlight the potential of data-driven techniques in informing educational stakeholders’ decision-making processes, enabling targeted interventions, and facilitating personalized learning strategies tailored to students’ needs. By identifying at-risk students early in the academic year, institutions can provide additional support to improve academic outcomes and retention rates. The study also discusses practical implications, including the development of pedagogical policies and guidelines based on early predictions, which can help educational institutions maintain strong academic outcomes and enhance their reputation for academic excellence. Future research aims to investigate the impact of individual activities on student performance and explore day-to-day student behaviors, enabling the creation of tailored pedagogical policies and guidelines.
用机器学习预测学生的学习成绩
本研究探讨了机器学习和深度学习模型在早期预测高等教育机构学生成绩方面的有效性。利用开放大学学习分析(OULA)数据集,采用了包括决策树、支持向量机、神经网络和集合模型在内的各种模型来预测三个类别的学生成绩:及格/不及格、接近不及格和接近及格。集合模型(EM)的表现始终优于其他模型,在总体 F1 指标、精确度、召回率和准确度方面均名列前茅。这些结果凸显了数据驱动技术在为教育利益相关者的决策过程提供信息、实现有针对性的干预和促进针对学生需求的个性化学习策略方面的潜力。通过在学年早期识别高危学生,院校可以提供额外的支持,以提高学习成绩和保留率。研究还讨论了实际意义,包括根据早期预测制定教学政策和指导方针,这有助于教育机构保持优异的学业成绩,提高其学术声誉。未来的研究旨在调查个别活动对学生成绩的影响,并探索学生的日常行为,从而制定有针对性的教学政策和指导方针。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Information Development
Information Development INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
5.10
自引率
5.30%
发文量
40
期刊介绍: Information Development is a peer-reviewed journal that aims to provide authoritative coverage of current developments in the provision, management and use of information throughout the world, with particular emphasis on the information needs and problems of developing countries. It deals with both the development of information systems, services and skills, and the role of information in personal and national development.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信