{"title":"机器学习在在线教育中风险学生预测中的应用:十年文献系统回顾","authors":"Hui Shi, Nuodi Zhang, Secil Caskurlu, Hunhui Na","doi":"10.1111/jcal.70058","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>The growth of online education has provided flexibility and access to a wide range of courses. However, the self-paced and often isolated nature of these courses has been associated with increased dropout and failure rates. Researchers employed machine learning approaches to identify at-risk students, but multiple issues have not been addressed concerning the definition of at-risk students, as well as the strengths and limitations of different machine learning models to predict at-risk students.</p>\n </section>\n \n <section>\n \n <h3> Objectives</h3>\n \n <p>This systematic review aims to provide a comprehensive overview of the past 10-year research focusing on applying machine learning techniques for predicting at-risk students (i.e., failure, dropouts) in online learning environments.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Studies were extracted from the ACM Digital Library, IEEE Xplore Digital Library, Web of Science, ERIC, ProQuest, and EBSCO. A total of 161 studies published from 2014 to 2024 were included in the review.</p>\n </section>\n \n <section>\n \n <h3> Results and Conclusions</h3>\n \n <p>Findings revealed (1) four primary at-risk definitions outlined in the reviewed studies, each focusing on specific stages of student engagement and performance in a course; (2) most studies relied on student behavioural engagement and academic factors as at-risk predictors; (3) the adoption of deep learning and ensemble deep learning networks has significantly increased in the past 5 years, often outperforming classical machine learning models. While studies in which classical machine learning excelled often relied on the ensemble methodology and smaller sample sizes; (4) current machine learning practice evaluated by a list of criteria showed concerns regarding reproducibility, generalisability, and interpretability.</p>\n </section>\n </div>","PeriodicalId":48071,"journal":{"name":"Journal of Computer Assisted Learning","volume":"41 4","pages":""},"PeriodicalIF":5.1000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Applications of Machine Learning for at-Risk Student Prediction in Online Education: A 10-Year Systematic Review of Literature\",\"authors\":\"Hui Shi, Nuodi Zhang, Secil Caskurlu, Hunhui Na\",\"doi\":\"10.1111/jcal.70058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>The growth of online education has provided flexibility and access to a wide range of courses. However, the self-paced and often isolated nature of these courses has been associated with increased dropout and failure rates. Researchers employed machine learning approaches to identify at-risk students, but multiple issues have not been addressed concerning the definition of at-risk students, as well as the strengths and limitations of different machine learning models to predict at-risk students.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Objectives</h3>\\n \\n <p>This systematic review aims to provide a comprehensive overview of the past 10-year research focusing on applying machine learning techniques for predicting at-risk students (i.e., failure, dropouts) in online learning environments.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>Studies were extracted from the ACM Digital Library, IEEE Xplore Digital Library, Web of Science, ERIC, ProQuest, and EBSCO. A total of 161 studies published from 2014 to 2024 were included in the review.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results and Conclusions</h3>\\n \\n <p>Findings revealed (1) four primary at-risk definitions outlined in the reviewed studies, each focusing on specific stages of student engagement and performance in a course; (2) most studies relied on student behavioural engagement and academic factors as at-risk predictors; (3) the adoption of deep learning and ensemble deep learning networks has significantly increased in the past 5 years, often outperforming classical machine learning models. While studies in which classical machine learning excelled often relied on the ensemble methodology and smaller sample sizes; (4) current machine learning practice evaluated by a list of criteria showed concerns regarding reproducibility, generalisability, and interpretability.</p>\\n </section>\\n </div>\",\"PeriodicalId\":48071,\"journal\":{\"name\":\"Journal of Computer Assisted Learning\",\"volume\":\"41 4\",\"pages\":\"\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer Assisted Learning\",\"FirstCategoryId\":\"95\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jcal.70058\",\"RegionNum\":2,\"RegionCategory\":\"教育学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer Assisted Learning","FirstCategoryId":"95","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jcal.70058","RegionNum":2,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0
摘要
在线教育的发展提供了灵活性和广泛的课程选择。然而,这些课程的自我进度和往往孤立的性质与辍学率和失败率的增加有关。研究人员使用机器学习方法来识别有风险的学生,但关于有风险学生的定义以及不同机器学习模型预测有风险学生的优势和局限性的多个问题尚未得到解决。本系统综述旨在全面概述过去10年的研究,重点是在在线学习环境中应用机器学习技术来预测有风险的学生(即失败、辍学)。方法从ACM数字图书馆、IEEE explore数字图书馆、Web of Science、ERIC、ProQuest和EBSCO中提取研究。2014年至2024年间发表的161项研究被纳入该综述。结果和结论发现:(1)在回顾的研究中概述了四个主要的风险定义,每个定义都关注学生在课程中的参与和表现的特定阶段;(2)大多数研究依赖学生行为投入和学业因素作为风险预测因子;(3)在过去5年中,深度学习和集成深度学习网络的采用显著增加,通常优于经典机器学习模型。虽然经典机器学习的研究往往依赖于集成方法和较小的样本量;(4)通过一系列标准评估的当前机器学习实践显示出对再现性、概括性和可解释性的关注。
Applications of Machine Learning for at-Risk Student Prediction in Online Education: A 10-Year Systematic Review of Literature
Background
The growth of online education has provided flexibility and access to a wide range of courses. However, the self-paced and often isolated nature of these courses has been associated with increased dropout and failure rates. Researchers employed machine learning approaches to identify at-risk students, but multiple issues have not been addressed concerning the definition of at-risk students, as well as the strengths and limitations of different machine learning models to predict at-risk students.
Objectives
This systematic review aims to provide a comprehensive overview of the past 10-year research focusing on applying machine learning techniques for predicting at-risk students (i.e., failure, dropouts) in online learning environments.
Methods
Studies were extracted from the ACM Digital Library, IEEE Xplore Digital Library, Web of Science, ERIC, ProQuest, and EBSCO. A total of 161 studies published from 2014 to 2024 were included in the review.
Results and Conclusions
Findings revealed (1) four primary at-risk definitions outlined in the reviewed studies, each focusing on specific stages of student engagement and performance in a course; (2) most studies relied on student behavioural engagement and academic factors as at-risk predictors; (3) the adoption of deep learning and ensemble deep learning networks has significantly increased in the past 5 years, often outperforming classical machine learning models. While studies in which classical machine learning excelled often relied on the ensemble methodology and smaller sample sizes; (4) current machine learning practice evaluated by a list of criteria showed concerns regarding reproducibility, generalisability, and interpretability.
期刊介绍:
The Journal of Computer Assisted Learning is an international peer-reviewed journal which covers the whole range of uses of information and communication technology to support learning and knowledge exchange. It aims to provide a medium for communication among researchers as well as a channel linking researchers, practitioners, and policy makers. JCAL is also a rich source of material for master and PhD students in areas such as educational psychology, the learning sciences, instructional technology, instructional design, collaborative learning, intelligent learning systems, learning analytics, open, distance and networked learning, and educational evaluation and assessment. This is the case for formal (e.g., schools), non-formal (e.g., workplace learning) and informal learning (e.g., museums and libraries) situations and environments. Volumes often include one Special Issue which these provides readers with a broad and in-depth perspective on a specific topic. First published in 1985, JCAL continues to have the aim of making the outcomes of contemporary research and experience accessible. During this period there have been major technological advances offering new opportunities and approaches in the use of a wide range of technologies to support learning and knowledge transfer more generally. There is currently much emphasis on the use of network functionality and the challenges its appropriate uses pose to teachers/tutors working with students locally and at a distance. JCAL welcomes: -Empirical reports, single studies or programmatic series of studies on the use of computers and information technologies in learning and assessment -Critical and original meta-reviews of literature on the use of computers for learning -Empirical studies on the design and development of innovative technology-based systems for learning -Conceptual articles on issues relating to the Aims and Scope