Daniel Andrade Girón, Juana Sandivar Rosas, W. Marín-Rodriguez, E. Susanibar-Ramirez, Eliseo Toro-Dextre, J. Ausejo-Sánchez, Henry Villarreal-Torres, Julio Angeles-Morales
{"title":"基于机器学习和深度学习的学生退学预测:系统综述","authors":"Daniel Andrade Girón, Juana Sandivar Rosas, W. Marín-Rodriguez, E. Susanibar-Ramirez, Eliseo Toro-Dextre, J. Ausejo-Sánchez, Henry Villarreal-Torres, Julio Angeles-Morales","doi":"10.4108/eetsis.3586","DOIUrl":null,"url":null,"abstract":"Student dropout is one of the most complex challenges facing the education system worldwide. In order to evaluate the success of Machine Learning and Deep Learning algorithms in predicting student dropout, a systematic review was conducted. The search was carried out in several electronic bibliographic databases, including Scopus, IEEE, and Web of Science, covering up to June 2023, having 246 articles as search reports. Exclusion criteria, such as review articles, editorials, letters, and comments, were established. The final review included 23 studies in which performance metrics such as accuracy/precision, sensitivity/recall, specificity, and area under the curve (AUC) were evaluated. In addition, aspects related to study modality, training, testing strategy, cross-validation, and confounding matrix were considered. The review results revealed that the most used Machine Learning algorithm was Random Forest, present in 21.73% of the studies; this algorithm obtained an accuracy of 99% in the prediction of student dropout, higher than all the algorithms used in the total number of studies reviewed.","PeriodicalId":43034,"journal":{"name":"EAI Endorsed Transactions on Scalable Information Systems","volume":"11 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting Student Dropout based on Machine Learning and Deep Learning: A Systematic Review\",\"authors\":\"Daniel Andrade Girón, Juana Sandivar Rosas, W. Marín-Rodriguez, E. Susanibar-Ramirez, Eliseo Toro-Dextre, J. Ausejo-Sánchez, Henry Villarreal-Torres, Julio Angeles-Morales\",\"doi\":\"10.4108/eetsis.3586\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Student dropout is one of the most complex challenges facing the education system worldwide. In order to evaluate the success of Machine Learning and Deep Learning algorithms in predicting student dropout, a systematic review was conducted. The search was carried out in several electronic bibliographic databases, including Scopus, IEEE, and Web of Science, covering up to June 2023, having 246 articles as search reports. Exclusion criteria, such as review articles, editorials, letters, and comments, were established. The final review included 23 studies in which performance metrics such as accuracy/precision, sensitivity/recall, specificity, and area under the curve (AUC) were evaluated. In addition, aspects related to study modality, training, testing strategy, cross-validation, and confounding matrix were considered. The review results revealed that the most used Machine Learning algorithm was Random Forest, present in 21.73% of the studies; this algorithm obtained an accuracy of 99% in the prediction of student dropout, higher than all the algorithms used in the total number of studies reviewed.\",\"PeriodicalId\":43034,\"journal\":{\"name\":\"EAI Endorsed Transactions on Scalable Information Systems\",\"volume\":\"11 1\",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2023-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EAI Endorsed Transactions on Scalable Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4108/eetsis.3586\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EAI Endorsed Transactions on Scalable Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/eetsis.3586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
学生辍学是全球教育系统面临的最复杂的挑战之一。为了评估机器学习和深度学习算法在预测学生辍学方面的成功,进行了系统回顾。在Scopus、IEEE、Web of Science等多个电子书目数据库中进行检索,检索时间截止到2023年6月,检索报告246篇。建立了排除标准,如综述文章、社论、信件和评论。最终的综述包括23项研究,其中评估了准确性/精密度、灵敏度/召回率、特异性和曲线下面积(AUC)等性能指标。此外,还考虑了与研究方式、训练、测试策略、交叉验证和混杂矩阵相关的方面。综述结果显示,使用最多的机器学习算法是随机森林,占21.73%的研究;该算法在预测学生辍学方面的准确率达到99%,高于所审查研究总数中使用的所有算法。
Predicting Student Dropout based on Machine Learning and Deep Learning: A Systematic Review
Student dropout is one of the most complex challenges facing the education system worldwide. In order to evaluate the success of Machine Learning and Deep Learning algorithms in predicting student dropout, a systematic review was conducted. The search was carried out in several electronic bibliographic databases, including Scopus, IEEE, and Web of Science, covering up to June 2023, having 246 articles as search reports. Exclusion criteria, such as review articles, editorials, letters, and comments, were established. The final review included 23 studies in which performance metrics such as accuracy/precision, sensitivity/recall, specificity, and area under the curve (AUC) were evaluated. In addition, aspects related to study modality, training, testing strategy, cross-validation, and confounding matrix were considered. The review results revealed that the most used Machine Learning algorithm was Random Forest, present in 21.73% of the studies; this algorithm obtained an accuracy of 99% in the prediction of student dropout, higher than all the algorithms used in the total number of studies reviewed.