Predicting Dropouts Before Enrollments in MOOCs: An Explainable and Self-Supervised Model

IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Jin Li;Shu Li;Yuan Zhao;Longjiang Guo;Fei Hao;Meirui Ren;Keqin Li
{"title":"Predicting Dropouts Before Enrollments in MOOCs: An Explainable and Self-Supervised Model","authors":"Jin Li;Shu Li;Yuan Zhao;Longjiang Guo;Fei Hao;Meirui Ren;Keqin Li","doi":"10.1109/TSC.2023.3311627","DOIUrl":null,"url":null,"abstract":"Massive Open Online Courses (MOOCs) belong to a new cloud-based service in education that suffers from low completion rates. Effective pre-learning intervention services, such as recommending courses with a high probability of completion or filtering courses with a very low probability of completion, will encourage students to spend more time and energy on proper courses, thus can reduce the dropout ratio. In practice, intervention services are introduced when students are predicted to drop out. However, existing methods concentrate on analyzing students’ learning actions and predicting final dropout after a period of enrollment, which are insufficient in preventing students from enrolling in unsuitable courses and withdrawing mid-way. This paper presents a neural network-based Explainable Self-supervised Model (ESM) to predict MOOC dropout before enrollment. Specifically, the student's learning actions on an unenrolled course are estimated using previous logs by the neural network. And then, the action's contribution to the completion of a course is calculated in a similar way. Therefore, the probability of completion for an unenrolled course is predicted by aggregating the learning actions and their contribution to the completion. To train the neural network, a self-supervised training strategy is proposed, where enrolled courses in the training data are randomly selected as validation in each epoch. The ESM outperforms existing methods in terms of prediction accuracy and efficiency. The average increment of Area Under the ROC Curve (AUC) and F-score (F1) in the two MOOCs datasets, XuetangX and KDDCUP, are 8.3% and 0.6%, respectively. Furthermore, the two pre-learning intervention services named courses recommendation and courses filtration are proposed. When courses are recommended, the completion rate increased from 22% to 60% in XuetangX, and from 27% to 45% in KDDCUP. By filtering courses predicted with low completion probability, 40% wasted time in uncompleted courses will be saved in XuetangX.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"16 6","pages":"4154-4167"},"PeriodicalIF":5.5000,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10238815/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Massive Open Online Courses (MOOCs) belong to a new cloud-based service in education that suffers from low completion rates. Effective pre-learning intervention services, such as recommending courses with a high probability of completion or filtering courses with a very low probability of completion, will encourage students to spend more time and energy on proper courses, thus can reduce the dropout ratio. In practice, intervention services are introduced when students are predicted to drop out. However, existing methods concentrate on analyzing students’ learning actions and predicting final dropout after a period of enrollment, which are insufficient in preventing students from enrolling in unsuitable courses and withdrawing mid-way. This paper presents a neural network-based Explainable Self-supervised Model (ESM) to predict MOOC dropout before enrollment. Specifically, the student's learning actions on an unenrolled course are estimated using previous logs by the neural network. And then, the action's contribution to the completion of a course is calculated in a similar way. Therefore, the probability of completion for an unenrolled course is predicted by aggregating the learning actions and their contribution to the completion. To train the neural network, a self-supervised training strategy is proposed, where enrolled courses in the training data are randomly selected as validation in each epoch. The ESM outperforms existing methods in terms of prediction accuracy and efficiency. The average increment of Area Under the ROC Curve (AUC) and F-score (F1) in the two MOOCs datasets, XuetangX and KDDCUP, are 8.3% and 0.6%, respectively. Furthermore, the two pre-learning intervention services named courses recommendation and courses filtration are proposed. When courses are recommended, the completion rate increased from 22% to 60% in XuetangX, and from 27% to 45% in KDDCUP. By filtering courses predicted with low completion probability, 40% wasted time in uncompleted courses will be saved in XuetangX.
预测 MOOC 注册前的辍学情况:可解释的自监督模型
大规模在线开放课程(MOOCs)属于一种新的基于云的教育服务,其完成率很低。有效的学习前干预服务,如推荐完成概率高的课程或过滤完成概率极低的课程,可以鼓励学生将更多的时间和精力花在合适的课程上,从而降低辍学率。在实践中,当学生被预测要退学时,就会引入干预服务。然而,现有的方法侧重于分析学生的学习行为,预测学生在入学一段时间后的最终退学情况,不足以防止学生选修不合适的课程,中途退学。本文提出了一种基于神经网络的可解释自监督模型(ESM)来预测MOOC学生在入学前辍学。具体来说,神经网络使用之前的日志来估计学生在未注册课程上的学习行为。然后,动作对完成课程的贡献以类似的方式计算。因此,通过汇总学习行为及其对完成的贡献来预测未注册课程的完成概率。为了训练神经网络,提出了一种自监督训练策略,随机选择训练数据中的注册课程作为每个历元的验证。该方法在预测精度和效率方面都优于现有方法。在XuetangX和KDDCUP两个mooc数据集上,ROC曲线下面积(Area Under The ROC Curve, AUC)和F-score (F-score, F1)的平均增量分别为8.3%和0.6%。在此基础上,提出了课程推荐和课程过滤两种学习前干预服务。当课程被推荐时,学堂x的完成率从22%提高到60%,KDDCUP的完成率从27%提高到45%。通过对预测完成概率较低的课程进行筛选,学堂x可以节省40%的未完成课程浪费时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Services Computing
IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING
CiteScore
11.50
自引率
6.20%
发文量
278
审稿时长
>12 weeks
期刊介绍: IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信