Combining Self-Training and Tri-Training for Course-Level Student Classification

Vo Thi Ngoc Chau, N. H. Phung
{"title":"Combining Self-Training and Tri-Training for Course-Level Student Classification","authors":"Vo Thi Ngoc Chau, N. H. Phung","doi":"10.1109/ICEAST.2018.8434452","DOIUrl":null,"url":null,"abstract":"Course-level student classification is one of the most popular tasks in the educational data mining area. It helps early identifying the students facing difficulties in a course by predicting their final study status after the end of the course. To perform this task more effectively, our work proposes a semi-supervised learning method, TriForest, which combines Self-Training and Tri-Training. TriForest has three base classifiers all of which are Random Forests in the learning process of the Self-Training framework. Meanwhile, TriForest explores the agreement mechanism of Tri-Training in selecting the most confidently predicted instances for training data set enhancement of each base classifier. In addition, our work also takes into account both study performance via grades and forum-related attributes in this classification task. The selected attributes are useful for the proposed method to differentiate successful students from students with failure in a course where the data sets of the past students can be used to build a classification model for the current students. Indeed, experimental results on real data sets have shown that with about 90% of Accuracy, our method can achieve better predictions than its base classifier and other semi-supervised learning methods. Such accurate student classifications help both students and lecturers forecast the students' study status and in-trouble students can be then supported appropriately for their ultimate success in the course.","PeriodicalId":138654,"journal":{"name":"2018 International Conference on Engineering, Applied Sciences, and Technology (ICEAST)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Engineering, Applied Sciences, and Technology (ICEAST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEAST.2018.8434452","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Course-level student classification is one of the most popular tasks in the educational data mining area. It helps early identifying the students facing difficulties in a course by predicting their final study status after the end of the course. To perform this task more effectively, our work proposes a semi-supervised learning method, TriForest, which combines Self-Training and Tri-Training. TriForest has three base classifiers all of which are Random Forests in the learning process of the Self-Training framework. Meanwhile, TriForest explores the agreement mechanism of Tri-Training in selecting the most confidently predicted instances for training data set enhancement of each base classifier. In addition, our work also takes into account both study performance via grades and forum-related attributes in this classification task. The selected attributes are useful for the proposed method to differentiate successful students from students with failure in a course where the data sets of the past students can be used to build a classification model for the current students. Indeed, experimental results on real data sets have shown that with about 90% of Accuracy, our method can achieve better predictions than its base classifier and other semi-supervised learning methods. Such accurate student classifications help both students and lecturers forecast the students' study status and in-trouble students can be then supported appropriately for their ultimate success in the course.
自我训练与三训练相结合的课程级学生分类
课程级学生分类是教育数据挖掘领域中最受欢迎的任务之一。它通过预测学生在课程结束后的最终学习状态,帮助学生及早发现在课程中遇到的困难。为了更有效地完成这项任务,我们的工作提出了一种半监督学习方法,TriForest,它结合了Self-Training和Tri-Training。在Self-Training框架的学习过程中,TriForest有三个基本分类器,它们都是随机森林。同时,TriForest探索了Tri-Training在选择最自信的预测实例用于每个基分类器的训练数据集增强方面的一致机制。此外,在这个分类任务中,我们的工作还考虑了成绩和论坛相关属性的学习表现。所选择的属性对于所提出的方法有用,可以区分课程中成功学生和失败学生,其中过去学生的数据集可用于构建当前学生的分类模型。事实上,在真实数据集上的实验结果表明,我们的方法可以实现比其基础分类器和其他半监督学习方法更好的预测,准确率约为90%。这种准确的学生分类有助于学生和老师预测学生的学习状况,从而为有困难的学生提供适当的支持,使他们最终在课程中取得成功。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信