Multi-objective Optimization of C4.5 Decision Tree for Predicting Student Academic Performance

Georgios Kostopoulos, Nikos Fazakis, S. Kotsiantis, K. Sgarbas
{"title":"Multi-objective Optimization of C4.5 Decision Tree for Predicting Student Academic Performance","authors":"Georgios Kostopoulos, Nikos Fazakis, S. Kotsiantis, K. Sgarbas","doi":"10.1109/IISA.2019.8900771","DOIUrl":null,"url":null,"abstract":"Applying data mining methods in the educational field has gained a lot of attention among scientists over the last years. Educational Data Mining forms an ever-developing research area aiming to unveil the hidden knowledge in educational data and improve students’ learning behavior and outcomes. To this end, a plethora of data mining methods have already been implemented in various educational settings solving a variety of tasks, among which the prediction of students’ academic performance as well. Decision trees have proven to be a quite effective method for both classification and regression problems showing a number of considerable advantages, such as efficiency, simplicity, flexibility and interpretability. Moreover, configuration of parameter values has often a material impact on building optimal trees in terms of accuracy and/or size. In this context, the main objective of our study is to yield a highly accurate and interpretable classification tree for the early prognosis of students at risk of failing in a university course. Thereby, effective intervention and support actions could be initiated to motivate students and enhance their performance. The experimental results demonstrate that the induction of the C4.5 decision tree classifier through an evolutionary algorithm, such as the Speed -constrained Multi-objective Particle Swarm Optimization algorithm, yields more accurate and easier to construe trees.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2019.8900771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Applying data mining methods in the educational field has gained a lot of attention among scientists over the last years. Educational Data Mining forms an ever-developing research area aiming to unveil the hidden knowledge in educational data and improve students’ learning behavior and outcomes. To this end, a plethora of data mining methods have already been implemented in various educational settings solving a variety of tasks, among which the prediction of students’ academic performance as well. Decision trees have proven to be a quite effective method for both classification and regression problems showing a number of considerable advantages, such as efficiency, simplicity, flexibility and interpretability. Moreover, configuration of parameter values has often a material impact on building optimal trees in terms of accuracy and/or size. In this context, the main objective of our study is to yield a highly accurate and interpretable classification tree for the early prognosis of students at risk of failing in a university course. Thereby, effective intervention and support actions could be initiated to motivate students and enhance their performance. The experimental results demonstrate that the induction of the C4.5 decision tree classifier through an evolutionary algorithm, such as the Speed -constrained Multi-objective Particle Swarm Optimization algorithm, yields more accurate and easier to construe trees.
C4.5决策树预测学生学习成绩的多目标优化
近年来,数据挖掘方法在教育领域的应用受到了科学家们的广泛关注。教育数据挖掘是一个不断发展的研究领域,旨在揭示教育数据中隐藏的知识,改善学生的学习行为和成果。为此,大量的数据挖掘方法已经在各种教育环境中实施,解决各种任务,其中包括预测学生的学习成绩。决策树已被证明是分类和回归问题的一种相当有效的方法,显示出许多相当大的优势,如效率、简单性、灵活性和可解释性。此外,参数值的配置通常会对构建最优树的准确性和/或大小产生重大影响。在这种情况下,我们研究的主要目的是为有大学课程不及格风险的学生的早期预后提供一个高度准确和可解释的分类树。因此,可以采取有效的干预和支持行动来激励学生,提高他们的表现。实验结果表明,通过进化算法(如速度约束多目标粒子群优化算法)对C4.5决策树分类器进行归纳,可以得到更准确、更容易构造的树。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信