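A comparative study of supervised/unsupervised machine learning algorithms with feature selection approaches to predict student performance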

Impact factor: 0.4; JCR quartile: Q4 (Computer Science, Artificial Intelligence)
Alaa Khalaf Hamoud, Ali Salah Alasady, Wid Akeel Awadh, Jasim Mohammed Dahr, Mohammed B.M. Kamel, Aqeel Majeed Humadi, Ihab Ahmed Najm
DOI: 10.1504/ijdmmm.2023.134590 (https://doi.org/10.1504/ijdmmm.2023.134590)
Publication date: 2023-01-01
Citations: 0

Abstract

Educational data mining (EDM) is one of the fastest-growing fields; it aims to improve the performance of students, academic staff, and institutions as a whole. Applying data mining algorithms almost always requires a feature selection step to find the most correlated features and improve accuracy. This paper presents a comparative study of supervised and unsupervised algorithms for predicting student performance. Students' grades are classified using several families of supervised and unsupervised algorithms, including decision trees, clustering, and neural networks. The algorithms were evaluated on a questionnaire dataset before and after feature selection to measure the effect of feature selection on accuracy. The results showed that the random forest algorithm outperformed the other supervised and unsupervised algorithms. They also showed that removing the less correlated attributes improved the evaluated performance of most of the algorithms.
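The before/after comparison described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the authors' method or data: the dataset is a synthetic toy table rather than the questionnaire dataset, the classifier is a plain 1-nearest-neighbour rule rather than any of the algorithms the paper evaluates, the selection rule is a simple Pearson-correlation threshold, and the names `select_features`, `project`, and `loo_accuracy` are hypothetical helpers.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 for a constant column."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0

def select_features(X, y, threshold):
    """Keep column indices whose |correlation| with the label meets the threshold."""
    return [j for j, col in enumerate(zip(*X)) if abs(pearson(col, y)) >= threshold]

def project(X, keep):
    """Return a copy of X restricted to the columns listed in keep."""
    return [[row[j] for j in keep] for row in X]

def loo_accuracy(X, y):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier."""
    hits = 0
    for i, x in enumerate(X):
        others = [k for k in range(len(X)) if k != i]
        nearest = min(others, key=lambda k: math.dist(X[k], x))
        hits += y[nearest] == y[i]
    return hits / len(X)

# Synthetic toy data (an assumption, not the paper's dataset):
# column 0 tracks the label; columns 1 and 2 are periodic noise on a larger scale.
X = [[i, (i % 5) * 10, ((2 * i) % 5) * 10] for i in range(40)]
y = [1 if i >= 20 else 0 for i in range(40)]

keep = select_features(X, y, threshold=0.3)
acc_before = loo_accuracy(X, y)
acc_after = loo_accuracy(project(X, keep), y)

print(f"kept columns: {keep}")
print(f"accuracy before selection: {acc_before:.3f}")
print(f"accuracy after selection:  {acc_after:.3f}")
```

On this constructed data only column 0 survives the threshold, and leave-one-out accuracy rises from 0.875 to 0.975 once the uncorrelated columns are dropped. That gain follows from how the toy data was built and says nothing about the size of the effect reported in the paper.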
Source journal
International Journal of Data Mining Modelling and Management (Computer Science, Artificial Intelligence)
CiteScore: 1.10
Self-citation rate: 0.00%
Articles published: 22
Journal description: Facilitating the transformation from data to information to knowledge is paramount for organisations. Companies are flooded with data and conflicting information, but have limited usable knowledge. However, rarely should a process be looked at from limited angles or in parts. Isolated islands of data mining, modelling and management (DMMM) should be connected. IJDMMM highlights the integration of DMMM, statistics/machine learning/databases, each element of data chain management, types of information, and algorithms in software; from data pre-processing to post-processing; and between theory and applications. Topics covered include:
- Artificial intelligence
- Biomedical science
- Business analytics/intelligence, process modelling
- Computer science, database management systems
- Data management, mining, modelling, warehousing
- Engineering
- Environmental science, environment (ecoinformatics)
- Information systems/technology, telecommunications/networking
- Management science, operations research, mathematics/statistics
- Social sciences
- Business/economics, (computational) finance
- Healthcare, medicine, pharmaceuticals
- (Computational) chemistry, biology (bioinformatics)
- Sustainable mobility systems, intelligent transportation systems
- National security