数据挖掘:高等教育环境下学生的表现

Ravneil Nand, Ashneel Chand, E. Reddy
{"title":"数据挖掘:高等教育环境下学生的表现","authors":"Ravneil Nand, Ashneel Chand, E. Reddy","doi":"10.1109/NILES53778.2021.9600504","DOIUrl":null,"url":null,"abstract":"Student performance in higher education has become one of the most widely studied area. While modelling students' performance, data plays a pivotal role in forecasting their performance and this is where the data mining applications are now becoming widely used. There are various factors which determine the student performance. In this study, eight attributes are used as inputs which are considered most influential in determining students' performance in the Pacific. Statistical analysis is done to find out which attribute has the highest influence to student performance. In this research, different algorithms are utilized for building the classification model, each of them using various classification techniques. The classification techniques used are Artificial Neural Network, Decision Tree, Decision Table, and Naïve Bayes. The dataset of 651 records used in this research is an imbalanced set, which is later transformed to balance set through under sampling. Neural Network is one of the classification techniques that has performed well on both, imbalanced and balanced datasets with the highest prediction accuracy of 96.8%. The analysis further shows that internal assessment has weak positive relationship with student performance while demographic data has no significant relationship.","PeriodicalId":249153,"journal":{"name":"2021 3rd Novel Intelligent and Leading Emerging Sciences Conference (NILES)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Data Mining Students' performance in a Higher Learning Environment\",\"authors\":\"Ravneil Nand, Ashneel Chand, E. Reddy\",\"doi\":\"10.1109/NILES53778.2021.9600504\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Student performance in higher education has become one of the most widely studied area. While modelling students' performance, data plays a pivotal role in forecasting their performance and this is where the data mining applications are now becoming widely used. There are various factors which determine the student performance. In this study, eight attributes are used as inputs which are considered most influential in determining students' performance in the Pacific. Statistical analysis is done to find out which attribute has the highest influence to student performance. In this research, different algorithms are utilized for building the classification model, each of them using various classification techniques. The classification techniques used are Artificial Neural Network, Decision Tree, Decision Table, and Naïve Bayes. The dataset of 651 records used in this research is an imbalanced set, which is later transformed to balance set through under sampling. Neural Network is one of the classification techniques that has performed well on both, imbalanced and balanced datasets with the highest prediction accuracy of 96.8%. The analysis further shows that internal assessment has weak positive relationship with student performance while demographic data has no significant relationship.\",\"PeriodicalId\":249153,\"journal\":{\"name\":\"2021 3rd Novel Intelligent and Leading Emerging Sciences Conference (NILES)\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 3rd Novel Intelligent and Leading Emerging Sciences Conference (NILES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NILES53778.2021.9600504\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd Novel Intelligent and Leading Emerging Sciences Conference (NILES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NILES53778.2021.9600504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在高等教育中,学生表现已成为研究最广泛的领域之一。在对学生的表现进行建模的同时,数据在预测他们的表现方面起着关键作用,这就是数据挖掘应用程序现在得到广泛应用的地方。决定学生成绩的因素有很多。在本研究中,八个属性被用作输入,这些属性被认为对决定太平洋地区学生的表现最有影响力。通过统计分析,找出哪个属性对学生成绩的影响最大。在本研究中,使用了不同的算法来构建分类模型,每种算法都使用了不同的分类技术。使用的分类技术有人工神经网络、决策树、决策表和Naïve贝叶斯。本研究使用的651条记录的数据集是一个不平衡集,通过欠采样将其转化为平衡集。神经网络是在平衡数据集和不平衡数据集上都表现良好的分类技术之一,其预测准确率最高,达到96.8%。进一步分析表明,内部评价与学生成绩呈弱正相关,而人口统计数据与学生成绩无显著关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Data Mining Students' performance in a Higher Learning Environment
Student performance in higher education has become one of the most widely studied area. While modelling students' performance, data plays a pivotal role in forecasting their performance and this is where the data mining applications are now becoming widely used. There are various factors which determine the student performance. In this study, eight attributes are used as inputs which are considered most influential in determining students' performance in the Pacific. Statistical analysis is done to find out which attribute has the highest influence to student performance. In this research, different algorithms are utilized for building the classification model, each of them using various classification techniques. The classification techniques used are Artificial Neural Network, Decision Tree, Decision Table, and Naïve Bayes. The dataset of 651 records used in this research is an imbalanced set, which is later transformed to balance set through under sampling. Neural Network is one of the classification techniques that has performed well on both, imbalanced and balanced datasets with the highest prediction accuracy of 96.8%. The analysis further shows that internal assessment has weak positive relationship with student performance while demographic data has no significant relationship.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信