教育中的大数据:学生风险案例研究

IF 1.5 0 ENGINEERING, MULTIDISCIPLINARY

Engineering, Technology & Applied Science Research Pub Date : 2023-10-13 DOI:10.48084/etasr.6190

Ahmed B. Altamimi

{"title":"教育中的大数据:学生风险案例研究","authors":"Ahmed B. Altamimi","doi":"10.48084/etasr.6190","DOIUrl":null,"url":null,"abstract":"This paper analyzes various machine learning algorithms to predict student failure in a specific educational dataset and a specific environment. The paper handles the prediction of student failure given the students' grades, course difficulty level, and GPA, differing from most of the provided studies in the literature, where focus is given to the surrounding environment. The main aim is to early detect students at risk of academic underperformance and implement specific interventions to enhance their academic outcomes. A diverse set of eleven Machine Learning (ML) algorithms was used to analyze the dataset. The data went through preprocessing, and features were engineered to effectively capture essential information that may impact students' academic performance. A meticulous process for model selection and evaluation was utilized to compare the algorithms' performance with regard to metrics such as accuracy, precision, recall, F-score, specificity, and balanced accuracy. Our results demonstrate significant variability in the performance of the different algorithms, with Artificial Neural Networks (ANNs) and Convolutional Neural Networks (CNNs) showing the highest overall performance, followed closely by Gradient Boosting Classifier (GBC), Neuro-Fuzzy, and Random Forest (RF). The other algorithms exhibit varying performance levels, with the Recurrent Neural Networks (RNNs) showing the weakest results in recall and F-score. Educational institutions can use the insight gained from this study to make data-driven decisions and design targeted interventions to help students at risk succeed academically. Furthermore, the methodology presented in this paper can be generalized and applied to other educational datasets for similar predictive purposes.","PeriodicalId":11826,"journal":{"name":"Engineering, Technology & Applied Science Research","volume":"55 1","pages":"0"},"PeriodicalIF":1.5000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Big Data in Education: Students at Risk as a Case Study\",\"authors\":\"Ahmed B. Altamimi\",\"doi\":\"10.48084/etasr.6190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper analyzes various machine learning algorithms to predict student failure in a specific educational dataset and a specific environment. The paper handles the prediction of student failure given the students' grades, course difficulty level, and GPA, differing from most of the provided studies in the literature, where focus is given to the surrounding environment. The main aim is to early detect students at risk of academic underperformance and implement specific interventions to enhance their academic outcomes. A diverse set of eleven Machine Learning (ML) algorithms was used to analyze the dataset. The data went through preprocessing, and features were engineered to effectively capture essential information that may impact students' academic performance. A meticulous process for model selection and evaluation was utilized to compare the algorithms' performance with regard to metrics such as accuracy, precision, recall, F-score, specificity, and balanced accuracy. Our results demonstrate significant variability in the performance of the different algorithms, with Artificial Neural Networks (ANNs) and Convolutional Neural Networks (CNNs) showing the highest overall performance, followed closely by Gradient Boosting Classifier (GBC), Neuro-Fuzzy, and Random Forest (RF). The other algorithms exhibit varying performance levels, with the Recurrent Neural Networks (RNNs) showing the weakest results in recall and F-score. Educational institutions can use the insight gained from this study to make data-driven decisions and design targeted interventions to help students at risk succeed academically. Furthermore, the methodology presented in this paper can be generalized and applied to other educational datasets for similar predictive purposes.\",\"PeriodicalId\":11826,\"journal\":{\"name\":\"Engineering, Technology & Applied Science Research\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2023-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering, Technology & Applied Science Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48084/etasr.6190\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering, Technology & Applied Science Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48084/etasr.6190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

本文分析了在特定教育数据集和特定环境中预测学生失败的各种机器学习算法。本文根据学生的成绩、课程难度水平和GPA来预测学生的不及格，这与大多数文献中提供的研究不同，这些研究的重点是周围环境。其主要目的是早期发现有学业表现不佳风险的学生，并实施具体的干预措施，以提高他们的学业成绩。使用11种不同的机器学习(ML)算法来分析数据集。这些数据经过预处理，特征被设计成有效地捕获可能影响学生学习成绩的基本信息。模型选择和评估是一个细致的过程，用于比较算法在准确性、精密度、召回率、f分数、特异性和平衡准确性等指标方面的性能。我们的研究结果表明，不同算法的性能存在显著差异，人工神经网络(ann)和卷积神经网络(cnn)表现出最高的整体性能，紧随其后的是梯度增强分类器(GBC)、神经模糊和随机森林(RF)。其他算法表现出不同的性能水平，其中循环神经网络(RNNs)在召回和f得分方面表现出最弱的结果。教育机构可以利用从这项研究中获得的见解来做出数据驱动的决策，并设计有针对性的干预措施，帮助有风险的学生在学业上取得成功。此外，本文提出的方法可以推广并应用于其他教育数据集，以达到类似的预测目的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Big Data in Education: Students at Risk as a Case Study

This paper analyzes various machine learning algorithms to predict student failure in a specific educational dataset and a specific environment. The paper handles the prediction of student failure given the students' grades, course difficulty level, and GPA, differing from most of the provided studies in the literature, where focus is given to the surrounding environment. The main aim is to early detect students at risk of academic underperformance and implement specific interventions to enhance their academic outcomes. A diverse set of eleven Machine Learning (ML) algorithms was used to analyze the dataset. The data went through preprocessing, and features were engineered to effectively capture essential information that may impact students' academic performance. A meticulous process for model selection and evaluation was utilized to compare the algorithms' performance with regard to metrics such as accuracy, precision, recall, F-score, specificity, and balanced accuracy. Our results demonstrate significant variability in the performance of the different algorithms, with Artificial Neural Networks (ANNs) and Convolutional Neural Networks (CNNs) showing the highest overall performance, followed closely by Gradient Boosting Classifier (GBC), Neuro-Fuzzy, and Random Forest (RF). The other algorithms exhibit varying performance levels, with the Recurrent Neural Networks (RNNs) showing the weakest results in recall and F-score. Educational institutions can use the insight gained from this study to make data-driven decisions and design targeted interventions to help students at risk succeed academically. Furthermore, the methodology presented in this paper can be generalized and applied to other educational datasets for similar predictive purposes.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊