分析学生学业成绩的影响因素:预测模型与启示

Fahmida Faiza Ananna, Ruchira Nowreen, Sakhar Saad Rashid Al Jahwari, Enzo Anindya Costa, Lorita Angeline, Siva Raja Sindiramutty
{"title":"分析学生学业成绩的影响因素:预测模型与启示","authors":"Fahmida Faiza Ananna, Ruchira Nowreen, Sakhar Saad Rashid Al Jahwari, Enzo Anindya Costa, Lorita Angeline, Siva Raja Sindiramutty","doi":"10.54938/ijemdcsai.2023.02.1.254","DOIUrl":null,"url":null,"abstract":"The fascination with understanding student academic performance has drawn widespread attention from various stakeholders, including parents, policymakers, and businesses. The 'Students Performance in Exams' dataset, available on platforms like Kaggle, stands as a treasure trove. It extends beyond test scores, encompassing diverse student attributes like ethnicity, gender, parental education, test preparation, and even lunch type. In our tech-driven age, predicting academic success has become a compelling pursuit. This study aims to delve deep into this dataset, utilizing data mining methods and robust classification algorithms like Logistic Regression and Random Forest in a Jupyter Notebook environment. Rigorous model training, testing, and fine-tuning strive for the utmost predictive accuracy. Data cleaning and preprocessing play a crucial role in establishing a reliable dataset for accurate predictions. Beyond numbers, the project emphasizes data visualization's impact, transforming raw data into comprehensible insights for effective communication. The Logistic Regression Model exhibits an impressive 87.6% accuracy, highlighting its potential in predicting academic performance. Moreover, the Random Forest Model excels with a remarkable 100% accuracy in forecasting student grades, showcasing its effectiveness in this domain.","PeriodicalId":448083,"journal":{"name":"International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence","volume":"584 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysing Influential Factors in Student Academic Achievement: Prediction Modelling and Insight\",\"authors\":\"Fahmida Faiza Ananna, Ruchira Nowreen, Sakhar Saad Rashid Al Jahwari, Enzo Anindya Costa, Lorita Angeline, Siva Raja Sindiramutty\",\"doi\":\"10.54938/ijemdcsai.2023.02.1.254\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The fascination with understanding student academic performance has drawn widespread attention from various stakeholders, including parents, policymakers, and businesses. The 'Students Performance in Exams' dataset, available on platforms like Kaggle, stands as a treasure trove. It extends beyond test scores, encompassing diverse student attributes like ethnicity, gender, parental education, test preparation, and even lunch type. In our tech-driven age, predicting academic success has become a compelling pursuit. This study aims to delve deep into this dataset, utilizing data mining methods and robust classification algorithms like Logistic Regression and Random Forest in a Jupyter Notebook environment. Rigorous model training, testing, and fine-tuning strive for the utmost predictive accuracy. Data cleaning and preprocessing play a crucial role in establishing a reliable dataset for accurate predictions. Beyond numbers, the project emphasizes data visualization's impact, transforming raw data into comprehensible insights for effective communication. The Logistic Regression Model exhibits an impressive 87.6% accuracy, highlighting its potential in predicting academic performance. Moreover, the Random Forest Model excels with a remarkable 100% accuracy in forecasting student grades, showcasing its effectiveness in this domain.\",\"PeriodicalId\":448083,\"journal\":{\"name\":\"International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence\",\"volume\":\"584 \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54938/ijemdcsai.2023.02.1.254\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54938/ijemdcsai.2023.02.1.254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

了解学生学习成绩的魅力已引起包括家长、政策制定者和企业在内的各利益相关方的广泛关注。在 Kaggle 等平台上提供的 "学生考试成绩 "数据集是一个宝库。它不仅包括考试成绩,还包括种族、性别、父母教育程度、备考情况甚至午餐类型等各种学生属性。在我们这个技术驱动的时代,预测学业成功已成为一种引人注目的追求。本研究旨在深入研究这一数据集,在 Jupyter Notebook 环境中利用数据挖掘方法和强大的分类算法(如逻辑回归和随机森林)。严格的模型训练、测试和微调力求达到最高的预测准确性。数据清理和预处理在建立可靠的数据集以进行准确预测方面发挥着至关重要的作用。除了数字之外,该项目还强调数据可视化的影响,将原始数据转化为可理解的见解,以便进行有效的交流。逻辑回归模型的准确率达到了令人印象深刻的 87.6%,凸显了其在预测学习成绩方面的潜力。此外,随机森林模型在预测学生成绩方面的准确率达到了惊人的 100%,显示了它在这一领域的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysing Influential Factors in Student Academic Achievement: Prediction Modelling and Insight
The fascination with understanding student academic performance has drawn widespread attention from various stakeholders, including parents, policymakers, and businesses. The 'Students Performance in Exams' dataset, available on platforms like Kaggle, stands as a treasure trove. It extends beyond test scores, encompassing diverse student attributes like ethnicity, gender, parental education, test preparation, and even lunch type. In our tech-driven age, predicting academic success has become a compelling pursuit. This study aims to delve deep into this dataset, utilizing data mining methods and robust classification algorithms like Logistic Regression and Random Forest in a Jupyter Notebook environment. Rigorous model training, testing, and fine-tuning strive for the utmost predictive accuracy. Data cleaning and preprocessing play a crucial role in establishing a reliable dataset for accurate predictions. Beyond numbers, the project emphasizes data visualization's impact, transforming raw data into comprehensible insights for effective communication. The Logistic Regression Model exhibits an impressive 87.6% accuracy, highlighting its potential in predicting academic performance. Moreover, the Random Forest Model excels with a remarkable 100% accuracy in forecasting student grades, showcasing its effectiveness in this domain.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信