推进教育数据挖掘以增强学生成绩预测:特征选择算法和分类技术与动态特征集成进化的融合。

IF 3.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Saleem Malik, S Gopal Krishna Patro, Chandrakanta Mahanty, Rashmi Hegde, Quadri Noorulhasan Naveed, Ayodele Lasisi, Abdulrajak Buradi, Addisu Frinjo Emma, Naoufel Kraiem
{"title":"推进教育数据挖掘以增强学生成绩预测:特征选择算法和分类技术与动态特征集成进化的融合。","authors":"Saleem Malik, S Gopal Krishna Patro, Chandrakanta Mahanty, Rashmi Hegde, Quadri Noorulhasan Naveed, Ayodele Lasisi, Abdulrajak Buradi, Addisu Frinjo Emma, Naoufel Kraiem","doi":"10.1038/s41598-025-92324-x","DOIUrl":null,"url":null,"abstract":"<p><p>The integration of technology into educational institutions has led to the generation of vast data, creating opportunities for Educational Data Mining (EDM) to improve learning outcomes. This study introduces a novel feature selection model, \"Dynamic Feature Ensemble Evolution for Enhanced Feature Selection\" (DE-FS), which combines traditional methods such as correlation matrix analysis, information gain, and Chi-square with heat maps to select the most relevant features for predicting student performance. The core innovation of DE-FS lies in its dynamic and adaptive thresholding mechanism, which adjusts thresholds based on evolving data patterns, addressing the limitations of static methods and mitigating issues like overfitting and underfitting. This research makes three key contributions: it introduces an advanced ensemble-based feature selection methodology, incorporates dynamic and adaptive thresholding to improve accuracy and flexibility, and demonstrates DE-FS's superior predictive performance across diverse educational datasets. The results highlight DE-FS's ability to adapt to fluctuating data patterns, enabling precise and reliable student performance predictions, supporting targeted interventions, and improving resource allocation to enhance personalized learning experiences.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"8738"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11906737/pdf/","citationCount":"0","resultStr":"{\"title\":\"Advancing educational data mining for enhanced student performance prediction: a fusion of feature selection algorithms and classification techniques with dynamic feature ensemble evolution.\",\"authors\":\"Saleem Malik, S Gopal Krishna Patro, Chandrakanta Mahanty, Rashmi Hegde, Quadri Noorulhasan Naveed, Ayodele Lasisi, Abdulrajak Buradi, Addisu Frinjo Emma, Naoufel Kraiem\",\"doi\":\"10.1038/s41598-025-92324-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The integration of technology into educational institutions has led to the generation of vast data, creating opportunities for Educational Data Mining (EDM) to improve learning outcomes. This study introduces a novel feature selection model, \\\"Dynamic Feature Ensemble Evolution for Enhanced Feature Selection\\\" (DE-FS), which combines traditional methods such as correlation matrix analysis, information gain, and Chi-square with heat maps to select the most relevant features for predicting student performance. The core innovation of DE-FS lies in its dynamic and adaptive thresholding mechanism, which adjusts thresholds based on evolving data patterns, addressing the limitations of static methods and mitigating issues like overfitting and underfitting. This research makes three key contributions: it introduces an advanced ensemble-based feature selection methodology, incorporates dynamic and adaptive thresholding to improve accuracy and flexibility, and demonstrates DE-FS's superior predictive performance across diverse educational datasets. The results highlight DE-FS's ability to adapt to fluctuating data patterns, enabling precise and reliable student performance predictions, supporting targeted interventions, and improving resource allocation to enhance personalized learning experiences.</p>\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"8738\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11906737/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-92324-x\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-92324-x","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

技术与教育机构的融合产生了大量数据,为教育数据挖掘(EDM)提高学习效果创造了机会。本研究介绍了一种新颖的特征选择模型--"增强特征选择的动态特征集合演化"(DE-FS),它将相关矩阵分析、信息增益、Chi-square 等传统方法与热图相结合,选择出与预测学生成绩最相关的特征。DE-FS 的核心创新点在于其动态自适应阈值机制,可根据不断变化的数据模式调整阈值,解决了静态方法的局限性,并减轻了过拟合和欠拟合等问题。这项研究有三个主要贡献:它引入了先进的基于集合的特征选择方法,结合了动态和自适应阈值以提高准确性和灵活性,并展示了 DE-FS 在不同教育数据集上的卓越预测性能。研究结果突出表明,DE-FS 能够适应波动的数据模式,从而实现精确可靠的学生成绩预测,支持有针对性的干预措施,并改善资源分配以增强个性化学习体验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Advancing educational data mining for enhanced student performance prediction: a fusion of feature selection algorithms and classification techniques with dynamic feature ensemble evolution.

The integration of technology into educational institutions has led to the generation of vast data, creating opportunities for Educational Data Mining (EDM) to improve learning outcomes. This study introduces a novel feature selection model, "Dynamic Feature Ensemble Evolution for Enhanced Feature Selection" (DE-FS), which combines traditional methods such as correlation matrix analysis, information gain, and Chi-square with heat maps to select the most relevant features for predicting student performance. The core innovation of DE-FS lies in its dynamic and adaptive thresholding mechanism, which adjusts thresholds based on evolving data patterns, addressing the limitations of static methods and mitigating issues like overfitting and underfitting. This research makes three key contributions: it introduces an advanced ensemble-based feature selection methodology, incorporates dynamic and adaptive thresholding to improve accuracy and flexibility, and demonstrates DE-FS's superior predictive performance across diverse educational datasets. The results highlight DE-FS's ability to adapt to fluctuating data patterns, enabling precise and reliable student performance predictions, supporting targeted interventions, and improving resource allocation to enhance personalized learning experiences.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Scientific Reports
Scientific Reports Natural Science Disciplines-
CiteScore
7.50
自引率
4.30%
发文量
19567
审稿时长
3.9 months
期刊介绍: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信