Data Mining on the Fundamental Factors Influencing Mathematics Achievement: Traditional and Modern Perspectives

IF 3.6 3区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH
Burcu Koca Guler, Fulya Gokalp Yavuz
{"title":"Data Mining on the Fundamental Factors Influencing Mathematics Achievement: Traditional and Modern Perspectives","authors":"Burcu Koca Guler,&nbsp;Fulya Gokalp Yavuz","doi":"10.1111/ejed.70281","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Assessing achievement is a complex task due to its dependence on multiple factors and the hierarchical structure of educational data, yet surveys like TIMSS offer valuable insights into its determining factors like students' mathematics anxiety. However, disregarding the nested structure of data and ignoring the assumptions of models causes poor performance such as inaccurate predictions and biased estimates. Our research utilises linear mixed models (LMMs) and machine learning (ML) techniques (e.g., REEM-tree and GP boosting) especially chosen for their abilities to model nested data and capture non-linear relationships. This study is a pioneer in the literature as these ML algorithms are implemented for the first time in TIMSS. Accordingly, mathematical tendency and emotional factors are the two primary predictors of mathematics achievement across all methods, acknowledging the possibility of potential bias due to reliance on self-report responses. However, there are variations in the effect size of the students' origins among the methods. This indicates different algorithms yield distinct results according to their inner processes and priorities, such as revealing statistical significance of predictors or contributing to predictive performance. Moreover, gender has a negligible impact across all models in our analysis, caused by cultural differences in the sample. Overall, while LMMs are widely accepted, ML methods remain competitive alternatives in prediction and flexibility. All three methods yield similar benchmarks, yet ML methods offer slightly better performance in RMSE, MAE, and MAPE while exhibiting high predictive power and capturing nonlinearity and interaction. Although they take more computation time, parallel processing mitigates this in larger datasets. Consequently, ML methods and LMMs concurrently provide broader and more precise insights in terms of predictive and inferential gains.</p>\n </div>","PeriodicalId":47585,"journal":{"name":"European Journal of Education","volume":"60 4","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Education","FirstCategoryId":"95","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/ejed.70281","RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

Abstract

Assessing achievement is a complex task due to its dependence on multiple factors and the hierarchical structure of educational data, yet surveys like TIMSS offer valuable insights into its determining factors like students' mathematics anxiety. However, disregarding the nested structure of data and ignoring the assumptions of models causes poor performance such as inaccurate predictions and biased estimates. Our research utilises linear mixed models (LMMs) and machine learning (ML) techniques (e.g., REEM-tree and GP boosting) especially chosen for their abilities to model nested data and capture non-linear relationships. This study is a pioneer in the literature as these ML algorithms are implemented for the first time in TIMSS. Accordingly, mathematical tendency and emotional factors are the two primary predictors of mathematics achievement across all methods, acknowledging the possibility of potential bias due to reliance on self-report responses. However, there are variations in the effect size of the students' origins among the methods. This indicates different algorithms yield distinct results according to their inner processes and priorities, such as revealing statistical significance of predictors or contributing to predictive performance. Moreover, gender has a negligible impact across all models in our analysis, caused by cultural differences in the sample. Overall, while LMMs are widely accepted, ML methods remain competitive alternatives in prediction and flexibility. All three methods yield similar benchmarks, yet ML methods offer slightly better performance in RMSE, MAE, and MAPE while exhibiting high predictive power and capturing nonlinearity and interaction. Although they take more computation time, parallel processing mitigates this in larger datasets. Consequently, ML methods and LMMs concurrently provide broader and more precise insights in terms of predictive and inferential gains.

Abstract Image

影响数学成绩基本因素的数据挖掘:传统与现代视角
评估成绩是一项复杂的任务,因为它依赖于多种因素和教育数据的层次结构,然而像TIMSS这样的调查为学生的数学焦虑等决定因素提供了有价值的见解。然而,忽略数据的嵌套结构和忽略模型的假设会导致较差的性能,如不准确的预测和有偏差的估计。我们的研究利用线性混合模型(lmm)和机器学习(ML)技术(例如,REEM-tree和GP boosting),特别是因为它们对嵌套数据建模和捕获非线性关系的能力而被选中。这项研究是文献中的先驱,因为这些ML算法首次在TIMSS中实现。因此,数学倾向和情感因素是所有方法中数学成绩的两个主要预测因素,承认由于依赖自我报告反应而存在潜在偏差的可能性。然而,在不同的方法中,学生来源的效应大小存在差异。这表明不同的算法根据其内部过程和优先级产生不同的结果,例如揭示预测因子的统计显著性或有助于预测性能。此外,在我们的分析中,性别对所有模型的影响都可以忽略不计,这是由样本中的文化差异造成的。总体而言,虽然lmm被广泛接受,但ML方法在预测和灵活性方面仍然具有竞争力。所有三种方法都产生类似的基准,然而ML方法在RMSE, MAE和MAPE中提供稍好的性能,同时表现出高预测能力并捕获非线性和相互作用。虽然它们需要更多的计算时间,但并行处理在较大的数据集中减轻了这一点。因此,ML方法和lmm同时在预测和推理方面提供了更广泛和更精确的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Education
European Journal of Education EDUCATION & EDUCATIONAL RESEARCH-
CiteScore
4.50
自引率
0.00%
发文量
47
期刊介绍: The prime aims of the European Journal of Education are: - To examine, compare and assess education policies, trends, reforms and programmes of European countries in an international perspective - To disseminate policy debates and research results to a wide audience of academics, researchers, practitioners and students of education sciences - To contribute to the policy debate at the national and European level by providing European administrators and policy-makers in international organisations, national and local governments with comparative and up-to-date material centred on specific themes of common interest.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信