Raúl Marticorena-Sánchez, Antonio Canepa-Oneto, Carlos López-Nozal, José A. Barbero-Aparicio
{"title":"Unveiling the Differences in Early Performance Prediction Between Online Social Sciences and STEM Courses Using Educational Data Mining","authors":"Raúl Marticorena-Sánchez, Antonio Canepa-Oneto, Carlos López-Nozal, José A. Barbero-Aparicio","doi":"10.1111/exsy.13837","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Educational Data Mining and Learning Analytics in virtual environments can be used to diagnose student performance problems at an early stage. Information that is useful for guiding the decisions of teachers managing academic training, so that students can successfully complete their course. However, student interaction patterns may vary depending on the knowledge domain. Our aim is to design a framework applicable to online Social Sciences and STEM courses, recommending methods for building accurate early performance prediction models. A large-scale comparative study of the accuracy of multiple classifiers applied to classify the interaction logs of 32,593 students from 9 Social Sciences and 13 STEM courses is presented. Corroborating the results of other works, it was observed that high early performance prediction accuracy was obtained based on nothing other than student logs: accuracies of 0.75 in the 10th week, 0.80 in the 20th week, 0.85 in the 30th week and 0.90 in the 40th week. However, accuracy rates were observed to vary significantly, in relation to the classification algorithm and the knowledge domain (Social Sciences vs. STEM). These predictions are generally less accurate for Social Sciences compared to STEM courses, especially at the beginning of the course, with fewer differences observed in the final weeks. Additionally, this research identifies instances of low-accuracy outliers in the prediction of Social Sciences courses over time. These findings highlight the complex challenges and variations in early performance prediction across different domains in online education.</p>\n </div>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 3","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.13837","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Educational Data Mining and Learning Analytics in virtual environments can be used to diagnose student performance problems at an early stage. Information that is useful for guiding the decisions of teachers managing academic training, so that students can successfully complete their course. However, student interaction patterns may vary depending on the knowledge domain. Our aim is to design a framework applicable to online Social Sciences and STEM courses, recommending methods for building accurate early performance prediction models. A large-scale comparative study of the accuracy of multiple classifiers applied to classify the interaction logs of 32,593 students from 9 Social Sciences and 13 STEM courses is presented. Corroborating the results of other works, it was observed that high early performance prediction accuracy was obtained based on nothing other than student logs: accuracies of 0.75 in the 10th week, 0.80 in the 20th week, 0.85 in the 30th week and 0.90 in the 40th week. However, accuracy rates were observed to vary significantly, in relation to the classification algorithm and the knowledge domain (Social Sciences vs. STEM). These predictions are generally less accurate for Social Sciences compared to STEM courses, especially at the beginning of the course, with fewer differences observed in the final weeks. Additionally, this research identifies instances of low-accuracy outliers in the prediction of Social Sciences courses over time. These findings highlight the complex challenges and variations in early performance prediction across different domains in online education.
期刊介绍:
Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper.
As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.