Fatema Tuz Johora , Md Nahid Hasan , Aditya Rajbongshi , Md Ashrafuzzaman , Farzana Akter
{"title":"An explainable AI-based approach for predicting undergraduate students academic performance","authors":"Fatema Tuz Johora , Md Nahid Hasan , Aditya Rajbongshi , Md Ashrafuzzaman , Farzana Akter","doi":"10.1016/j.array.2025.100384","DOIUrl":null,"url":null,"abstract":"<div><div>The accurate prediction of students' academic achievement has garnered considerable attention in the research community due to its importance in understanding students' progress and assisting them in achieving success. This study presents a novel approach for predicting undergraduate student's performance in the context of Bangladesh. The dataset contains 872 student records from multiple institutions. Initially the dataset was produced utilizing data-preprocessing techniques such as one-hot encoding, column remaining, and managing missing values. SMOTE (Synthetic Minority Oversampling Technique) and normalizing algorithms were employed to attain data balance and feature scaling, respectively. Afterwards, a total of seven distinct machine learning (ML) classifiers, with hyperparameter tuning, were employed to train and test in order to achieve the prediction of students' academic performance. Furthermore, a custom stacking ensemble classifier was utilized, which attained an accuracy of 86.38 %. This classifier outperformed the machine learning classifiers based on the four performance evaluation metrics. Two eXplainable Artificial Intelligence (XAI) algorithms, namely SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations), were integrated to provide a comprehensible prediction of the best model and determine the significant factors. This approach provided transparency, fairness and reliability on prediction that improved student performance in the classroom and anticipation.</div></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":"26 ","pages":"Article 100384"},"PeriodicalIF":2.3000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005625000116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate prediction of students' academic achievement has garnered considerable attention in the research community due to its importance in understanding students' progress and assisting them in achieving success. This study presents a novel approach for predicting undergraduate student's performance in the context of Bangladesh. The dataset contains 872 student records from multiple institutions. Initially the dataset was produced utilizing data-preprocessing techniques such as one-hot encoding, column remaining, and managing missing values. SMOTE (Synthetic Minority Oversampling Technique) and normalizing algorithms were employed to attain data balance and feature scaling, respectively. Afterwards, a total of seven distinct machine learning (ML) classifiers, with hyperparameter tuning, were employed to train and test in order to achieve the prediction of students' academic performance. Furthermore, a custom stacking ensemble classifier was utilized, which attained an accuracy of 86.38 %. This classifier outperformed the machine learning classifiers based on the four performance evaluation metrics. Two eXplainable Artificial Intelligence (XAI) algorithms, namely SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations), were integrated to provide a comprehensible prediction of the best model and determine the significant factors. This approach provided transparency, fairness and reliability on prediction that improved student performance in the classroom and anticipation.