The Performance-Actionability Trade-Off in Retention Prediction at Middle School

Susana Lavado, Miguel Mateus, Leid Zejnilovic
{"title":"The Performance-Actionability Trade-Off in Retention Prediction at Middle School","authors":"Susana Lavado, Miguel Mateus, Leid Zejnilovic","doi":"10.1109/ICMLA55696.2022.00087","DOIUrl":null,"url":null,"abstract":"Predicting students’ retention risk is one of the major trends in machine learning applications in education. While early identification of at-risk students allows timely planning and implementation of measures to prevent adverse outcomes, there is a trade-off between the predictive model’s performance and the prediction window size, or model performance and its actionability. In this study, we used a dataset of 83,596 unique Portuguese students in grades 5th to 9th to predict retention at or before the end of 9th grade. We explored how different prediction window sizes impact the predictive model’s performance, the feature importance, and the models’ bias. The models with the shorter prediction window performed better in terms of precision, but the model with the largest prediction window showed a higher lift over the existing rule-based model. Prediction window size impacted the importance of demographic features and model’s fairness. Our results contribute to the extant discussion on predicting retention, by adding empirical evidence about the models’ added value in performance versus the existing practice, suggesting types of data to collect and use, and discussing education-specific challenges of responsible data science.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA55696.2022.00087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting students’ retention risk is one of the major trends in machine learning applications in education. While early identification of at-risk students allows timely planning and implementation of measures to prevent adverse outcomes, there is a trade-off between the predictive model’s performance and the prediction window size, or model performance and its actionability. In this study, we used a dataset of 83,596 unique Portuguese students in grades 5th to 9th to predict retention at or before the end of 9th grade. We explored how different prediction window sizes impact the predictive model’s performance, the feature importance, and the models’ bias. The models with the shorter prediction window performed better in terms of precision, but the model with the largest prediction window showed a higher lift over the existing rule-based model. Prediction window size impacted the importance of demographic features and model’s fairness. Our results contribute to the extant discussion on predicting retention, by adding empirical evidence about the models’ added value in performance versus the existing practice, suggesting types of data to collect and use, and discussing education-specific challenges of responsible data science.
中学保留预测的绩效-行动权衡
预测学生的滞留风险是机器学习在教育领域应用的主要趋势之一。虽然早期识别有风险的学生可以及时规划和实施措施,以防止不良后果,但在预测模型的性能和预测窗口大小之间,或模型性能与其可操作性之间存在权衡。在这项研究中,我们使用了83596名五年级到九年级的葡萄牙学生的数据集来预测九年级结束或之前的保留率。我们探讨了不同的预测窗口大小如何影响预测模型的性能、特征重要性和模型的偏差。预测窗口较短的模型在精度方面表现较好,但预测窗口最大的模型比现有的基于规则的模型有更高的提升。预测窗口大小影响人口统计学特征的重要性和模型的公平性。我们的结果有助于现有的关于预测保留的讨论,通过添加关于模型在绩效方面的附加值与现有实践的经验证据,建议收集和使用的数据类型,并讨论负责任的数据科学的教育特定挑战。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信