What postpones degree completion? Discovering key predictors of undergraduate degree completion through explainable artificial intelligence (XAI)

IF 4 Q2 BUSINESS
Burak Cankaya, Robin Roberts, Stephanie Douglas, Rachel Vigness, Asil Oztekin
{"title":"What postpones degree completion? Discovering key predictors of undergraduate degree completion through explainable artificial intelligence (XAI)","authors":"Burak Cankaya, Robin Roberts, Stephanie Douglas, Rachel Vigness, Asil Oztekin","doi":"10.1057/s41270-024-00290-6","DOIUrl":null,"url":null,"abstract":"<p>The timing of degree completion for students taking post-secondary courses has been a constant source of angst for administrators wanting the best outcomes for their students. Most methods for predicting student degree completion extensions are completed by analog methods using human effort to analyze data. The majority of data analysis reporting of degree completion extension variables and impacts has, for decades, been done manually. Administrators primarily forecast the factors based on their expertise and intuition to evaluate implications and repercussions. The variables are large, varied, and situational to each individual and complex. We used machine learning (automated processes using predictive algorithms) to predict undergraduate extensions for at least 2 years beyond a standard 4 years to complete a bachelor's degree. The study builds a machine learning-based education understanding XAI model (ED-XAI) to examine students’ dependent and independent variables and accurately predict/explain degree extension. The study utilized Random Forest, Support Vector Machines, and Deep Learning Machine learning algorithms. XAI used Information Fusion, SHapley Additive exPlanations (SHAP), and Local Interpretable Model-Agnostic Explanations (LIME) models to explain the findings of the Machine Learning models. The ED-XAI model explained multiple scenarios and discovered variables influencing students’ degree completion linked to their status and funding source. The Random Forest model gave supreme predictive results with 89.1% Mean ROC, 71.6% Overall Precision, 86% Overall Recall, and 71.6% In-class Precision. The educational information system introduced in this study has significant implications for accurate variables reporting and impacts on degree extensions leading to successful degree completions minimally reported in higher education marketing analytics research.</p>","PeriodicalId":43041,"journal":{"name":"Journal of Marketing Analytics","volume":"101 1","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Marketing Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1057/s41270-024-00290-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0

Abstract

The timing of degree completion for students taking post-secondary courses has been a constant source of angst for administrators wanting the best outcomes for their students. Most methods for predicting student degree completion extensions are completed by analog methods using human effort to analyze data. The majority of data analysis reporting of degree completion extension variables and impacts has, for decades, been done manually. Administrators primarily forecast the factors based on their expertise and intuition to evaluate implications and repercussions. The variables are large, varied, and situational to each individual and complex. We used machine learning (automated processes using predictive algorithms) to predict undergraduate extensions for at least 2 years beyond a standard 4 years to complete a bachelor's degree. The study builds a machine learning-based education understanding XAI model (ED-XAI) to examine students’ dependent and independent variables and accurately predict/explain degree extension. The study utilized Random Forest, Support Vector Machines, and Deep Learning Machine learning algorithms. XAI used Information Fusion, SHapley Additive exPlanations (SHAP), and Local Interpretable Model-Agnostic Explanations (LIME) models to explain the findings of the Machine Learning models. The ED-XAI model explained multiple scenarios and discovered variables influencing students’ degree completion linked to their status and funding source. The Random Forest model gave supreme predictive results with 89.1% Mean ROC, 71.6% Overall Precision, 86% Overall Recall, and 71.6% In-class Precision. The educational information system introduced in this study has significant implications for accurate variables reporting and impacts on degree extensions leading to successful degree completions minimally reported in higher education marketing analytics research.

Abstract Image

是什么推迟了学位的完成?通过可解释人工智能(XAI)发现本科学位完成的关键预测因素
对于希望学生获得最佳结果的管理者来说,学生完成中学后课程的时间安排一直是个令人苦恼的问题。大多数预测学生学位完成延期的方法都是通过模拟方法,利用人力来分析数据完成的。几十年来,有关学位完成延期的变量和影响的大部分数据分析报告都是人工完成的。管理人员主要根据自己的专业知识和直觉来预测这些因素,以评估其影响和反响。这些变量数量大、种类多、因人而异、情况复杂。我们使用机器学习(使用预测算法的自动化流程)来预测本科生在标准 4 年完成学士学位后至少延长 2 年的时间。该研究建立了一个基于机器学习的教育理解 XAI 模型(ED-XAI),以检查学生的因变量和自变量,并准确预测/解释学位延期。研究采用了随机森林、支持向量机和深度学习机器学习算法。XAI 使用信息融合、SHAPLE Additive exPlanations (SHAP) 和 Local Interpretable Model-Agnostic Explanations (LIME) 模型来解释机器学习模型的结论。ED-XAI 模型解释了多种情况,并发现了与学生身份和资金来源相关的影响学生完成学位的变量。随机森林模型的平均 ROC 为 89.1%,总体精确率为 71.6%,总体召回率为 86%,类内精确率为 71.6%,预测结果极佳。本研究中引入的教育信息系统对准确的变量报告和学位延期的影响具有重要意义,而高等教育营销分析研究中很少有关于成功完成学位的报道。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.40
自引率
16.70%
发文量
46
期刊介绍: Data has become the new ore in today’s knowledge economy. However, merely storing and reporting are not enough to thrive in today’s increasingly competitive markets. What is called for is the ability to make sense of all these oceans of data, and to apply those insights to the way companies approach their markets, adjust to changing market conditions, and respond to new competitors. Marketing analytics lies at the heart of this contemporary wave of data driven decision-making. Companies can no longer survive when they rely on gut instinct to make decisions. Strategic leverage of data is one of the few remaining sources of sustainable competitive advantage. New products can be copied faster than ever before. Staff are becoming less loyal as well as more mobile, and business centers themselves are moving across the globe in a world that is getting flatter and flatter. The Journal of Marketing Analytics brings together applied research and practice papers in this blossoming field. A unique blend of applied academic research, combined with insights from commercial best practices makes the Journal of Marketing Analytics a perfect companion for academics and practitioners alike. Academics can stay in touch with the latest developments in this field. Marketing analytics professionals can read about the latest trends, and cutting edge academic research in this discipline. The Journal of Marketing Analytics will feature applied research papers on topics like targeting, segmentation, big data, customer loyalty and lifecycle management, cross-selling, CRM, data quality management, multi-channel marketing, and marketing strategy. The Journal of Marketing Analytics aims to combine the rigor of carefully controlled scientific research methods with applicability of real world case studies. Our double blind review process ensures that papers are selected on their content and merits alone, selecting the best possible papers in this field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信