Burak Cankaya, Robin Roberts, Stephanie Douglas, Rachel Vigness, Asil Oztekin
{"title":"What postpones degree completion? Discovering key predictors of undergraduate degree completion through explainable artificial intelligence (XAI)","authors":"Burak Cankaya, Robin Roberts, Stephanie Douglas, Rachel Vigness, Asil Oztekin","doi":"10.1057/s41270-024-00290-6","DOIUrl":null,"url":null,"abstract":"<p>The timing of degree completion for students taking post-secondary courses has been a constant source of angst for administrators wanting the best outcomes for their students. Most methods for predicting student degree completion extensions are completed by analog methods using human effort to analyze data. The majority of data analysis reporting of degree completion extension variables and impacts has, for decades, been done manually. Administrators primarily forecast the factors based on their expertise and intuition to evaluate implications and repercussions. The variables are large, varied, and situational to each individual and complex. We used machine learning (automated processes using predictive algorithms) to predict undergraduate extensions for at least 2 years beyond a standard 4 years to complete a bachelor's degree. The study builds a machine learning-based education understanding XAI model (ED-XAI) to examine students’ dependent and independent variables and accurately predict/explain degree extension. The study utilized Random Forest, Support Vector Machines, and Deep Learning Machine learning algorithms. XAI used Information Fusion, SHapley Additive exPlanations (SHAP), and Local Interpretable Model-Agnostic Explanations (LIME) models to explain the findings of the Machine Learning models. The ED-XAI model explained multiple scenarios and discovered variables influencing students’ degree completion linked to their status and funding source. The Random Forest model gave supreme predictive results with 89.1% Mean ROC, 71.6% Overall Precision, 86% Overall Recall, and 71.6% In-class Precision. The educational information system introduced in this study has significant implications for accurate variables reporting and impacts on degree extensions leading to successful degree completions minimally reported in higher education marketing analytics research.</p>","PeriodicalId":43041,"journal":{"name":"Journal of Marketing Analytics","volume":"101 1","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Marketing Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1057/s41270-024-00290-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0
Abstract
The timing of degree completion for students taking post-secondary courses has been a constant source of angst for administrators wanting the best outcomes for their students. Most methods for predicting student degree completion extensions are completed by analog methods using human effort to analyze data. The majority of data analysis reporting of degree completion extension variables and impacts has, for decades, been done manually. Administrators primarily forecast the factors based on their expertise and intuition to evaluate implications and repercussions. The variables are large, varied, and situational to each individual and complex. We used machine learning (automated processes using predictive algorithms) to predict undergraduate extensions for at least 2 years beyond a standard 4 years to complete a bachelor's degree. The study builds a machine learning-based education understanding XAI model (ED-XAI) to examine students’ dependent and independent variables and accurately predict/explain degree extension. The study utilized Random Forest, Support Vector Machines, and Deep Learning Machine learning algorithms. XAI used Information Fusion, SHapley Additive exPlanations (SHAP), and Local Interpretable Model-Agnostic Explanations (LIME) models to explain the findings of the Machine Learning models. The ED-XAI model explained multiple scenarios and discovered variables influencing students’ degree completion linked to their status and funding source. The Random Forest model gave supreme predictive results with 89.1% Mean ROC, 71.6% Overall Precision, 86% Overall Recall, and 71.6% In-class Precision. The educational information system introduced in this study has significant implications for accurate variables reporting and impacts on degree extensions leading to successful degree completions minimally reported in higher education marketing analytics research.
期刊介绍:
Data has become the new ore in today’s knowledge economy. However, merely storing and reporting are not enough to thrive in today’s increasingly competitive markets. What is called for is the ability to make sense of all these oceans of data, and to apply those insights to the way companies approach their markets, adjust to changing market conditions, and respond to new competitors.
Marketing analytics lies at the heart of this contemporary wave of data driven decision-making. Companies can no longer survive when they rely on gut instinct to make decisions. Strategic leverage of data is one of the few remaining sources of sustainable competitive advantage. New products can be copied faster than ever before. Staff are becoming less loyal as well as more mobile, and business centers themselves are moving across the globe in a world that is getting flatter and flatter.
The Journal of Marketing Analytics brings together applied research and practice papers in this blossoming field. A unique blend of applied academic research, combined with insights from commercial best practices makes the Journal of Marketing Analytics a perfect companion for academics and practitioners alike. Academics can stay in touch with the latest developments in this field. Marketing analytics professionals can read about the latest trends, and cutting edge academic research in this discipline.
The Journal of Marketing Analytics will feature applied research papers on topics like targeting, segmentation, big data, customer loyalty and lifecycle management, cross-selling, CRM, data quality management, multi-channel marketing, and marketing strategy.
The Journal of Marketing Analytics aims to combine the rigor of carefully controlled scientific research methods with applicability of real world case studies. Our double blind review process ensures that papers are selected on their content and merits alone, selecting the best possible papers in this field.