Profiling and Archetyping of Higher Education Applicants Using Intelligent Data Analysis Techniques

IF 1 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Cindy Espinoza;Jesús Carretero
{"title":"Profiling and Archetyping of Higher Education Applicants Using Intelligent Data Analysis Techniques","authors":"Cindy Espinoza;Jesús Carretero","doi":"10.1109/RITA.2025.3583369","DOIUrl":null,"url":null,"abstract":"Student dropout is a significant challenge in higher education, generating frustration in society and wasting resources. As a result, student retention constitutes a constant challenge for higher education institutions everywhere. This work focuses on the question: Can intelligent predictive data analysis techniques be applied to reduce the dropout rate in public and private universities? To answer this question, we have adopted an exploratory methodological approach based on historical data from approximately 13,715 applicants who later became university students. Unlike other research, based on publicly available data and statistics, our work relies on five years of actual data of students whose behavior has been synthesized in 27 variables related to socioeconomic, academic, and family factors and analyzes it. This paper has two main contributions. First, we propose intelligent predictive data analytics techniques and demonstrate that it is possible to profile and target the applicant for higher education as a strategy to reduce the dropout rate and improve their student welfare, so that the dropout probability can be used as part of an early warning in the recruitment process. Second, we propose a methodology for the segmentation and/or archetyping of applicants, which can be part of a corrective alert in the adaptation process. The profiling model and archetyping are replicable in private and public universities since we use easily extractable generic variables that do not require the university to have a high level of maturity in data management processes. Therefore, our results contribute to educational data mining (EDM), demonstrating that intelligent predictive data analysis techniques can be used to profile and archetype private and public university applicants for higher education. The evaluation of our solution proved that the neural network model profiled the dropout applicants with an accuracy higher than up to 97%, after which unsupervised learning was applied to generate archetypes.","PeriodicalId":38963,"journal":{"name":"Revista Iberoamericana de Tecnologias del Aprendizaje","volume":"20 ","pages":"139-151"},"PeriodicalIF":1.0000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Iberoamericana de Tecnologias del Aprendizaje","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11052675/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Student dropout is a significant challenge in higher education, generating frustration in society and wasting resources. As a result, student retention constitutes a constant challenge for higher education institutions everywhere. This work focuses on the question: Can intelligent predictive data analysis techniques be applied to reduce the dropout rate in public and private universities? To answer this question, we have adopted an exploratory methodological approach based on historical data from approximately 13,715 applicants who later became university students. Unlike other research, based on publicly available data and statistics, our work relies on five years of actual data of students whose behavior has been synthesized in 27 variables related to socioeconomic, academic, and family factors and analyzes it. This paper has two main contributions. First, we propose intelligent predictive data analytics techniques and demonstrate that it is possible to profile and target the applicant for higher education as a strategy to reduce the dropout rate and improve their student welfare, so that the dropout probability can be used as part of an early warning in the recruitment process. Second, we propose a methodology for the segmentation and/or archetyping of applicants, which can be part of a corrective alert in the adaptation process. The profiling model and archetyping are replicable in private and public universities since we use easily extractable generic variables that do not require the university to have a high level of maturity in data management processes. Therefore, our results contribute to educational data mining (EDM), demonstrating that intelligent predictive data analysis techniques can be used to profile and archetype private and public university applicants for higher education. The evaluation of our solution proved that the neural network model profiled the dropout applicants with an accuracy higher than up to 97%, after which unsupervised learning was applied to generate archetypes.
使用智能数据分析技术对高等教育申请者进行剖析和原型化
学生辍学是高等教育面临的一个重大挑战,它给社会带来挫折,浪费资源。因此,生源问题对各地的高等教育机构构成了一个持续的挑战。这项工作关注的问题是:智能预测数据分析技术能否应用于降低公立和私立大学的辍学率?为了回答这个问题,我们采用了一种探索性的方法,该方法基于大约13,715名后来成为大学生的申请人的历史数据。与其他基于公开数据和统计数据的研究不同,我们的工作依赖于5年来学生的实际数据,这些学生的行为被综合在27个与社会经济、学术和家庭因素相关的变量中,并对其进行分析。本文有两个主要贡献。首先,我们提出了智能预测数据分析技术,并证明了可以将高等教育申请人作为降低辍学率和提高学生福利的策略,从而将辍学率用作招聘过程中早期预警的一部分。其次,我们提出了一种对申请人进行分割和/或原型化的方法,这可以作为适应过程中纠正警报的一部分。分析模型和原型在私立和公立大学中都是可复制的,因为我们使用了易于提取的通用变量,这些变量不需要大学在数据管理过程中具有很高的成熟度。因此,我们的研究结果有助于教育数据挖掘(EDM),表明智能预测数据分析技术可以用来描述和原型私立和公立大学的高等教育申请者。对我们的解决方案的评估证明,神经网络模型对辍学申请者的描述准确率高达97%,之后应用无监督学习生成原型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.30
自引率
0.00%
发文量
45
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信