Enhancing the Admissions Process for Engineering Baccalaureate Programs: A Machine Learning Approach to Developing a Valid and Reliable Examination

Marben S. Ramos, John Raymond B. Barajas, Pee Jay N. Gealone, Nico O. Aspra, Oliver M. Padua, Arpon T. Lucero
{"title":"Enhancing the Admissions Process for Engineering Baccalaureate Programs: A Machine Learning Approach to Developing a Valid and Reliable Examination","authors":"Marben S. Ramos, John Raymond B. Barajas, Pee Jay N. Gealone, Nico O. Aspra, Oliver M. Padua, Arpon T. Lucero","doi":"10.1109/SIEDS58326.2023.10137793","DOIUrl":null,"url":null,"abstract":"This paper details an approach to identify multiple-choice questions that are most effective in discriminating deficiencies in mathematics competencies of incoming first-year engineering students who are graduates of the K-12 program that was recently implemented in the Philippines. To achieve this objective, machine learning algorithms such as the k-Nearest Neighbors (kNN), Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and Gradient Boosting Machines (GBM) were implemented. From a question bank containing 1,300 questions covering Algebra (A), Advanced Algebra (AA), Plane and Spherical Trigonometry (T), Analytic Geometry (AG), and Solid Mensuration (SM), five domain experts identified the suitability of the questions as part of a diagnostic examination in mathematics. Specifically, using a 5-point Likert scale (5 being the highest), the experts rated the suitability of each question to test the proficiency of a student in 23 mathematics competencies based on what is prescribed by the Commission on Higher Education (CHED). The collected survey data were then used to train the machine learning models, which extracted patterns to identify the questions that would be most suitable to test the mathematics competencies of incoming first-year engineering students. With a precision recall score of 99.90%, the LR model was selected as the best performing model and analysis of how the LR model predicts the labels through the use of shap values revealed that the preference was given towards questions which test student proficiency in foundational mathematics competencies like that of Algebra and Analytical Geometry. Overall, these findings provided a better understanding of the questions that are most effective in discriminating student deficiencies in mathematics subjects.","PeriodicalId":267464,"journal":{"name":"2023 Systems and Information Engineering Design Symposium (SIEDS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIEDS58326.2023.10137793","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper details an approach to identify multiple-choice questions that are most effective in discriminating deficiencies in mathematics competencies of incoming first-year engineering students who are graduates of the K-12 program that was recently implemented in the Philippines. To achieve this objective, machine learning algorithms such as the k-Nearest Neighbors (kNN), Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and Gradient Boosting Machines (GBM) were implemented. From a question bank containing 1,300 questions covering Algebra (A), Advanced Algebra (AA), Plane and Spherical Trigonometry (T), Analytic Geometry (AG), and Solid Mensuration (SM), five domain experts identified the suitability of the questions as part of a diagnostic examination in mathematics. Specifically, using a 5-point Likert scale (5 being the highest), the experts rated the suitability of each question to test the proficiency of a student in 23 mathematics competencies based on what is prescribed by the Commission on Higher Education (CHED). The collected survey data were then used to train the machine learning models, which extracted patterns to identify the questions that would be most suitable to test the mathematics competencies of incoming first-year engineering students. With a precision recall score of 99.90%, the LR model was selected as the best performing model and analysis of how the LR model predicts the labels through the use of shap values revealed that the preference was given towards questions which test student proficiency in foundational mathematics competencies like that of Algebra and Analytical Geometry. Overall, these findings provided a better understanding of the questions that are most effective in discriminating student deficiencies in mathematics subjects.
加强工程学士学位课程的录取过程:一种开发有效可靠考试的机器学习方法
本文详细介绍了一种识别多项选择题的方法,这些选择题最有效地区分了最近在菲律宾实施的K-12项目的一年级工程专业毕业生在数学能力方面的缺陷。为了实现这一目标,实现了k近邻(kNN)、逻辑回归(LR)、随机森林(RF)、决策树(DT)和梯度增强机(GBM)等机器学习算法。从包含1300个问题的题库中,包括代数(a)、高等代数(AA)、平面和球面三角(T)、解析几何(AG)和立体测量(SM),五位领域专家确定了这些问题作为数学诊断考试的一部分的适用性。具体来说,使用5分李克特量表(最高5分),专家们根据高等教育委员会(CHED)规定的内容,对每个问题的适用性进行评级,以测试学生在23项数学能力中的熟练程度。收集到的调查数据随后被用于训练机器学习模型,该模型提取模式,以确定最适合测试即将入学的一年级工程专业学生数学能力的问题。以99.90%的查全率,LR模型被选为表现最好的模型。通过分析LR模型如何通过形状值预测标签,发现学生更倾向于测试学生对基础数学能力(如代数和解析几何)熟练程度的问题。总的来说,这些发现提供了一个更好的理解的问题,是最有效的区分学生在数学科目的缺陷。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信