Daniela Alejandra Gomez Cravioto, Ramon Eduardo Diaz Ramos, M. Galaz, N. H. Gress, Héctor Gibrán Ceballos Cancino
{"title":"用决策树分析影响校友研究生学业成就的因素","authors":"Daniela Alejandra Gomez Cravioto, Ramon Eduardo Diaz Ramos, M. Galaz, N. H. Gress, Héctor Gibrán Ceballos Cancino","doi":"10.1109/CSASE48920.2020.9142069","DOIUrl":null,"url":null,"abstract":"In Mexico, higher education is constantly suffering from low percentage of placement and interest of individuals for a graduate degree. Mexico needs more postgraduate students to increase the research and development activities and boost innovation in the private sector, especially in strategic industries. This paper suggests the use of data mining techniques to explore alumni factors and understand if these have a relationship with the alumnus returning to study a postgraduate degree. Fifteen attributes obtained from an alumni survey study were analyzed; this survey contains information from 12,780 former students, which graduated from a bachelor’s degree in Tec de Monterrey. The Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology is used, and the machine learning algorithms, Random Forest, J48 and REPTree are compared to identify the best approach to build a classification model which can predict whether an alumni will study or not a postgraduate degree. For the purpose of this research, the data mining tool used was the Waikato Environment for Knowledge Analysis (WEKA). The resulting model shows that random forest outperforms the other decision tree algorithms based on the accuracy and classifier error, which drives the conclusion that this is a more suitable classifier for the explored dataset.","PeriodicalId":254581,"journal":{"name":"2020 International Conference on Computer Science and Software Engineering (CSASE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Analysing Factors That Influence Alumni Graduate Studies Attainment with Decision Trees\",\"authors\":\"Daniela Alejandra Gomez Cravioto, Ramon Eduardo Diaz Ramos, M. Galaz, N. H. Gress, Héctor Gibrán Ceballos Cancino\",\"doi\":\"10.1109/CSASE48920.2020.9142069\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Mexico, higher education is constantly suffering from low percentage of placement and interest of individuals for a graduate degree. Mexico needs more postgraduate students to increase the research and development activities and boost innovation in the private sector, especially in strategic industries. This paper suggests the use of data mining techniques to explore alumni factors and understand if these have a relationship with the alumnus returning to study a postgraduate degree. Fifteen attributes obtained from an alumni survey study were analyzed; this survey contains information from 12,780 former students, which graduated from a bachelor’s degree in Tec de Monterrey. The Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology is used, and the machine learning algorithms, Random Forest, J48 and REPTree are compared to identify the best approach to build a classification model which can predict whether an alumni will study or not a postgraduate degree. For the purpose of this research, the data mining tool used was the Waikato Environment for Knowledge Analysis (WEKA). The resulting model shows that random forest outperforms the other decision tree algorithms based on the accuracy and classifier error, which drives the conclusion that this is a more suitable classifier for the explored dataset.\",\"PeriodicalId\":254581,\"journal\":{\"name\":\"2020 International Conference on Computer Science and Software Engineering (CSASE)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Computer Science and Software Engineering (CSASE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSASE48920.2020.9142069\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Computer Science and Software Engineering (CSASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSASE48920.2020.9142069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
在墨西哥,高等教育一直受到低就业率和个人对研究生学位兴趣的困扰。墨西哥需要更多的研究生来增加研究和开发活动,促进私营部门的创新,特别是在战略行业。本文建议使用数据挖掘技术来探索校友因素,并了解这些因素是否与校友回国攻读研究生学位有关。分析了从校友调查研究中获得的15个属性;这项调查包含了从蒙特雷理工大学学士学位毕业的12,780名前学生的信息。使用数据挖掘跨行业标准过程(CRISP-DM)方法,并比较机器学习算法、随机森林、J48和REPTree,以确定构建分类模型的最佳方法,该模型可以预测校友是否将攻读研究生学位。本研究使用的数据挖掘工具是Waikato Environment For Knowledge Analysis (WEKA)。结果表明,基于准确率和分类器误差,随机森林算法优于其他决策树算法,从而得出结论,该算法更适合于所探索的数据集。
Analysing Factors That Influence Alumni Graduate Studies Attainment with Decision Trees
In Mexico, higher education is constantly suffering from low percentage of placement and interest of individuals for a graduate degree. Mexico needs more postgraduate students to increase the research and development activities and boost innovation in the private sector, especially in strategic industries. This paper suggests the use of data mining techniques to explore alumni factors and understand if these have a relationship with the alumnus returning to study a postgraduate degree. Fifteen attributes obtained from an alumni survey study were analyzed; this survey contains information from 12,780 former students, which graduated from a bachelor’s degree in Tec de Monterrey. The Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology is used, and the machine learning algorithms, Random Forest, J48 and REPTree are compared to identify the best approach to build a classification model which can predict whether an alumni will study or not a postgraduate degree. For the purpose of this research, the data mining tool used was the Waikato Environment for Knowledge Analysis (WEKA). The resulting model shows that random forest outperforms the other decision tree algorithms based on the accuracy and classifier error, which drives the conclusion that this is a more suitable classifier for the explored dataset.