Daniela Alejandra Gomez Cravioto, Ramon Eduardo Diaz Ramos, M. Galaz, N. H. Gress, Héctor Gibrán Ceballos Cancino
Title: Analysing Factors That Influence Alumni Graduate Studies Attainment with Decision Trees
DOI: 10.1109/CSASE48920.2020.9142069 (https://doi.org/10.1109/CSASE48920.2020.9142069)
Published in: 2020 International Conference on Computer Science and Software Engineering (CSASE)
Publication date: 2020-04-01
Citations: 2
Abstract
In Mexico, higher education consistently suffers from low graduate enrollment and low interest among individuals in pursuing a graduate degree. Mexico needs more postgraduate students to increase research and development activity and to boost innovation in the private sector, especially in strategic industries. This paper proposes using data mining techniques to explore alumni factors and to determine whether they are related to an alumnus returning to study for a postgraduate degree. Fifteen attributes obtained from an alumni survey were analyzed; the survey contains information on 12,780 former students who graduated with a bachelor's degree from Tec de Monterrey. The Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology is used, and three machine learning algorithms (Random Forest, J48, and REPTree) are compared to identify the best approach for building a classification model that predicts whether an alumnus will pursue a postgraduate degree. The data mining tool used for this research was the Waikato Environment for Knowledge Analysis (WEKA). The results show that Random Forest outperforms the other decision tree algorithms in terms of accuracy and classifier error, which leads to the conclusion that it is the more suitable classifier for the explored dataset.
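The comparison criterion mentioned in the abstract (accuracy and classifier error) can be illustrated with a minimal sketch. This is not the authors' WEKA workflow: the function names `accuracy` and `rank_classifiers`, the placeholder names `classifier_a` to `classifier_c`, and the ten toy labels are all hypothetical and carry no relation to the paper's data or results. The sketch only assumes that classifier error is taken as one minus accuracy.

```python
from typing import Dict, List, Tuple


def accuracy(y_true: List[int], y_pred: List[int]) -> float:
    """Fraction of instances whose predicted label matches the true label."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def rank_classifiers(
    y_true: List[int], predictions: Dict[str, List[int]]
) -> List[Tuple[str, float, float]]:
    """Return (name, accuracy, error) tuples sorted by accuracy, best first.

    Error is assumed here to be 1 - accuracy (misclassification rate).
    """
    rows = []
    for name, y_pred in predictions.items():
        acc = accuracy(y_true, y_pred)
        rows.append((name, acc, 1.0 - acc))
    return sorted(rows, key=lambda row: row[1], reverse=True)


# Hypothetical toy labels, NOT from the paper:
# 1 = alumnus pursued a postgraduate degree, 0 = did not.
y_true = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0]

# Hypothetical predictions from three placeholder classifiers.
predictions = {
    "classifier_a": [1, 0, 0, 1, 0, 1, 0, 0, 0, 0],  # 9 of 10 correct
    "classifier_b": [1, 0, 1, 1, 0, 0, 0, 0, 0, 0],  # 7 of 10 correct
    "classifier_c": [0, 0, 0, 1, 0, 1, 1, 0, 1, 0],  # 8 of 10 correct
}

ranking = rank_classifiers(y_true, predictions)

if __name__ == "__main__":
    for name, acc, err in ranking:
        print(f"{name}: accuracy={acc:.2f} error={err:.2f}")
```

In practice the predictions would come from held-out or cross-validated evaluation of each trained model rather than from hand-written lists; the ranking step itself stays the same.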