{"title":"The Comparison of C4.5 and CART (Classification and Regression Tree) Algorithm in Classification of Occupation for Fresh Graduate","authors":"Febian Joshua Reynara, Sepriana Carolina, Iustisia Natalia Simbolon","doi":"10.4108/eai.27-11-2021.2315527","DOIUrl":null,"url":null,"abstract":". The problem that college students face is the difficulty of determining the appropriate field of work after they graduate from college. In this study, a classification of the field of work was carried out using the data mining method based on the alumni field of work data. The data on the field of work of alumni contained information such as gender, study program, practical work topics, types of practical work companies, final project topics, and year of graduation. The classification on the field of work carried out was divided into three types of experiments, namely experiments in eight target categories (STQA Engineer, Software and Mobile Application Developer, Web Developer, UI/UX Designer, Software and Business Analyst, Lecturer and Researcher, AI Engineer, DevOps and Cybersecurity Practitioner), three target categories (SQA, Programmer, Data Manager, and Analyst) and two target categories (Programmer and Non-Programmer). The data mining algorithms used to classify were C4.5 and CART (Classification and Regression Tree). The accuracy obtained using the C4.5 algorithm was 42% in the eight categories experiment, 58% in the three categories experiment, and 75% in the two categories experiment. In comparison, the accuracy obtained using the CART algorithm was 43% in the eight categories experiment, 61% in the three categories experiment, and 77% in the two categories experiment. Based on the experimental results, it can be concluded that the best algorithm to classify the fields of work based on alumni data from the two algorithms used is the CART algorithm, even though the difference is not too significant.","PeriodicalId":246168,"journal":{"name":"Proceedings of the 4th International Conference on Vocational Education and Technology, IConVET 2021, 27 November 2021, Singaraja, Bali, Indonesia","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Vocational Education and Technology, IConVET 2021, 27 November 2021, Singaraja, Bali, Indonesia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/eai.27-11-2021.2315527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
. The problem that college students face is the difficulty of determining the appropriate field of work after they graduate from college. In this study, a classification of the field of work was carried out using the data mining method based on the alumni field of work data. The data on the field of work of alumni contained information such as gender, study program, practical work topics, types of practical work companies, final project topics, and year of graduation. The classification on the field of work carried out was divided into three types of experiments, namely experiments in eight target categories (STQA Engineer, Software and Mobile Application Developer, Web Developer, UI/UX Designer, Software and Business Analyst, Lecturer and Researcher, AI Engineer, DevOps and Cybersecurity Practitioner), three target categories (SQA, Programmer, Data Manager, and Analyst) and two target categories (Programmer and Non-Programmer). The data mining algorithms used to classify were C4.5 and CART (Classification and Regression Tree). The accuracy obtained using the C4.5 algorithm was 42% in the eight categories experiment, 58% in the three categories experiment, and 75% in the two categories experiment. In comparison, the accuracy obtained using the CART algorithm was 43% in the eight categories experiment, 61% in the three categories experiment, and 77% in the two categories experiment. Based on the experimental results, it can be concluded that the best algorithm to classify the fields of work based on alumni data from the two algorithms used is the CART algorithm, even though the difference is not too significant.