Anna Sheila I. Crisostomo, R. Encarnacion, Shakir Al Balushi
{"title":"A Data Mining Approach to Construct Classification Model for Predicting Tourism Graduates Employability","authors":"Anna Sheila I. Crisostomo, R. Encarnacion, Shakir Al Balushi","doi":"10.1109/SERA57763.2023.10197814","DOIUrl":null,"url":null,"abstract":"Higher educational institutions’ goal is to ensure that graduates are imbibed with the required employability skills. Hence, by applying the concept of Knowledge Discover in Database (KDD), this study aims to build a graduates’ employment prediction model using classification task in Bayes and Tree Methods. The data utilized for this purpose are collected from the tracer survey conducted to Oman Tourism College alumni. Based on the graduates’ profiles, the generated model predicts whether a graduate is employed full-time, part-time, self-employed or unemployed. Using several classification techniques provided by Waikato Environment for Knowledge Analysis (WEKA), the findings revealed that RandomTree algorithm and REPTree algorithms, under decision tree methods yielded accuracy rates of 96.3636% and 88.1818% respectively. BayesNet algorithm, a variant of Bayes algorithm yielded an accuracy of 84.5455%, ranked third. Information gain and ranker method are also used for attribute ranking which showed occupation as the most influential factor for employability followed by job sector. Other attributes used in classifying the employment status of the graduates include occupation, job sector, specialization, degree, age, personality development skills, cultural competency, leadership, interpersonal skills, creativity, and problem-solving skills. This experiment concludes that a tree-based classifier is the most suitable algorithm for predicting tourism graduates’ employability in the tourism and hospitality sector of the Sultanate of Oman.","PeriodicalId":211080,"journal":{"name":"2023 IEEE/ACIS 21st International Conference on Software Engineering Research, Management and Applications (SERA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/ACIS 21st International Conference on Software Engineering Research, Management and Applications (SERA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SERA57763.2023.10197814","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Higher educational institutions’ goal is to ensure that graduates are imbibed with the required employability skills. Hence, by applying the concept of Knowledge Discover in Database (KDD), this study aims to build a graduates’ employment prediction model using classification task in Bayes and Tree Methods. The data utilized for this purpose are collected from the tracer survey conducted to Oman Tourism College alumni. Based on the graduates’ profiles, the generated model predicts whether a graduate is employed full-time, part-time, self-employed or unemployed. Using several classification techniques provided by Waikato Environment for Knowledge Analysis (WEKA), the findings revealed that RandomTree algorithm and REPTree algorithms, under decision tree methods yielded accuracy rates of 96.3636% and 88.1818% respectively. BayesNet algorithm, a variant of Bayes algorithm yielded an accuracy of 84.5455%, ranked third. Information gain and ranker method are also used for attribute ranking which showed occupation as the most influential factor for employability followed by job sector. Other attributes used in classifying the employment status of the graduates include occupation, job sector, specialization, degree, age, personality development skills, cultural competency, leadership, interpersonal skills, creativity, and problem-solving skills. This experiment concludes that a tree-based classifier is the most suitable algorithm for predicting tourism graduates’ employability in the tourism and hospitality sector of the Sultanate of Oman.
高等教育机构的目标是确保毕业生具备所需的就业技能。因此,本研究运用KDD (Knowledge Discover in Database)的概念,利用贝叶斯分类任务和树方法构建毕业生就业预测模型。用于此目的的数据是从对阿曼旅游学院校友进行的示踪剂调查中收集的。根据毕业生的个人资料,生成的模型预测毕业生是全职、兼职、自雇还是失业。使用Waikato Environment for Knowledge Analysis (WEKA)提供的几种分类技术,研究结果表明,在决策树方法下,RandomTree算法和REPTree算法的准确率分别为96.3636%和88.1818%。BayesNet算法是Bayes算法的一种变体,准确率为84.5455%,排名第三。利用信息增益法和排名法进行属性排序,结果显示职业是影响就业能力的最主要因素,其次是工作部门。用于分类毕业生就业状况的其他属性包括职业、工作部门、专业、学位、年龄、个性发展技能、文化能力、领导能力、人际交往能力、创造力和解决问题的能力。本实验得出结论,基于树的分类器是预测阿曼苏丹国旅游和酒店业旅游毕业生就业能力的最合适算法。