{"title":"类失衡问题的决策树与贝叶斯网络混合集成","authors":"Pumitara Ruangthong, S. Jaiyen","doi":"10.1109/KST.2016.7440523","DOIUrl":null,"url":null,"abstract":"Class imbalance problem is the main issue causing unsatisfactory outcome in classification. Any type of classification used still cannot improve the result. Therefore, in this research we propose a new hybrid ensemble model based on AdaBoost.M2 and adopt SMOTE algorithm to solve the class imbalance problem in order to predict the probability of term deposit from bank customers. The proposed hybrid ensemble model consist of diverse based classifiers which are Bayesian Network, Alternating Decision Tree, Tree-J48, and REPTree (Reduced-Error Pruning). From the experimental results, the proposed model can achieve the highest performance comparing to normal ensemble models and ensemble models that use majority class reduction, and finally generates the results of 91.5% sensitivity, 100% specificity, and 96.3% accuracy.","PeriodicalId":350687,"journal":{"name":"2016 8th International Conference on Knowledge and Smart Technology (KST)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Hybrid ensembles of decision trees and Bayesian network for class imbalance problem\",\"authors\":\"Pumitara Ruangthong, S. Jaiyen\",\"doi\":\"10.1109/KST.2016.7440523\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Class imbalance problem is the main issue causing unsatisfactory outcome in classification. Any type of classification used still cannot improve the result. Therefore, in this research we propose a new hybrid ensemble model based on AdaBoost.M2 and adopt SMOTE algorithm to solve the class imbalance problem in order to predict the probability of term deposit from bank customers. The proposed hybrid ensemble model consist of diverse based classifiers which are Bayesian Network, Alternating Decision Tree, Tree-J48, and REPTree (Reduced-Error Pruning). From the experimental results, the proposed model can achieve the highest performance comparing to normal ensemble models and ensemble models that use majority class reduction, and finally generates the results of 91.5% sensitivity, 100% specificity, and 96.3% accuracy.\",\"PeriodicalId\":350687,\"journal\":{\"name\":\"2016 8th International Conference on Knowledge and Smart Technology (KST)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 8th International Conference on Knowledge and Smart Technology (KST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KST.2016.7440523\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 8th International Conference on Knowledge and Smart Technology (KST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KST.2016.7440523","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hybrid ensembles of decision trees and Bayesian network for class imbalance problem
Class imbalance problem is the main issue causing unsatisfactory outcome in classification. Any type of classification used still cannot improve the result. Therefore, in this research we propose a new hybrid ensemble model based on AdaBoost.M2 and adopt SMOTE algorithm to solve the class imbalance problem in order to predict the probability of term deposit from bank customers. The proposed hybrid ensemble model consist of diverse based classifiers which are Bayesian Network, Alternating Decision Tree, Tree-J48, and REPTree (Reduced-Error Pruning). From the experimental results, the proposed model can achieve the highest performance comparing to normal ensemble models and ensemble models that use majority class reduction, and finally generates the results of 91.5% sensitivity, 100% specificity, and 96.3% accuracy.