{"title":"结合决策树学习和聚类的数据分类","authors":"C. Kaewchinporn, N. Vongsuchoto, A. Srisawat","doi":"10.1109/JCSSE.2011.5930148","DOIUrl":null,"url":null,"abstract":"In this paper, we present a new classification algorithm which is a combination of decision tree learning and clustering called Tree Bagging and Weighted Clustering (TBWC). The TBWC algorithm was developed to enhance a classification performance of a clustering algorithm. In the experiments, five datasets were used to evaluate the predictive performance. The experimental results show that the TBWC algorithm yields the highest accuracies when compared with decision tree learning and clustering for all datasets. In addition, this algorithm can improve the predictive performance especially for multi-class datasets which can increase the accuracy up to 36.67%. Finally, it can reduce attributes up to 59.82%.","PeriodicalId":287775,"journal":{"name":"2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"A combination of decision tree learning and clustering for data classification\",\"authors\":\"C. Kaewchinporn, N. Vongsuchoto, A. Srisawat\",\"doi\":\"10.1109/JCSSE.2011.5930148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present a new classification algorithm which is a combination of decision tree learning and clustering called Tree Bagging and Weighted Clustering (TBWC). The TBWC algorithm was developed to enhance a classification performance of a clustering algorithm. In the experiments, five datasets were used to evaluate the predictive performance. The experimental results show that the TBWC algorithm yields the highest accuracies when compared with decision tree learning and clustering for all datasets. In addition, this algorithm can improve the predictive performance especially for multi-class datasets which can increase the accuracy up to 36.67%. Finally, it can reduce attributes up to 59.82%.\",\"PeriodicalId\":287775,\"journal\":{\"name\":\"2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"volume\":\"90 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-05-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JCSSE.2011.5930148\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE.2011.5930148","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
摘要
本文提出了一种将决策树学习与聚类相结合的分类算法,称为tree Bagging and Weighted clustering (TBWC)。TBWC算法是为了提高聚类算法的分类性能而开发的。在实验中,使用了五个数据集来评估预测性能。实验结果表明,与决策树学习和聚类相比,TBWC算法在所有数据集上都具有最高的准确率。此外,该算法还可以提高预测性能,特别是对多类数据集的预测,准确率可达36.67%。最后,该方法可以减少59.82%的属性。
A combination of decision tree learning and clustering for data classification
In this paper, we present a new classification algorithm which is a combination of decision tree learning and clustering called Tree Bagging and Weighted Clustering (TBWC). The TBWC algorithm was developed to enhance a classification performance of a clustering algorithm. In the experiments, five datasets were used to evaluate the predictive performance. The experimental results show that the TBWC algorithm yields the highest accuracies when compared with decision tree learning and clustering for all datasets. In addition, this algorithm can improve the predictive performance especially for multi-class datasets which can increase the accuracy up to 36.67%. Finally, it can reduce attributes up to 59.82%.