{"title":"一个整合决策树学习算法和聚类分析的框架","authors":"M. Kurematsu, H. Fujita","doi":"10.1109/SoMeT.2013.6645670","DOIUrl":null,"url":null,"abstract":"We proposed a modified decision tree learning algorithm to improve this algorithm in this paper. Our proposed approach classifies given data set by a traditional decision tree learning algorithm and cluster analysis and selects whichever is better according to information gain. In order to evaluate our approach, we did an experiment using program-generated data sets. We compared ID3 which is one of well-known decision tree learning algorithm to our approach about the recall ratio in this experiment. Experimental result shows the recall ratio of our approach is similar than the recall ratio of a traditional decision tree learning algorithm. Though we can not show the advantage of our approach according to the experiment, we show it is worth using cluster analysis to make a decision tree. In future, we have to evaluate our approach according to cross-validation method using big and complex data sets in order to say the advantage of our approach. We think our approach is not good for all data set, so we try to find the situation which our approach is better than other approaches according to the experimental results. In addition to, we have to show how to explain a decision tree by our approach to keep the readability of a decision tree.","PeriodicalId":447065,"journal":{"name":"2013 IEEE 12th International Conference on Intelligent Software Methodologies, Tools and Techniques (SoMeT)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"A framework for integrating a decision tree learning algorithm and cluster analysis\",\"authors\":\"M. Kurematsu, H. Fujita\",\"doi\":\"10.1109/SoMeT.2013.6645670\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We proposed a modified decision tree learning algorithm to improve this algorithm in this paper. Our proposed approach classifies given data set by a traditional decision tree learning algorithm and cluster analysis and selects whichever is better according to information gain. In order to evaluate our approach, we did an experiment using program-generated data sets. We compared ID3 which is one of well-known decision tree learning algorithm to our approach about the recall ratio in this experiment. Experimental result shows the recall ratio of our approach is similar than the recall ratio of a traditional decision tree learning algorithm. Though we can not show the advantage of our approach according to the experiment, we show it is worth using cluster analysis to make a decision tree. In future, we have to evaluate our approach according to cross-validation method using big and complex data sets in order to say the advantage of our approach. We think our approach is not good for all data set, so we try to find the situation which our approach is better than other approaches according to the experimental results. In addition to, we have to show how to explain a decision tree by our approach to keep the readability of a decision tree.\",\"PeriodicalId\":447065,\"journal\":{\"name\":\"2013 IEEE 12th International Conference on Intelligent Software Methodologies, Tools and Techniques (SoMeT)\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE 12th International Conference on Intelligent Software Methodologies, Tools and Techniques (SoMeT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SoMeT.2013.6645670\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 12th International Conference on Intelligent Software Methodologies, Tools and Techniques (SoMeT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SoMeT.2013.6645670","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A framework for integrating a decision tree learning algorithm and cluster analysis
We proposed a modified decision tree learning algorithm to improve this algorithm in this paper. Our proposed approach classifies given data set by a traditional decision tree learning algorithm and cluster analysis and selects whichever is better according to information gain. In order to evaluate our approach, we did an experiment using program-generated data sets. We compared ID3 which is one of well-known decision tree learning algorithm to our approach about the recall ratio in this experiment. Experimental result shows the recall ratio of our approach is similar than the recall ratio of a traditional decision tree learning algorithm. Though we can not show the advantage of our approach according to the experiment, we show it is worth using cluster analysis to make a decision tree. In future, we have to evaluate our approach according to cross-validation method using big and complex data sets in order to say the advantage of our approach. We think our approach is not good for all data set, so we try to find the situation which our approach is better than other approaches according to the experimental results. In addition to, we have to show how to explain a decision tree by our approach to keep the readability of a decision tree.