{"title":"具有多数和少数性质的修剪","authors":"Hae Sook Jeon, W. Lee","doi":"10.1109/ICISA.2014.6847450","DOIUrl":null,"url":null,"abstract":"Classification is very imprtant research in knowledge discovery and machine learning. The decision tree is one of the well-known data mining methods. In general, a decision tree can be grown so as to have zero eeor on the training data set. If there is any noise in the data set or it does not completely cover the decision space, then over-fitting occurs and the tree needs to be pruned in order to accurately generalize the test data set. In this paper, we propose a pre-pruning method with majority and minority properties for the decision tree. It uses two kinds of qualifying criteria to consider whether the ration of the highest class of a subtree is the majority of the subtree or a minority of the overall tree. New measures for these are added to the classifier with the extended data expression. Experiments show that a clasifier using this pruning method can improve classification accuracy as well as reduce the size of the tree.","PeriodicalId":117185,"journal":{"name":"2014 International Conference on Information Science & Applications (ICISA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Pruning with Majority and Minority Properties\",\"authors\":\"Hae Sook Jeon, W. Lee\",\"doi\":\"10.1109/ICISA.2014.6847450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classification is very imprtant research in knowledge discovery and machine learning. The decision tree is one of the well-known data mining methods. In general, a decision tree can be grown so as to have zero eeor on the training data set. If there is any noise in the data set or it does not completely cover the decision space, then over-fitting occurs and the tree needs to be pruned in order to accurately generalize the test data set. In this paper, we propose a pre-pruning method with majority and minority properties for the decision tree. It uses two kinds of qualifying criteria to consider whether the ration of the highest class of a subtree is the majority of the subtree or a minority of the overall tree. New measures for these are added to the classifier with the extended data expression. Experiments show that a clasifier using this pruning method can improve classification accuracy as well as reduce the size of the tree.\",\"PeriodicalId\":117185,\"journal\":{\"name\":\"2014 International Conference on Information Science & Applications (ICISA)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Information Science & Applications (ICISA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISA.2014.6847450\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Information Science & Applications (ICISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISA.2014.6847450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification is very imprtant research in knowledge discovery and machine learning. The decision tree is one of the well-known data mining methods. In general, a decision tree can be grown so as to have zero eeor on the training data set. If there is any noise in the data set or it does not completely cover the decision space, then over-fitting occurs and the tree needs to be pruned in order to accurately generalize the test data set. In this paper, we propose a pre-pruning method with majority and minority properties for the decision tree. It uses two kinds of qualifying criteria to consider whether the ration of the highest class of a subtree is the majority of the subtree or a minority of the overall tree. New measures for these are added to the classifier with the extended data expression. Experiments show that a clasifier using this pruning method can improve classification accuracy as well as reduce the size of the tree.