{"title":"基于EKCStacking算法的不平衡数据集分类方法","authors":"Qunzhong Liu, W. Luo, Tao Shi","doi":"10.1145/3375998.3376002","DOIUrl":null,"url":null,"abstract":"The processing of imbalanced data sets has always been a hot issue in machine learning. The traditional classification method is to pursue the overall classification accuracy of data sets, and often ignores the classification effect of minority samples. Stacking is a framework algorithm. Based on the Stacking framework, in this paper, we introduce a new oversampling algorithm EKSMOTE and cost-sensitive theory into Stacking, and propose the EKCStacking algorithm. The algorithm uses the EKSMOTE algorithm to reduce imbalanced ratio of data set before data training, and then the Level 1 layer uses a cost-sensitive classifier. The experimental results of the data set in the Keel database show that EKCStacking improves the classification accuracy of minority samples and makes the performance more stable compared with the traditional algorithm.","PeriodicalId":395773,"journal":{"name":"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Classification method for imbalanced data set based on EKCStacking algorithm\",\"authors\":\"Qunzhong Liu, W. Luo, Tao Shi\",\"doi\":\"10.1145/3375998.3376002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The processing of imbalanced data sets has always been a hot issue in machine learning. The traditional classification method is to pursue the overall classification accuracy of data sets, and often ignores the classification effect of minority samples. Stacking is a framework algorithm. Based on the Stacking framework, in this paper, we introduce a new oversampling algorithm EKSMOTE and cost-sensitive theory into Stacking, and propose the EKCStacking algorithm. The algorithm uses the EKSMOTE algorithm to reduce imbalanced ratio of data set before data training, and then the Level 1 layer uses a cost-sensitive classifier. The experimental results of the data set in the Keel database show that EKCStacking improves the classification accuracy of minority samples and makes the performance more stable compared with the traditional algorithm.\",\"PeriodicalId\":395773,\"journal\":{\"name\":\"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3375998.3376002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 8th International Conference on Networks, Communication and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3375998.3376002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification method for imbalanced data set based on EKCStacking algorithm
The processing of imbalanced data sets has always been a hot issue in machine learning. The traditional classification method is to pursue the overall classification accuracy of data sets, and often ignores the classification effect of minority samples. Stacking is a framework algorithm. Based on the Stacking framework, in this paper, we introduce a new oversampling algorithm EKSMOTE and cost-sensitive theory into Stacking, and propose the EKCStacking algorithm. The algorithm uses the EKSMOTE algorithm to reduce imbalanced ratio of data set before data training, and then the Level 1 layer uses a cost-sensitive classifier. The experimental results of the data set in the Keel database show that EKCStacking improves the classification accuracy of minority samples and makes the performance more stable compared with the traditional algorithm.