Niloofar Afshari Abolkarlou, A. Niknafs, M. K. Ebrahimpour
{"title":"集成失衡分类:采用数据预处理、聚类算法和遗传算法","authors":"Niloofar Afshari Abolkarlou, A. Niknafs, M. K. Ebrahimpour","doi":"10.1109/ICCKE.2014.6993364","DOIUrl":null,"url":null,"abstract":"One of the most interesting and important issues in the machine learning and data mining research areas is high accuracy classification. Imbalance data is a great challenge. The imbalance data is a kind of situation when the number of one data member class is significantly smaller than the other class. In the recent years this issue has got more attention among many researchers all over the world. In this paper we are going to propose a new algorithm for dealing and classifying the imbalance data. In the first part of the proposed method the SMOTE (Synthetic Minority Oversampling TEchnique) oversampling preprocessing is done for increasing the numbers of minority members of the dataset in order to emphasize them, then the algorithm is run on the 10 binary classes, imbalance KEEL datasets. The experimental results show that the proposed ensemble learning algorithm has better results than some well-known algorithms such as SMOTEBagging and SMOTEBoosting in imbalance data.","PeriodicalId":152540,"journal":{"name":"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Ensemble imbalance classification: Using data preprocessing, clustering algorithm and genetic algorithm\",\"authors\":\"Niloofar Afshari Abolkarlou, A. Niknafs, M. K. Ebrahimpour\",\"doi\":\"10.1109/ICCKE.2014.6993364\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the most interesting and important issues in the machine learning and data mining research areas is high accuracy classification. Imbalance data is a great challenge. The imbalance data is a kind of situation when the number of one data member class is significantly smaller than the other class. In the recent years this issue has got more attention among many researchers all over the world. In this paper we are going to propose a new algorithm for dealing and classifying the imbalance data. In the first part of the proposed method the SMOTE (Synthetic Minority Oversampling TEchnique) oversampling preprocessing is done for increasing the numbers of minority members of the dataset in order to emphasize them, then the algorithm is run on the 10 binary classes, imbalance KEEL datasets. The experimental results show that the proposed ensemble learning algorithm has better results than some well-known algorithms such as SMOTEBagging and SMOTEBoosting in imbalance data.\",\"PeriodicalId\":152540,\"journal\":{\"name\":\"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2014.6993364\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2014.6993364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ensemble imbalance classification: Using data preprocessing, clustering algorithm and genetic algorithm
One of the most interesting and important issues in the machine learning and data mining research areas is high accuracy classification. Imbalance data is a great challenge. The imbalance data is a kind of situation when the number of one data member class is significantly smaller than the other class. In the recent years this issue has got more attention among many researchers all over the world. In this paper we are going to propose a new algorithm for dealing and classifying the imbalance data. In the first part of the proposed method the SMOTE (Synthetic Minority Oversampling TEchnique) oversampling preprocessing is done for increasing the numbers of minority members of the dataset in order to emphasize them, then the algorithm is run on the 10 binary classes, imbalance KEEL datasets. The experimental results show that the proposed ensemble learning algorithm has better results than some well-known algorithms such as SMOTEBagging and SMOTEBoosting in imbalance data.