{"title":"利用模糊欠采样和模糊主成分分析改进旋转森林算法的不平衡分类","authors":"M. Hosseinzadeh, M. Eftekhari","doi":"10.1109/CSICSSE.2015.7369242","DOIUrl":null,"url":null,"abstract":"This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.","PeriodicalId":115653,"journal":{"name":"2015 International Symposium on Computer Science and Software Engineering (CSSE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Using fuzzy undersampling and fuzzy PCA to improve imbalanced classification through Rotation Forest algorithm\",\"authors\":\"M. Hosseinzadeh, M. Eftekhari\",\"doi\":\"10.1109/CSICSSE.2015.7369242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.\",\"PeriodicalId\":115653,\"journal\":{\"name\":\"2015 International Symposium on Computer Science and Software Engineering (CSSE)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Symposium on Computer Science and Software Engineering (CSSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSICSSE.2015.7369242\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Symposium on Computer Science and Software Engineering (CSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSICSSE.2015.7369242","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using fuzzy undersampling and fuzzy PCA to improve imbalanced classification through Rotation Forest algorithm
This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.