利用模糊欠采样和模糊主成分分析改进旋转森林算法的不平衡分类

M. Hosseinzadeh, M. Eftekhari
{"title":"利用模糊欠采样和模糊主成分分析改进旋转森林算法的不平衡分类","authors":"M. Hosseinzadeh, M. Eftekhari","doi":"10.1109/CSICSSE.2015.7369242","DOIUrl":null,"url":null,"abstract":"This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.","PeriodicalId":115653,"journal":{"name":"2015 International Symposium on Computer Science and Software Engineering (CSSE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Using fuzzy undersampling and fuzzy PCA to improve imbalanced classification through Rotation Forest algorithm\",\"authors\":\"M. Hosseinzadeh, M. Eftekhari\",\"doi\":\"10.1109/CSICSSE.2015.7369242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.\",\"PeriodicalId\":115653,\"journal\":{\"name\":\"2015 International Symposium on Computer Science and Software Engineering (CSSE)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Symposium on Computer Science and Software Engineering (CSSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSICSSE.2015.7369242\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Symposium on Computer Science and Software Engineering (CSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSICSSE.2015.7369242","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

本文提出了一种新的欠采样方法,利用模糊隶属度来降低数据集的不平衡率,并利用新的模糊主成分分析(F-PCA)通过旋转森林算法进行分类。在欠采样阶段,对每个特征(维度)定义前两个隶属函数;一个表示少数概念,另一个表示多数概念。之后,每个数据样本根据其在特征空间的每个维度上的隶属度得到一个分数。大多数得分最高的样本是移除的最佳候选。然后在旋转森林算法的训练阶段,对欠采样阶段产生的样本的模糊化值进行模糊主成分分析(F-PCA)。此外,这些值用于构建集成的基本分类器。所得结果表明,与其他最先进的类不平衡问题算法相比,我们提出的方法效率高,性能显著。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using fuzzy undersampling and fuzzy PCA to improve imbalanced classification through Rotation Forest algorithm
This paper proposed a novel undersampling method to reduce the imbalance ratio of a dataset using fuzzy memberships degrees as well as utilizing a new fuzzy principal components analysis (F-PCA) for the classification through Rotation Forest algorithm. In the undersampling phase, first two membership functions are defined on each feature (dimension); one indicates the minority concept and the other shows majority concept. After that, each data sample receives a score based on its membership degrees in each dimension of the feature space. Majority samples with the highest scores are the best candidates of removal. Then during the Rotation Forest algorithm's train phase, a fuzzy Principal Component Analysis (F-PCA) is applied on the fuzzified values of samples which are produced in the undersampling phase. Moreover, these values are used to build the base classifiers of the ensemble. The obtained results illustrate the efficiency and noteworthy high performance of our proposed method comparing to the other state-of-the-art algorithms for class imbalance problem.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信