MIAC: Mutual-Information Classifier with ADASYN for Imbalanced Classification

Yanyu Cao, Xiaodong Zhao, Zhiping Zhou, Yufei Chen, Xianhui Liu, Yongming Lang
{"title":"MIAC: Mutual-Information Classifier with ADASYN for Imbalanced Classification","authors":"Yanyu Cao, Xiaodong Zhao, Zhiping Zhou, Yufei Chen, Xianhui Liu, Yongming Lang","doi":"10.1109/SPAC46244.2018.8965597","DOIUrl":null,"url":null,"abstract":"currently, classification of imbalanced data is a significant issue in the area of data mining and machine learning because of the imbalance of most of the data set. An effective solution of this problem is Cost-Sensitive Learning (CSL), but when the costs are not given, this method cannot work property. As a Cost-Free Learning (CFL) method, Mutual-Information Classification (MIC) can obtain the optimal classification results when the cost information is not given. But this method emphasizes the data of minority class too much and neglects the accuracy of the classification of majority class. And based on the above, this paper presented a CFL method called Mutual-Information-ADASYN Classification (MIAC). Firstly, we get the abstaining samples which are hard to be classified by using MIC. Then we use these abstention samples to synthesize new instance by using the method of ADASYN. Thirdly, we build Mutual- Information-ADASYN Classification using the new samples. Finally, we use our classifier to get the final results. We evaluated the performance of MIAC on several imbalance binary datasets with different imbalance ratios. The experimental results indicate that the MIAC is more effective than MIC on dealing with imbalanced datasets.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965597","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

currently, classification of imbalanced data is a significant issue in the area of data mining and machine learning because of the imbalance of most of the data set. An effective solution of this problem is Cost-Sensitive Learning (CSL), but when the costs are not given, this method cannot work property. As a Cost-Free Learning (CFL) method, Mutual-Information Classification (MIC) can obtain the optimal classification results when the cost information is not given. But this method emphasizes the data of minority class too much and neglects the accuracy of the classification of majority class. And based on the above, this paper presented a CFL method called Mutual-Information-ADASYN Classification (MIAC). Firstly, we get the abstaining samples which are hard to be classified by using MIC. Then we use these abstention samples to synthesize new instance by using the method of ADASYN. Thirdly, we build Mutual- Information-ADASYN Classification using the new samples. Finally, we use our classifier to get the final results. We evaluated the performance of MIAC on several imbalance binary datasets with different imbalance ratios. The experimental results indicate that the MIAC is more effective than MIC on dealing with imbalanced datasets.
基于ADASYN的非平衡分类互信息分类器
目前,不平衡数据的分类是数据挖掘和机器学习领域的一个重要问题,因为大多数数据集都是不平衡的。成本敏感学习(CSL)是解决这一问题的一种有效方法,但在成本不确定的情况下,该方法不能有效地解决这一问题。互信息分类(MIC)作为一种无成本学习(CFL)方法,可以在成本信息不给定的情况下获得最优的分类结果。但这种方法过于强调少数类的数据,忽略了多数类的分类精度。在此基础上,本文提出了一种基于互信息- adasyn分类(MIAC)的CFL方法。首先,我们得到了难以用MIC进行分类的弃权样本。然后用ADASYN的方法将这些遗漏的样本合成新的实例。第三,利用新样本建立互信息- adasyn分类。最后,我们使用我们的分类器来获得最终结果。我们评估了MIAC在几个不同失衡比例的不平衡二值数据集上的性能。实验结果表明,MIAC比MIC更有效地处理不平衡数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信