DBBoost-Enhancing Imbalanced Classification by a Novel Ensemble Based Technique

Chunkai Zhang, Pengfei Jia
{"title":"DBBoost-Enhancing Imbalanced Classification by a Novel Ensemble Based Technique","authors":"Chunkai Zhang, Pengfei Jia","doi":"10.1109/ICMB.2014.45","DOIUrl":null,"url":null,"abstract":"Classification with imbalanced data-sets has become one of the most popular issues in machine learning, since it prevails in various applications. For binary-class problem, the amount of instances from the majority class is significant larger than that from the minority class. Consequently, traditional classifiers achieve a better performance over the majority class, while unsatisfactory predictive accuracy over the minority class. The emergence of ensemble learning provides a possible solution of solving this concern. And there are many recent researches indicate that the combination of Boosting and/or Bagging with pre-processing techniques is an effective way to enhance the classification performance of imbalanced data-sets. Centered on binary-class imbalanced problem, to overcome the drawbacks of state-of-the-art approaches, this paper introduces a novel technique (DBBoost) based on the combination of AdaBoost with an adaptive sampling approach. Through supporting by statistical analysis, experiments show that DBBoost outperforms the state-of-the-art methods based on ensemble.","PeriodicalId":273636,"journal":{"name":"2014 International Conference on Medical Biometrics","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Medical Biometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMB.2014.45","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Classification with imbalanced data-sets has become one of the most popular issues in machine learning, since it prevails in various applications. For binary-class problem, the amount of instances from the majority class is significant larger than that from the minority class. Consequently, traditional classifiers achieve a better performance over the majority class, while unsatisfactory predictive accuracy over the minority class. The emergence of ensemble learning provides a possible solution of solving this concern. And there are many recent researches indicate that the combination of Boosting and/or Bagging with pre-processing techniques is an effective way to enhance the classification performance of imbalanced data-sets. Centered on binary-class imbalanced problem, to overcome the drawbacks of state-of-the-art approaches, this paper introduces a novel technique (DBBoost) based on the combination of AdaBoost with an adaptive sampling approach. Through supporting by statistical analysis, experiments show that DBBoost outperforms the state-of-the-art methods based on ensemble.
基于集成技术的dbboost增强不平衡分类
不平衡数据集的分类已经成为机器学习中最受欢迎的问题之一,因为它在各种应用中都很普遍。对于二元类问题,多数类的实例数量明显大于少数类的实例数量。因此,传统分类器在多数类上实现了更好的性能,而在少数类上实现了不理想的预测精度。集成学习的出现为解决这一问题提供了可能的解决方案。近年来的许多研究表明,将Boosting和Bagging与预处理技术相结合是提高不平衡数据集分类性能的有效途径。针对二类不平衡问题,为了克服现有方法的不足,本文提出了一种基于AdaBoost和自适应采样相结合的新技术(DBBoost)。通过统计分析的支持,实验表明DBBoost优于基于集成的最新方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信