Enhanced Bagging (eBagging): A Novel Approach for Ensemble Learning

The International Arab Journal of Information Technology Pub Date : 2020-07-01 DOI:10.34028/iajit/17/4/10

Goksu Tuysuzoglu, Derya Birant

引用次数: 31

Abstract

Bagging is one of the well-known ensemble learning methods, which combines several classifiers trained on different subsamples of the dataset. However, a drawback of bagging is its random selection, where the classification performance depends on chance to choose a suitable subset of training objects. This paper proposes a novel modified version of bagging, named enhanced Bagging (eBagging), which uses a new mechanism (error-based bootstrapping) when constructing training sets in order to cope with this problem. In the experimental setting, the proposed eBagging technique was tested on 33 well-known benchmark datasets and compared with both bagging, random forest and boosting techniques using well-known classification algorithms: Support Vector Machines (SVM), decision trees (C4.5), k-Nearest Neighbour (kNN) and Naive Bayes (NB). The results show that eBagging outperforms its counterparts by classifying the data points more accurately while reducing the training error.

查看原文本刊更多论文

增强Bagging (eBagging):一种集成学习的新方法

Bagging是一种众所周知的集成学习方法，它结合了在数据集的不同子样本上训练的多个分类器。然而，bagging的一个缺点是它的随机选择，其中分类性能取决于选择合适的训练对象子集的机会。本文提出了一种改进的bagging算法，称为enhanced bagging (eBagging)，它在构造训练集时使用了一种新的机制(基于错误的bootstrapping)来解决这个问题。在实验环境中，提出的eBagging技术在33个知名的基准数据集上进行了测试，并使用知名的分类算法(支持向量机(SVM)、决策树(C4.5)、k近邻(kNN)和朴素贝叶斯(NB))与bagging、随机森林和boosting技术进行了比较。结果表明，eBagging在减少训练误差的同时，对数据点的分类更加准确，优于同类方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The International Arab Journal of Information Technology

自引率

0.00%

发文量