Building a Diverse Ensemble for Classification

A. Aminsharifi, Shima Pouyesh, H. Parvin
{"title":"Building a Diverse Ensemble for Classification","authors":"A. Aminsharifi, Shima Pouyesh, H. Parvin","doi":"10.1109/MICAI.2015.28","DOIUrl":null,"url":null,"abstract":"Pattern recognition systems are widely used in a host of different fields. Due to some reasons such as lack of knowledge about a method based on which the best classifier is detected for any arbitrary problem, and thanks to significant improvement in accuracy, researchers turn to ensemble methods in almost every task of pattern recognition. Classification as a major task in pattern recognition, have been subject to this transition. The classifier ensemble which uses a number of base classifiers is considered as meta-classifier to learn any classification problem in pattern recognition. Although some researchers think they are better than single classifiers, they will not be better if some conditions are not met. The most important condition among them is diversity of base classifiers. Generally in design of multiple classifier systems, the more diverse the results of the classifiers, the more appropriate the aggregated result. It has been shown that the necessary diversity for the ensemble can be achieved by manipulation of dataset features, manipulation of data points in dataset, different sub-samplings of dataset, and usage of different classification algorithms. We also propose a new method of creating this diversity. We use Linear Discriminante Analysis to manipulate the data points in dataset. 
Although the classifier ensemble produced by proposed method may not always outperform all of its base classifiers, it always possesses the diversity needed for creation of an ensemble, and consequently it always outperforms all of its base classifiers on average.","PeriodicalId":448255,"journal":{"name":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2015.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Pattern recognition systems are widely used in a host of different fields. Because no known method can identify the best single classifier for an arbitrary problem, and because ensembles deliver significant improvements in accuracy, researchers have turned to ensemble methods in almost every pattern recognition task. Classification, as a major task in pattern recognition, has been subject to this transition. A classifier ensemble, which combines a number of base classifiers, acts as a meta-classifier for learning any classification problem. Although ensembles are often assumed to be better than single classifiers, they are not unless certain conditions are met; the most important of these is diversity among the base classifiers. In general, when designing multiple classifier systems, the more diverse the individual classifiers' results, the better the aggregated result. It has been shown that the diversity an ensemble needs can be achieved by manipulating the dataset's features, manipulating the data points in the dataset, drawing different sub-samples of the dataset, and using different classification algorithms. We propose a new method of creating this diversity: we use Linear Discriminant Analysis to manipulate the data points in the dataset. Although the classifier ensemble produced by the proposed method may not always outperform every one of its base classifiers, it always possesses the diversity needed to form an ensemble, and consequently it outperforms its base classifiers on average.
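The abstract's core idea, training base classifiers on differently manipulated views of the same data so that their errors decorrelate, can be sketched as follows. This is not the paper's exact algorithm, only a minimal illustration of the general technique: one base learner sees the raw features, another sees an LDA projection of them, and a majority vote aggregates their predictions. The dataset, learners, and hyperparameters here are arbitrary choices for the sketch.

```python
# Hedged sketch of diversity via data manipulation, not the authors' exact method:
# two identical base learners, one trained on raw features and one on an
# LDA-projected view, combined by majority vote.
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import VotingClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Base learner 1: a shallow tree on the raw feature space.
raw_tree = DecisionTreeClassifier(max_depth=3, random_state=0)

# Base learner 2: the same tree, but on LDA-transformed data points;
# the projection changes which decision boundaries the tree can express,
# which is the source of the ensemble's diversity.
lda_tree = make_pipeline(
    LinearDiscriminantAnalysis(n_components=2),
    DecisionTreeClassifier(max_depth=3, random_state=0),
)

# Majority ("hard") voting aggregates the two diverse base classifiers.
ensemble = VotingClassifier(
    estimators=[("raw", raw_tree), ("lda", lda_tree)], voting="hard"
)
ensemble.fit(X_tr, y_tr)
print(f"test accuracy: {ensemble.score(X_te, y_te):.2f}")
```

In this sketch diversity comes solely from the input representation; the paper's point is that such manipulation of the data points is enough to make the averaged ensemble beat its base classifiers, even when no single vote configuration dominates.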