An approach to development of an ensemble classification system

2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN) Pub Date : 2016-09-01 DOI:10.1109/ICRCICN.2016.7813659

Shampa Sengupta, A. Das

{"title":"An approach to development of an ensemble classification system","authors":"Shampa Sengupta, A. Das","doi":"10.1109/ICRCICN.2016.7813659","DOIUrl":null,"url":null,"abstract":"Generally, information system handles huge volume of dataset. Classifiers provide poor performance when such dataset are feed into it for categorization due to their high dimension. The most important attributes are extracted from the dataset prior to classification for efficient classifier design. It is also true that there may be many classifiers of a particular system, some provide better accuracy than others but selection of single classifier with all its optimized parameters is always not a good choice for various real world applications. The paper proposes a novel method for construction of an ensemble classifier by combining multiple classifiers obtained using Rough Set Theory and Genetic algorithm. The method selects the classifiers for integration based on accuracy and develops an efficient and effective ensemble classification system. In the first phase of the work, rule based classifiers termed as base classifiers are constructed from the reduced information sub systems obtained using Rough Set Theory, where a set of high quality rules are generated for each sub system. In the second phase, base classifiers are combined and an optimal ensemble classification system is developed using Genetic Algorithm. Here, ensemble classifier takes an important role to discover the class labels of the test objects with higher accuracy. The proposed algorithm has been run on standard benchmark dataset collected from the UCI repository.","PeriodicalId":254393,"journal":{"name":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2016.7813659","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Generally, information system handles huge volume of dataset. Classifiers provide poor performance when such dataset are feed into it for categorization due to their high dimension. The most important attributes are extracted from the dataset prior to classification for efficient classifier design. It is also true that there may be many classifiers of a particular system, some provide better accuracy than others but selection of single classifier with all its optimized parameters is always not a good choice for various real world applications. The paper proposes a novel method for construction of an ensemble classifier by combining multiple classifiers obtained using Rough Set Theory and Genetic algorithm. The method selects the classifiers for integration based on accuracy and develops an efficient and effective ensemble classification system. In the first phase of the work, rule based classifiers termed as base classifiers are constructed from the reduced information sub systems obtained using Rough Set Theory, where a set of high quality rules are generated for each sub system. In the second phase, base classifiers are combined and an optimal ensemble classification system is developed using Genetic Algorithm. Here, ensemble classifier takes an important role to discover the class labels of the test objects with higher accuracy. The proposed algorithm has been run on standard benchmark dataset collected from the UCI repository.

查看原文本刊更多论文

集成分类系统的一种开发方法

通常，信息系统需要处理大量的数据集。当将此类数据集输入分类器进行分类时，由于其高维，分类器的性能很差。在分类之前从数据集中提取最重要的属性，以便有效地设计分类器。一个特定的系统可能有许多分类器，有些分类器比其他分类器提供更好的精度，但对于各种现实世界的应用，选择具有所有优化参数的单个分类器总是不是一个好的选择。提出了一种结合粗糙集理论和遗传算法得到的多个分类器构造集成分类器的新方法。该方法基于准确率选择分类器进行集成，开发了一个高效、有效的集成分类系统。在工作的第一阶段，基于规则的分类器(称为基分类器)是从使用粗糙集理论获得的约简信息子系统构建的，其中为每个子系统生成一组高质量的规则。第二阶段，结合基分类器，利用遗传算法构建最优集成分类系统。在这里，集成分类器在以更高的准确率发现测试对象的类标签方面起着重要的作用。该算法已在从UCI存储库收集的标准基准数据集上运行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)

自引率

0.00%

发文量