{"title":"An approach to development of an ensemble classification system","authors":"Shampa Sengupta, A. Das","doi":"10.1109/ICRCICN.2016.7813659","DOIUrl":null,"url":null,"abstract":"Generally, information system handles huge volume of dataset. Classifiers provide poor performance when such dataset are feed into it for categorization due to their high dimension. The most important attributes are extracted from the dataset prior to classification for efficient classifier design. It is also true that there may be many classifiers of a particular system, some provide better accuracy than others but selection of single classifier with all its optimized parameters is always not a good choice for various real world applications. The paper proposes a novel method for construction of an ensemble classifier by combining multiple classifiers obtained using Rough Set Theory and Genetic algorithm. The method selects the classifiers for integration based on accuracy and develops an efficient and effective ensemble classification system. In the first phase of the work, rule based classifiers termed as base classifiers are constructed from the reduced information sub systems obtained using Rough Set Theory, where a set of high quality rules are generated for each sub system. In the second phase, base classifiers are combined and an optimal ensemble classification system is developed using Genetic Algorithm. Here, ensemble classifier takes an important role to discover the class labels of the test objects with higher accuracy. The proposed algorithm has been run on standard benchmark dataset collected from the UCI repository.","PeriodicalId":254393,"journal":{"name":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2016.7813659","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Generally, information system handles huge volume of dataset. Classifiers provide poor performance when such dataset are feed into it for categorization due to their high dimension. The most important attributes are extracted from the dataset prior to classification for efficient classifier design. It is also true that there may be many classifiers of a particular system, some provide better accuracy than others but selection of single classifier with all its optimized parameters is always not a good choice for various real world applications. The paper proposes a novel method for construction of an ensemble classifier by combining multiple classifiers obtained using Rough Set Theory and Genetic algorithm. The method selects the classifiers for integration based on accuracy and develops an efficient and effective ensemble classification system. In the first phase of the work, rule based classifiers termed as base classifiers are constructed from the reduced information sub systems obtained using Rough Set Theory, where a set of high quality rules are generated for each sub system. In the second phase, base classifiers are combined and an optimal ensemble classification system is developed using Genetic Algorithm. Here, ensemble classifier takes an important role to discover the class labels of the test objects with higher accuracy. The proposed algorithm has been run on standard benchmark dataset collected from the UCI repository.