A comprehensive ensemble pruning framework based on dual-objective maximization trade-off

IF 3.1 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge and Information Systems Pub Date : 2024-05-10 DOI:10.1007/s10115-024-02125-3

Anitha Gopalakrishnan, J. Martin Leo Manickam

{"title":"A comprehensive ensemble pruning framework based on dual-objective maximization trade-off","authors":"Anitha Gopalakrishnan, J. Martin Leo Manickam","doi":"10.1007/s10115-024-02125-3","DOIUrl":null,"url":null,"abstract":"<p>Ensemble learning has gotten a lot of interest because of its capacity to increase predictive accuracy by merging numerous models. However, redundant data and a high level of computing complexity frequently plague ensembles. To choose a subset of models while maintaining the accuracy and diversity of the ensemble, ensemble pruning techniques are used to address these problems. Accuracy and diversity must coexist, even though their goals are conflicting. This is why we formulate the issue of ensemble pruning as a dual-objective maximization problem using the idea from information theory. Then, we propose a Comprehensive Ensemble Pruning Framework (CEPF) based on the dual-objective maximization (DOM) trade-off metric. Extensive evaluation of our framework on the exclusively collected PhysioSense dataset demonstrates the superiority of our method compared to existing pruning techniques. PhysioSense dataset was collected after getting approval from the Institutional Human Ethics Committee (IHEC) of Panimalar Medical College Hospital and Research Institute, Chennai, Tamil Nadu (Protocol No: PMCHRI-IHEC-059). The proposed framework not only preserves or improves ensemble accuracy and diversity but also achieves a significant reduction in actual ensemble size. Furthermore, the proposed method provides valuable insights into the dual-objective trade-off between accuracy and diversity paving the way for further research and advancements in ensemble pruning techniques.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"46 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge and Information Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10115-024-02125-3","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Ensemble learning has gotten a lot of interest because of its capacity to increase predictive accuracy by merging numerous models. However, redundant data and a high level of computing complexity frequently plague ensembles. To choose a subset of models while maintaining the accuracy and diversity of the ensemble, ensemble pruning techniques are used to address these problems. Accuracy and diversity must coexist, even though their goals are conflicting. This is why we formulate the issue of ensemble pruning as a dual-objective maximization problem using the idea from information theory. Then, we propose a Comprehensive Ensemble Pruning Framework (CEPF) based on the dual-objective maximization (DOM) trade-off metric. Extensive evaluation of our framework on the exclusively collected PhysioSense dataset demonstrates the superiority of our method compared to existing pruning techniques. PhysioSense dataset was collected after getting approval from the Institutional Human Ethics Committee (IHEC) of Panimalar Medical College Hospital and Research Institute, Chennai, Tamil Nadu (Protocol No: PMCHRI-IHEC-059). The proposed framework not only preserves or improves ensemble accuracy and diversity but also achieves a significant reduction in actual ensemble size. Furthermore, the proposed method provides valuable insights into the dual-objective trade-off between accuracy and diversity paving the way for further research and advancements in ensemble pruning techniques.

Abstract Image

查看原文本刊更多论文

基于双目标最大化权衡的综合集合剪枝框架

集合学习因其通过合并众多模型来提高预测准确性的能力而备受关注。然而，冗余数据和高计算复杂度经常困扰着集合学习。为了选择模型子集，同时保持集合的准确性和多样性，集合剪枝技术被用来解决这些问题。准确性和多样性必须共存，尽管它们的目标相互冲突。因此，我们利用信息论的思想，将集合修剪问题表述为一个双目标最大化问题。然后，我们提出了一个基于双目标最大化（DOM）权衡指标的综合集合修剪框架（CEPF）。在专门收集的 PhysioSense 数据集上对我们的框架进行了广泛评估，结果表明我们的方法优于现有的剪枝技术。PhysioSense 数据集是在获得泰米尔纳德邦金奈市 Panimalar 医学院医院和研究所机构人类伦理委员会 (IHEC) 批准后收集的（协议编号：PMCHRI-IHEC-059）。所提出的框架不仅保留或提高了集合的准确性和多样性，还显著减少了实际的集合规模。此外，所提出的方法为准确性和多样性之间的双目标权衡提供了宝贵的见解，为进一步研究和改进集合修剪技术铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge and Information Systems 工程技术-计算机：人工智能

CiteScore

5.70

自引率

7.40%

发文量

152

审稿时长

7.2 months

期刊介绍： Knowledge and Information Systems (KAIS) provides an international forum for researchers and professionals to share their knowledge and report new advances on all topics related to knowledge systems and advanced information systems. This monthly peer-reviewed archival journal publishes state-of-the-art research reports on emerging topics in KAIS, reviews of important techniques in related areas, and application papers of interest to a general readership.