成本敏感情况下不精确分类的Bagging算法

IF 8.1 1区计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Sciences Pub Date : 2025-04-01 DOI:10.1016/j.ins.2025.122151

Serafín Moral-García , Andrés R. Masegosa , Joaquín Abellán

{"title":"成本敏感情况下不精确分类的Bagging算法","authors":"Serafín Moral-García , Andrés R. Masegosa , Joaquín Abellán","doi":"10.1016/j.ins.2025.122151","DOIUrl":null,"url":null,"abstract":"<div><div>Classic classification aims to predict the value of a variable under study, also called the class variable, based on an attribute set associated with a given item. When the evidence is not strong enough to determine such a value, a set of values of the class variable probably represents a more informative situation. This is called Imprecise Classification. An important aspect of any classification task is the cost that must be assumed in the case of error. Previous research has shown that the fusion/combination of classifiers tends to obtain better predictive results. In Imprecise Classification, there are very few models capable of efficiently fusing information from multiple classifiers. Concerning Imprecise Classification considering error costs, there is no method for this aim in the literature so far. This work presents the first method capable of fusing imprecise classifiers that take into account error costs. To do this, a procedure representing a midpoint between misclassification risk and informative outputs is used as a basis. Experiments highlight that our proposed fusion procedure shows an improvement in the results over those obtained by other methods of the state-of-the-art.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"712 ","pages":"Article 122151"},"PeriodicalIF":8.1000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Bagging algorithm for imprecise classification in cost-sensitive scenarios\",\"authors\":\"Serafín Moral-García , Andrés R. Masegosa , Joaquín Abellán\",\"doi\":\"10.1016/j.ins.2025.122151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Classic classification aims to predict the value of a variable under study, also called the class variable, based on an attribute set associated with a given item. When the evidence is not strong enough to determine such a value, a set of values of the class variable probably represents a more informative situation. This is called Imprecise Classification. An important aspect of any classification task is the cost that must be assumed in the case of error. Previous research has shown that the fusion/combination of classifiers tends to obtain better predictive results. In Imprecise Classification, there are very few models capable of efficiently fusing information from multiple classifiers. Concerning Imprecise Classification considering error costs, there is no method for this aim in the literature so far. This work presents the first method capable of fusing imprecise classifiers that take into account error costs. To do this, a procedure representing a midpoint between misclassification risk and informative outputs is used as a basis. Experiments highlight that our proposed fusion procedure shows an improvement in the results over those obtained by other methods of the state-of-the-art.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"712 \",\"pages\":\"Article 122151\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S002002552500283X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S002002552500283X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

经典分类的目的是根据与给定项目相关联的属性集来预测所研究的变量（也称为类变量）的值。当证据不足以确定这样的值时，类变量的一组值可能代表更有信息的情况。这被称为不精确分类。任何分类任务的一个重要方面是在出现错误的情况下必须承担的代价。先前的研究表明，分类器的融合/组合倾向于获得更好的预测结果。在不精确分类中，很少有模型能够有效地融合来自多个分类器的信息。对于考虑错误代价的不精确分类，目前文献中还没有针对这一目标的方法。这项工作提出了第一种能够融合考虑错误代价的不精确分类器的方法。为此，使用一个表示错误分类风险和信息输出之间的中点的程序作为基础。实验突出表明，我们提出的融合程序显示了较先进的其他方法所获得的结果的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Bagging algorithm for imprecise classification in cost-sensitive scenarios

Classic classification aims to predict the value of a variable under study, also called the class variable, based on an attribute set associated with a given item. When the evidence is not strong enough to determine such a value, a set of values of the class variable probably represents a more informative situation. This is called Imprecise Classification. An important aspect of any classification task is the cost that must be assumed in the case of error. Previous research has shown that the fusion/combination of classifiers tends to obtain better predictive results. In Imprecise Classification, there are very few models capable of efficiently fusing information from multiple classifiers. Concerning Imprecise Classification considering error costs, there is no method for this aim in the literature so far. This work presents the first method capable of fusing imprecise classifiers that take into account error costs. To do this, a procedure representing a midpoint between misclassification risk and informative outputs is used as a basis. Experiments highlight that our proposed fusion procedure shows an improvement in the results over those obtained by other methods of the state-of-the-art.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Information Sciences 工程技术-计算机：信息系统

CiteScore

14.00

自引率

17.30%

发文量

1322

审稿时长

10.4 months

期刊介绍： Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.