成本敏感情况下不精确分类的Bagging算法

IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS
Serafín Moral-García , Andrés R. Masegosa , Joaquín Abellán
{"title":"成本敏感情况下不精确分类的Bagging算法","authors":"Serafín Moral-García ,&nbsp;Andrés R. Masegosa ,&nbsp;Joaquín Abellán","doi":"10.1016/j.ins.2025.122151","DOIUrl":null,"url":null,"abstract":"<div><div>Classic classification aims to predict the value of a variable under study, also called the class variable, based on an attribute set associated with a given item. When the evidence is not strong enough to determine such a value, a set of values of the class variable probably represents a more informative situation. This is called Imprecise Classification. An important aspect of any classification task is the cost that must be assumed in the case of error. Previous research has shown that the fusion/combination of classifiers tends to obtain better predictive results. In Imprecise Classification, there are very few models capable of efficiently fusing information from multiple classifiers. Concerning Imprecise Classification considering error costs, there is no method for this aim in the literature so far. This work presents the first method capable of fusing imprecise classifiers that take into account error costs. To do this, a procedure representing a midpoint between misclassification risk and informative outputs is used as a basis. Experiments highlight that our proposed fusion procedure shows an improvement in the results over those obtained by other methods of the state-of-the-art.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"712 ","pages":"Article 122151"},"PeriodicalIF":8.1000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Bagging algorithm for imprecise classification in cost-sensitive scenarios\",\"authors\":\"Serafín Moral-García ,&nbsp;Andrés R. Masegosa ,&nbsp;Joaquín Abellán\",\"doi\":\"10.1016/j.ins.2025.122151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Classic classification aims to predict the value of a variable under study, also called the class variable, based on an attribute set associated with a given item. When the evidence is not strong enough to determine such a value, a set of values of the class variable probably represents a more informative situation. This is called Imprecise Classification. An important aspect of any classification task is the cost that must be assumed in the case of error. Previous research has shown that the fusion/combination of classifiers tends to obtain better predictive results. In Imprecise Classification, there are very few models capable of efficiently fusing information from multiple classifiers. Concerning Imprecise Classification considering error costs, there is no method for this aim in the literature so far. This work presents the first method capable of fusing imprecise classifiers that take into account error costs. To do this, a procedure representing a midpoint between misclassification risk and informative outputs is used as a basis. Experiments highlight that our proposed fusion procedure shows an improvement in the results over those obtained by other methods of the state-of-the-art.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"712 \",\"pages\":\"Article 122151\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S002002552500283X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S002002552500283X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

经典分类的目的是根据与给定项目相关联的属性集来预测所研究的变量(也称为类变量)的值。当证据不足以确定这样的值时,类变量的一组值可能代表更有信息的情况。这被称为不精确分类。任何分类任务的一个重要方面是在出现错误的情况下必须承担的代价。先前的研究表明,分类器的融合/组合倾向于获得更好的预测结果。在不精确分类中,很少有模型能够有效地融合来自多个分类器的信息。对于考虑错误代价的不精确分类,目前文献中还没有针对这一目标的方法。这项工作提出了第一种能够融合考虑错误代价的不精确分类器的方法。为此,使用一个表示错误分类风险和信息输出之间的中点的程序作为基础。实验突出表明,我们提出的融合程序显示了较先进的其他方法所获得的结果的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Bagging algorithm for imprecise classification in cost-sensitive scenarios
Classic classification aims to predict the value of a variable under study, also called the class variable, based on an attribute set associated with a given item. When the evidence is not strong enough to determine such a value, a set of values of the class variable probably represents a more informative situation. This is called Imprecise Classification. An important aspect of any classification task is the cost that must be assumed in the case of error. Previous research has shown that the fusion/combination of classifiers tends to obtain better predictive results. In Imprecise Classification, there are very few models capable of efficiently fusing information from multiple classifiers. Concerning Imprecise Classification considering error costs, there is no method for this aim in the literature so far. This work presents the first method capable of fusing imprecise classifiers that take into account error costs. To do this, a procedure representing a midpoint between misclassification risk and informative outputs is used as a basis. Experiments highlight that our proposed fusion procedure shows an improvement in the results over those obtained by other methods of the state-of-the-art.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Sciences
Information Sciences 工程技术-计算机:信息系统
CiteScore
14.00
自引率
17.30%
发文量
1322
审稿时长
10.4 months
期刊介绍: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信