用于比较多个有序分类器的成本敏感性能度量

N. George, T. Lu, Ching-Wei Chang
{"title":"用于比较多个有序分类器的成本敏感性能度量","authors":"N. George, T. Lu, Ching-Wei Chang","doi":"10.5430/air.v5n1p135","DOIUrl":null,"url":null,"abstract":"The surge of interest in personalized and precision medicine during recent years has increased the application of ordinal classification problems in biomedical science. Currently, accuracy, Kendall's τb , and average mean absolute error are three commonly used metrics for evaluating the effectiveness of an ordinal classifier. Although there are benefits to each, no single metric considers the benefits of predictive accuracy with the tradeoffs of misclassification cost. In addition, decision analysis that considers pairwise analysis of the metrics is not trivial due to inconsistent findings. A new cost-sensitive metric is proposed to find the optimal tradeoff between the two most critical performance measures of a classification task - accuracy and cost. The proposed method accounts for an inherent ordinal data structure, total misclassification cost of a classifier, and imbalanced class distribution. The strengths of the new methodology are demonstrated through analyses of three real cancer datasets and four simulation studies. The new cost-sensitive metric proved better performance in its ability to identify the best ordinal classifier for a given analysis. The performance metric devised in this study provides a comprehensive tool for comparative analysis of multiple (and competing) ordinal classifiers. Consideration of the tradeoff between accuracy and misclassification cost in decisions regarding ordinal classification problems is imperative in real-world application. The work presented here is a precursor to the possibility of incorporating the proposed metric into a prediction modeling algorithm for ordinal data as a means of integrating misclassification cost in final model selection.","PeriodicalId":91658,"journal":{"name":"Artificial intelligence research","volume":"5 1 1","pages":"135-143"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.5430/air.v5n1p135","citationCount":"11","resultStr":"{\"title\":\"Cost-sensitive performance metric for comparing multiple ordinal classifiers\",\"authors\":\"N. George, T. Lu, Ching-Wei Chang\",\"doi\":\"10.5430/air.v5n1p135\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The surge of interest in personalized and precision medicine during recent years has increased the application of ordinal classification problems in biomedical science. Currently, accuracy, Kendall's τb , and average mean absolute error are three commonly used metrics for evaluating the effectiveness of an ordinal classifier. Although there are benefits to each, no single metric considers the benefits of predictive accuracy with the tradeoffs of misclassification cost. In addition, decision analysis that considers pairwise analysis of the metrics is not trivial due to inconsistent findings. A new cost-sensitive metric is proposed to find the optimal tradeoff between the two most critical performance measures of a classification task - accuracy and cost. The proposed method accounts for an inherent ordinal data structure, total misclassification cost of a classifier, and imbalanced class distribution. The strengths of the new methodology are demonstrated through analyses of three real cancer datasets and four simulation studies. The new cost-sensitive metric proved better performance in its ability to identify the best ordinal classifier for a given analysis. The performance metric devised in this study provides a comprehensive tool for comparative analysis of multiple (and competing) ordinal classifiers. Consideration of the tradeoff between accuracy and misclassification cost in decisions regarding ordinal classification problems is imperative in real-world application. The work presented here is a precursor to the possibility of incorporating the proposed metric into a prediction modeling algorithm for ordinal data as a means of integrating misclassification cost in final model selection.\",\"PeriodicalId\":91658,\"journal\":{\"name\":\"Artificial intelligence research\",\"volume\":\"5 1 1\",\"pages\":\"135-143\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.5430/air.v5n1p135\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5430/air.v5n1p135\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5430/air.v5n1p135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

摘要

近年来,人们对个性化和精准医疗的兴趣激增,增加了有序分类问题在生物医学科学中的应用。目前,准确率、肯德尔τb和平均绝对误差是评估有序分类器有效性的三个常用指标。虽然每一种方法都有好处,但没有一种度量标准考虑到预测准确性的好处与错误分类成本的权衡。此外,考虑对指标进行两两分析的决策分析也不是微不足道的,因为结果不一致。提出了一种新的成本敏感度量,用于在分类任务的两个最关键的性能度量-准确率和成本之间找到最佳权衡。该方法考虑了固有的有序数据结构、分类器的总误分类代价和类分布不平衡等问题。通过对三个真实癌症数据集和四个模拟研究的分析,证明了新方法的优势。对于给定的分析,新的成本敏感度量在识别最佳有序分类器的能力方面证明了更好的性能。本研究设计的性能指标为多个(和竞争的)有序分类器的比较分析提供了一个全面的工具。在实际应用中,在有序分类问题的决策中考虑准确率和误分类代价之间的权衡是必要的。这里提出的工作是将所提出的度量纳入有序数据的预测建模算法的可能性的先驱,作为在最终模型选择中整合错误分类成本的手段。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Cost-sensitive performance metric for comparing multiple ordinal classifiers
The surge of interest in personalized and precision medicine during recent years has increased the application of ordinal classification problems in biomedical science. Currently, accuracy, Kendall's τb , and average mean absolute error are three commonly used metrics for evaluating the effectiveness of an ordinal classifier. Although there are benefits to each, no single metric considers the benefits of predictive accuracy with the tradeoffs of misclassification cost. In addition, decision analysis that considers pairwise analysis of the metrics is not trivial due to inconsistent findings. A new cost-sensitive metric is proposed to find the optimal tradeoff between the two most critical performance measures of a classification task - accuracy and cost. The proposed method accounts for an inherent ordinal data structure, total misclassification cost of a classifier, and imbalanced class distribution. The strengths of the new methodology are demonstrated through analyses of three real cancer datasets and four simulation studies. The new cost-sensitive metric proved better performance in its ability to identify the best ordinal classifier for a given analysis. The performance metric devised in this study provides a comprehensive tool for comparative analysis of multiple (and competing) ordinal classifiers. Consideration of the tradeoff between accuracy and misclassification cost in decisions regarding ordinal classification problems is imperative in real-world application. The work presented here is a precursor to the possibility of incorporating the proposed metric into a prediction modeling algorithm for ordinal data as a means of integrating misclassification cost in final model selection.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信