带拒绝选项的分类：通过适形预测保证无分布错误

IF 4.9

Machine learning with applications Pub Date : 2025-05-09 DOI:10.1016/j.mlwa.2025.100664

Johan Hallberg Szabadváry , Tuwe Löfström , Ulf Johansson , Cecilia Sönströd , Ernst Ahlberg , Lars Carlsson

{"title":"带拒绝选项的分类：通过适形预测保证无分布错误","authors":"Johan Hallberg Szabadváry , Tuwe Löfström , Ulf Johansson , Cecilia Sönströd , Ernst Ahlberg , Lars Carlsson","doi":"10.1016/j.mlwa.2025.100664","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning (ML) models always make a prediction, even when they are likely to be wrong. This causes problems in practical applications, as we do not know if we should trust a prediction. ML with reject option addresses this issue by abstaining from making a prediction if it is likely to be incorrect. In this work, we formalise the approach to ML with reject option in binary classification, deriving theoretical guarantees on the resulting error rate. This is achieved through conformal prediction (CP), which produce prediction sets with distribution-free validity guarantees. In binary classification, CP can output prediction sets containing exactly one, two or no labels. By accepting only the singleton predictions, we turn CP into a binary classifier with reject option.</div><div>Here, CP is formally put in the framework of predicting with reject option. We state and prove the resulting error rate, and give finite sample estimates. Numerical examples provide illustrations of derived error rate through several different conformal prediction settings, ranging from full conformal prediction to offline batch inductive conformal prediction. The former has a direct link to sharp validity guarantees, whereas the latter is more fuzzy in terms of validity guarantees but can be used in practice. Error-reject curves illustrate the trade-off between error rate and reject rate, and can serve to aid a user to set an acceptable error rate or reject rate in practice.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"20 ","pages":"Article 100664"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification with reject option: Distribution-free error guarantees via conformal prediction\",\"authors\":\"Johan Hallberg Szabadváry , Tuwe Löfström , Ulf Johansson , Cecilia Sönströd , Ernst Ahlberg , Lars Carlsson\",\"doi\":\"10.1016/j.mlwa.2025.100664\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Machine learning (ML) models always make a prediction, even when they are likely to be wrong. This causes problems in practical applications, as we do not know if we should trust a prediction. ML with reject option addresses this issue by abstaining from making a prediction if it is likely to be incorrect. In this work, we formalise the approach to ML with reject option in binary classification, deriving theoretical guarantees on the resulting error rate. This is achieved through conformal prediction (CP), which produce prediction sets with distribution-free validity guarantees. In binary classification, CP can output prediction sets containing exactly one, two or no labels. By accepting only the singleton predictions, we turn CP into a binary classifier with reject option.</div><div>Here, CP is formally put in the framework of predicting with reject option. We state and prove the resulting error rate, and give finite sample estimates. Numerical examples provide illustrations of derived error rate through several different conformal prediction settings, ranging from full conformal prediction to offline batch inductive conformal prediction. The former has a direct link to sharp validity guarantees, whereas the latter is more fuzzy in terms of validity guarantees but can be used in practice. Error-reject curves illustrate the trade-off between error rate and reject rate, and can serve to aid a user to set an acceptable error rate or reject rate in practice.</div></div>\",\"PeriodicalId\":74093,\"journal\":{\"name\":\"Machine learning with applications\",\"volume\":\"20 \",\"pages\":\"Article 100664\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Machine learning with applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666827025000477\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000477","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

机器学习（ML）模型总是会做出预测，即使它们可能是错误的。这在实际应用中造成了问题，因为我们不知道我们是否应该相信预测。带有拒绝选项的ML通过避免做出可能不正确的预测来解决这个问题。在这项工作中，我们形式化了二元分类中带有拒绝选项的机器学习方法，对产生的错误率给出了理论保证。这是通过共形预测（CP）实现的，它产生具有无分布有效性保证的预测集。在二元分类中，CP可以输出恰好包含一个、两个或没有标签的预测集。通过只接受单例预测，我们将CP转换为具有拒绝选项的二元分类器。在这里，CP被形式化地放在带有拒绝权的预测框架中。我们陈述并证明了结果错误率，并给出了有限样本估计。数值示例提供了通过几种不同的共形预测设置（从完全共形预测到离线批量归纳共形预测）推导出的错误率的说明。前者与明确的效度保证有直接联系，而后者在效度保证方面较为模糊，但可以在实践中使用。错误拒绝曲线说明了错误率和拒绝率之间的权衡，可以帮助用户在实践中设置可接受的错误率或拒绝率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Classification with reject option: Distribution-free error guarantees via conformal prediction

查看原文本刊更多论文

Classification with reject option: Distribution-free error guarantees via conformal prediction

Machine learning (ML) models always make a prediction, even when they are likely to be wrong. This causes problems in practical applications, as we do not know if we should trust a prediction. ML with reject option addresses this issue by abstaining from making a prediction if it is likely to be incorrect. In this work, we formalise the approach to ML with reject option in binary classification, deriving theoretical guarantees on the resulting error rate. This is achieved through conformal prediction (CP), which produce prediction sets with distribution-free validity guarantees. In binary classification, CP can output prediction sets containing exactly one, two or no labels. By accepting only the singleton predictions, we turn CP into a binary classifier with reject option.

Here, CP is formally put in the framework of predicting with reject option. We state and prove the resulting error rate, and give finite sample estimates. Numerical examples provide illustrations of derived error rate through several different conformal prediction settings, ranging from full conformal prediction to offline batch inductive conformal prediction. The former has a direct link to sharp validity guarantees, whereas the latter is more fuzzy in terms of validity guarantees but can be used in practice. Error-reject curves illustrate the trade-off between error rate and reject rate, and can serve to aid a user to set an acceptable error rate or reject rate in practice.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Machine learning with applications Management Science and Operations Research, Artificial Intelligence, Computer Science Applications

自引率

0.00%

发文量

审稿时长

98 days