Title: Improving Accuracy and Calibration of Deep Image Classifiers With Agreement-Driven Dynamic Ensemble
Authors: Pedro Conde, Rui L. Lopes, Cristiano Premebida
Journal: IEEE Open Journal of the Computer Society, vol. 6, pp. 164-175
DOI: 10.1109/OJCS.2024.3519984
Publication date: 2024-12-18
URL: https://ieeexplore.ieee.org/document/10806808/
PDF: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10806808
Citations: 0
Abstract
One of the biggest challenges in applying Deep Learning (DL) systems to real-world problems is the possibility of failure in critical situations. Strategies to tackle this problem are two-fold: (i) models need to be highly accurate, thereby reducing the risk of failure; (ii) since the risk of error cannot be completely eliminated, models should report the level of uncertainty of each prediction. As such, state-of-the-art DL models should be both accurate and calibrated, meaning that each prediction encodes its confidence/uncertainty in a way that approximates the true likelihood of correctness. Nonetheless, the relevant literature shows that improvements in accuracy and calibration are not usually related. This motivates the development of Agreement-Driven Dynamic Ensemble, a deep ensemble method that, by dynamically combining the advantages of two different ensemble strategies, achieves the highest possible accuracy values while also obtaining substantial improvements in calibration. The merits of the proposed algorithm are shown through a series of representative experiments, leveraging two different neural network architectures and three different datasets against multiple state-of-the-art baselines.
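To make the two notions in the abstract concrete, the sketch below computes the Expected Calibration Error (ECE), a standard calibration metric that bins predictions by confidence and averages the gap between per-bin accuracy and per-bin confidence, and combines ensemble members by simple probability averaging. This is a generic illustration only: the averaging rule, the toy data, and all function names here are assumptions for demonstration, not the paper's Agreement-Driven Dynamic Ensemble algorithm.

```python
import numpy as np

def expected_calibration_error(probs, labels, n_bins=10):
    """Standard ECE: bin samples by predicted confidence and average
    the absolute gap between accuracy and confidence in each bin,
    weighted by the fraction of samples in the bin."""
    confidences = probs.max(axis=1)
    predictions = probs.argmax(axis=1)
    correct = (predictions == labels).astype(float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            ece += mask.mean() * abs(correct[mask].mean()
                                     - confidences[mask].mean())
    return ece

def average_ensemble(member_probs):
    """Combine member softmax outputs by simple averaging -- one common
    deep-ensemble strategy (not necessarily the paper's combination rule)."""
    return np.mean(member_probs, axis=0)

# Toy example (hypothetical data): 3 ensemble members, 4 samples, 3 classes.
rng = np.random.default_rng(0)
member_probs = rng.dirichlet(np.ones(3), size=(3, 4))
labels = np.array([0, 1, 2, 1])
ensemble_probs = average_ensemble(member_probs)
print(expected_calibration_error(ensemble_probs, labels))
```

A perfectly calibrated classifier has ECE 0; the paper's claim that accuracy and calibration improvements "are not usually related" corresponds to the observation that lowering classification error need not lower this metric.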