Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2024-09-02 DOI:10.1109/TIP.2024.3448248

Tianshui Chen;Weihang Wang;Tao Pu;Jinghui Qin;Zhijing Yang;Jie Liu;Liang Lin

{"title":"Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration","authors":"Tianshui Chen;Weihang Wang;Tao Pu;Jinghui Qin;Zhijing Yang;Jie Liu;Liang Lin","doi":"10.1109/TIP.2024.3448248","DOIUrl":null,"url":null,"abstract":"Modern visual recognition models often display overconfidence due to their reliance on complex deep neural networks and one-hot target supervision, resulting in unreliable confidence scores that necessitate calibration. While current confidence calibration techniques primarily address single-label scenarios, there is a lack of focus on more practical and generalizable multi-label contexts. This paper introduces the Multi-Label Confidence Calibration (MLCC) task, aiming to provide well-calibrated confidence scores in multi-label scenarios. Unlike single-label images, multi-label images contain multiple objects, leading to semantic confusion and further unreliability in confidence scores. Existing single-label calibration methods, based on label smoothing, fail to account for category correlations, which are crucial for addressing semantic confusion, thereby yielding sub-optimal performance. To overcome these limitations, we propose the Dynamic Correlation Learning and Regularization (DCLR) algorithm, which leverages multi-grained semantic correlations to better model semantic confusion for adaptive regularization. DCLR learns dynamic instance-level and prototype-level similarities specific to each category, using these to measure semantic correlations across different categories. With this understanding, we construct adaptive label vectors that assign higher values to categories with strong correlations, thereby facilitating more effective regularization. We establish an evaluation benchmark, re-implementing several advanced confidence calibration algorithms and applying them to leading multi-label recognition (MLR) models for fair comparison. Through extensive experiments, we demonstrate the superior performance of DCLR over existing methods in providing reliable confidence scores in multi-label scenarios.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"33 ","pages":"4811-4823"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10663362/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Modern visual recognition models often display overconfidence due to their reliance on complex deep neural networks and one-hot target supervision, resulting in unreliable confidence scores that necessitate calibration. While current confidence calibration techniques primarily address single-label scenarios, there is a lack of focus on more practical and generalizable multi-label contexts. This paper introduces the Multi-Label Confidence Calibration (MLCC) task, aiming to provide well-calibrated confidence scores in multi-label scenarios. Unlike single-label images, multi-label images contain multiple objects, leading to semantic confusion and further unreliability in confidence scores. Existing single-label calibration methods, based on label smoothing, fail to account for category correlations, which are crucial for addressing semantic confusion, thereby yielding sub-optimal performance. To overcome these limitations, we propose the Dynamic Correlation Learning and Regularization (DCLR) algorithm, which leverages multi-grained semantic correlations to better model semantic confusion for adaptive regularization. DCLR learns dynamic instance-level and prototype-level similarities specific to each category, using these to measure semantic correlations across different categories. With this understanding, we construct adaptive label vectors that assign higher values to categories with strong correlations, thereby facilitating more effective regularization. We establish an evaluation benchmark, re-implementing several advanced confidence calibration algorithms and applying them to leading multi-label recognition (MLR) models for fair comparison. Through extensive experiments, we demonstrate the superior performance of DCLR over existing methods in providing reliable confidence scores in multi-label scenarios.

查看原文本刊更多论文

多标签信度校准的动态相关性学习和正则化。

现代视觉识别模型由于依赖于复杂的深度神经网络和单击目标监督，常常表现出过度自信，从而导致不可靠的置信度分数，因此有必要进行校准。目前的置信度校准技术主要针对单标签场景，而对于更实用、更通用的多标签场景则缺乏关注。本文介绍了多标签置信度校准（MLCC）任务，旨在为多标签场景提供校准良好的置信度分数。与单标签图像不同，多标签图像包含多个对象，会导致语义混淆，进一步提高置信度分数的不稳定性。现有的基于标签平滑的单标签校准方法未能考虑到类别相关性，而类别相关性对于解决语义混淆问题至关重要，因此无法达到最佳性能。为了克服这些局限性，我们提出了动态相关性学习和正则化（DCLR）算法，该算法利用多粒度语义相关性更好地模拟语义混淆，从而实现自适应正则化。DCLR 学习每个类别特有的动态实例级和原型级相似性，并利用这些相似性来衡量不同类别之间的语义相关性。有了这种理解，我们就能构建自适应标签向量，为具有强相关性的类别分配更高的值，从而促进更有效的正则化。我们建立了一个评估基准，重新实施了几种先进的置信度校准算法，并将它们应用到领先的多标签识别（MLR）模型中进行公平比较。通过大量实验，我们证明了 DCLR 在多标签场景中提供可靠置信度分数方面的性能优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量