Inference time correction based on confidence and uncertainty for improved deep-learning model performance and explainability in medical image classification

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Computerized Medical Imaging and Graphics Pub Date : 2025-08-13 DOI:10.1016/j.compmedimag.2025.102630

Joel Jeffrey , Ashwin RajKumar , Sudhanshu Pandey , Lokesh Bathala , Phaneendra K. Yalavarthy

{"title":"Inference time correction based on confidence and uncertainty for improved deep-learning model performance and explainability in medical image classification","authors":"Joel Jeffrey , Ashwin RajKumar , Sudhanshu Pandey , Lokesh Bathala , Phaneendra K. Yalavarthy","doi":"10.1016/j.compmedimag.2025.102630","DOIUrl":null,"url":null,"abstract":"<div><div>The major challenge faced by artificial intelligence (AI) models for medical image analysis is the class imbalance of training data and limited explainability. This study introduces a Confidence and Entropy-based Uncertainty Thresholding Algorithm (CEbUTAl), which is a novel post-processing method, designed to enhance both model performance and explainability. CEbUTAl modifies model predictions during inference, based on uncertainty and confidence measures, to improve classification in scenarios with class imbalance. CEbUTAl’s inference-time correction addresses explainability, while simultaneously improving performance, contrary to the prevailing notion that explainability necessitates a compromise in performance. The algorithm was evaluated across five medical imaging tasks: intracranial hemorrhage detection, optical coherence tomography analysis, breast cancer detection, carpal tunnel syndrome detection, and multi-class skin lesion classification. Results demonstrate that CEbUTAl improves accuracy by approximately 5% and increases sensitivity across multiple deep learning architectures, loss functions, and tasks. Comparative studies indicate that CEbUTAl outperforms state-of-the-art methods in addressing class imbalance and quantifying uncertainty. The model-agnostic, task-agnostic and post-processing nature of CEbUTAl makes it appealing for enhancing both performance and trustworthiness in medical image analysis. This study provides a generalizable approach to mitigate biases arising from class imbalance, while improving the explainability of AI models, thus increasing their utility in clinical practice.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"125 ","pages":"Article 102630"},"PeriodicalIF":4.9000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125001399","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

The major challenge faced by artificial intelligence (AI) models for medical image analysis is the class imbalance of training data and limited explainability. This study introduces a Confidence and Entropy-based Uncertainty Thresholding Algorithm (CEbUTAl), which is a novel post-processing method, designed to enhance both model performance and explainability. CEbUTAl modifies model predictions during inference, based on uncertainty and confidence measures, to improve classification in scenarios with class imbalance. CEbUTAl’s inference-time correction addresses explainability, while simultaneously improving performance, contrary to the prevailing notion that explainability necessitates a compromise in performance. The algorithm was evaluated across five medical imaging tasks: intracranial hemorrhage detection, optical coherence tomography analysis, breast cancer detection, carpal tunnel syndrome detection, and multi-class skin lesion classification. Results demonstrate that CEbUTAl improves accuracy by approximately 5% and increases sensitivity across multiple deep learning architectures, loss functions, and tasks. Comparative studies indicate that CEbUTAl outperforms state-of-the-art methods in addressing class imbalance and quantifying uncertainty. The model-agnostic, task-agnostic and post-processing nature of CEbUTAl makes it appealing for enhancing both performance and trustworthiness in medical image analysis. This study provides a generalizable approach to mitigate biases arising from class imbalance, while improving the explainability of AI models, thus increasing their utility in clinical practice.

查看原文本刊更多论文

基于置信度和不确定性的医学图像分类深度学习模型性能和可解释性的推理时间校正

人工智能（AI）医学图像分析模型面临的主要挑战是训练数据的类别不平衡和有限的可解释性。本文提出了一种基于置信度和熵的不确定性阈值算法（CEbUTAl），这是一种新的后处理方法，旨在提高模型的性能和可解释性。CEbUTAl在推理过程中修改模型预测，基于不确定性和置信度度量，以改进类不平衡场景中的分类。CEbUTAl的推理时间修正解决了可解释性，同时提高了性能，这与普遍认为可解释性必须在性能上做出妥协的观念相反。该算法在五个医学成像任务中进行了评估：颅内出血检测、光学相干断层扫描分析、乳腺癌检测、腕管综合征检测和多类别皮肤病变分类。结果表明，CEbUTAl在多个深度学习架构、损失函数和任务中提高了大约5%的准确性，并提高了灵敏度。比较研究表明，CEbUTAl在解决阶级不平衡和量化不确定性方面优于最先进的方法。CEbUTAl的模型不可知论、任务不可知论和后处理特性使其在提高医学图像分析的性能和可信度方面具有吸引力。本研究提供了一种可推广的方法来减轻因类别不平衡而产生的偏见，同时提高人工智能模型的可解释性，从而提高其在临床实践中的实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computerized Medical Imaging and Graphics 医学-核医学

CiteScore

10.70

自引率

3.50%

发文量

审稿时长

26 days

期刊介绍： The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.