Junxia Wang , Jing Wang , Jun Ma , Baijing Chen , Zeyuan Chen , Yuanjie Zheng
{"title":"CaliDiff: Multi-rater annotation calibrating diffusion probabilistic model towards medical image segmentation","authors":"Junxia Wang , Jing Wang , Jun Ma , Baijing Chen , Zeyuan Chen , Yuanjie Zheng","doi":"10.1016/j.media.2025.103812","DOIUrl":null,"url":null,"abstract":"<div><div>Medical image segmentation is critical for accurate diagnostics and effective treatment planning. Traditional multi-rater labeling strategies, while integrating consensus from multiple experts, often do not fully capture the unique insights of individual raters. Moreover, deep discriminative models that aggregate such expert labels typically embed inherent biases into the segmentation results. To address these issues, we introduce CaliDiff, a novel multi-rater annotation calibration diffusion probabilistic model. This model effectively approximates the joint probability distribution among multiple expert annotations and their corresponding images, fully leveraging diverse expert knowledge while actively refining these annotations to approximate the true underlying distribution closely. CaliDiff operates through a structured multi-stage process: it begins with a shared-parameter inverse diffusion to normalize initial expert biases, followed by Expertness Consistent Alignment to minimize variance among annotations and enhance consistency in high-confidence areas. Additionally, we incorporate a Committee-based Endogenous Knowledge Learning mechanism that uses adversarial soft supervision to simulate a reliable pseudo-ground truth, integrating Cross-Expert Fusion and Implicit Consensus Inference. Extensive experimental evaluations on various medical image segmentation datasets show that CaliDiff not only significantly improves the calibration of annotations but also achieves state-of-the-art performance, thereby enhancing the reliability and objectivity of medical diagnostics.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"107 ","pages":"Article 103812"},"PeriodicalIF":11.8000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525003585","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Medical image segmentation is critical for accurate diagnostics and effective treatment planning. Traditional multi-rater labeling strategies, while integrating consensus from multiple experts, often do not fully capture the unique insights of individual raters. Moreover, deep discriminative models that aggregate such expert labels typically embed inherent biases into the segmentation results. To address these issues, we introduce CaliDiff, a novel multi-rater annotation calibration diffusion probabilistic model. This model effectively approximates the joint probability distribution among multiple expert annotations and their corresponding images, fully leveraging diverse expert knowledge while actively refining these annotations to approximate the true underlying distribution closely. CaliDiff operates through a structured multi-stage process: it begins with a shared-parameter inverse diffusion to normalize initial expert biases, followed by Expertness Consistent Alignment to minimize variance among annotations and enhance consistency in high-confidence areas. Additionally, we incorporate a Committee-based Endogenous Knowledge Learning mechanism that uses adversarial soft supervision to simulate a reliable pseudo-ground truth, integrating Cross-Expert Fusion and Implicit Consensus Inference. Extensive experimental evaluations on various medical image segmentation datasets show that CaliDiff not only significantly improves the calibration of annotations but also achieves state-of-the-art performance, thereby enhancing the reliability and objectivity of medical diagnostics.
期刊介绍:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.