PULASki: Learning inter-rater variability using statistical distances to improve probabilistic segmentation

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-05-07 DOI:10.1016/j.media.2025.103623

Soumick Chatterjee , Franziska Gaidzik , Alessandro Sciarra , Hendrik Mattern , Gábor Janiga , Oliver Speck , Andreas Nürnberger , Sahani Pathiraja

{"title":"PULASki: Learning inter-rater variability using statistical distances to improve probabilistic segmentation","authors":"Soumick Chatterjee , Franziska Gaidzik , Alessandro Sciarra , Hendrik Mattern , Gábor Janiga , Oliver Speck , Andreas Nürnberger , Sahani Pathiraja","doi":"10.1016/j.media.2025.103623","DOIUrl":null,"url":null,"abstract":"<div><div>In the domain of medical imaging, many supervised learning based methods for segmentation face several challenges such as high variability in annotations from multiple experts, paucity of labelled data and class imbalanced datasets. These issues may result in segmentations that lack the requisite precision for clinical analysis and can be misleadingly overconfident without associated uncertainty quantification. This work proposes the PULASki method as a computationally efficient generative tool for biomedical image segmentation that accurately captures variability in expert annotations, even in small datasets. This approach makes use of an improved loss function based on statistical distances in a conditional variational autoencoder structure (Probabilistic UNet) , which improves learning of the conditional decoder compared to the standard cross-entropy particularly in class imbalanced problems. The proposed method was analysed for two structurally different segmentation tasks (intracranial vessel and multiple sclerosis (MS) lesion) and compare our results to four well-established baselines in terms of quantitative metrics and qualitative output. These experiments involve class-imbalanced datasets characterised by challenging features, including suboptimal signal-to-noise ratios and high ambiguity. Empirical results demonstrate the PULASKi method outperforms all baselines at the 5% significance level. Our experiments are also of the first to present a comparative study of the computationally feasible segmentation of complex geometries using 3D patches and the traditional use of 2D slices. The generated segmentations are shown to be much more anatomically plausible than in the 2D case, particularly for the vessel task. Our method can also be applied to a wide range of multi-label segmentation tasks and is useful for downstream tasks such as hemodynamic modelling (computational fluid dynamics and data assimilation), clinical decision making, and treatment planning.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"103 ","pages":"Article 103623"},"PeriodicalIF":11.8000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525001707","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In the domain of medical imaging, many supervised learning based methods for segmentation face several challenges such as high variability in annotations from multiple experts, paucity of labelled data and class imbalanced datasets. These issues may result in segmentations that lack the requisite precision for clinical analysis and can be misleadingly overconfident without associated uncertainty quantification. This work proposes the PULASki method as a computationally efficient generative tool for biomedical image segmentation that accurately captures variability in expert annotations, even in small datasets. This approach makes use of an improved loss function based on statistical distances in a conditional variational autoencoder structure (Probabilistic UNet) , which improves learning of the conditional decoder compared to the standard cross-entropy particularly in class imbalanced problems. The proposed method was analysed for two structurally different segmentation tasks (intracranial vessel and multiple sclerosis (MS) lesion) and compare our results to four well-established baselines in terms of quantitative metrics and qualitative output. These experiments involve class-imbalanced datasets characterised by challenging features, including suboptimal signal-to-noise ratios and high ambiguity. Empirical results demonstrate the PULASKi method outperforms all baselines at the 5% significance level. Our experiments are also of the first to present a comparative study of the computationally feasible segmentation of complex geometries using 3D patches and the traditional use of 2D slices. The generated segmentations are shown to be much more anatomically plausible than in the 2D case, particularly for the vessel task. Our method can also be applied to a wide range of multi-label segmentation tasks and is useful for downstream tasks such as hemodynamic modelling (computational fluid dynamics and data assimilation), clinical decision making, and treatment planning.

查看原文本刊更多论文

PULASki：使用统计距离学习评分者之间的可变性来改进概率分割

在医学成像领域，许多基于监督学习的分割方法面临着许多挑战，例如来自多个专家的注释的高度可变性，标记数据的缺乏和类不平衡数据集。这些问题可能导致分割缺乏临床分析所需的精度，并且可能在没有相关不确定性量化的情况下误导性地过度自信。这项工作提出了PULASki方法作为生物医学图像分割的计算高效生成工具，即使在小数据集中也能准确捕获专家注释中的可变性。这种方法利用了条件变分自编码器结构（Probabilistic UNet）中基于统计距离的改进损失函数，与标准交叉熵相比，它提高了条件解码器的学习能力，特别是在类不平衡问题中。我们对两种结构不同的分割任务（颅内血管和多发性硬化症（MS）病变）进行了分析，并将我们的结果与四种已建立的定量指标和定性输出基线进行了比较。这些实验涉及具有挑战性特征的类不平衡数据集，包括次优信噪比和高模糊性。实证结果表明，在5%显著性水平下，PULASKi方法优于所有基线。我们的实验也是第一个对使用3D补丁和传统使用2D切片的复杂几何形状进行计算可行分割的比较研究。生成的分割显示出比二维情况下解剖学上更合理，特别是对于血管任务。我们的方法也可以应用于广泛的多标签分割任务，对下游任务如血流动力学建模（计算流体动力学和数据同化）、临床决策和治疗计划很有用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.