A Probabilistic Model for Controlling Diversity and Accuracy of Ambiguous Medical Image Segmentation

Proceedings of the 30th ACM International Conference on Multimedia Pub Date : 2022-10-10 DOI:10.1145/3503161.3548115

Wei Zhang, Xiaohong Zhang, Sheng Huang, Yuting Lu, Kun Wang

{"title":"A Probabilistic Model for Controlling Diversity and Accuracy of Ambiguous Medical Image Segmentation","authors":"Wei Zhang, Xiaohong Zhang, Sheng Huang, Yuting Lu, Kun Wang","doi":"10.1145/3503161.3548115","DOIUrl":null,"url":null,"abstract":"Medical image segmentation tasks often have more than one plausible annotation for a given input image due to its inherent ambiguity. Generating multiple plausible predictions for a single image is of interest for medical critical applications. Many methods estimate the distribution of the annotation space by developing probabilistic models to generate multiple hypotheses. However, these methods aim to improve the diversity of predictions at the expense of the more important accuracy. In this paper, we propose a novel probabilistic segmentation model, called Joint Probabilistic U-net, which successfully achieves flexible control over the two abstract conceptions of diversity and accuracy. Specifically, we (i) model the joint distribution of images and annotations to learn a latent space, which is used to decouple diversity and accuracy, and (ii) transform the Gaussian distribution in the latent space to a complex distribution to improve model's expressiveness. In addition, we explore two strategies for preventing the latent space collapse, which are effective in improving the model's performance on datasets with limited annotation. We demonstrate the effectiveness of the proposed model on two medical image datasets, i.e. LIDC-IDRI and ISBI 2016, and achieved state-of-the-art results on several metrics.","PeriodicalId":412792,"journal":{"name":"Proceedings of the 30th ACM International Conference on Multimedia","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 30th ACM International Conference on Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3503161.3548115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Medical image segmentation tasks often have more than one plausible annotation for a given input image due to its inherent ambiguity. Generating multiple plausible predictions for a single image is of interest for medical critical applications. Many methods estimate the distribution of the annotation space by developing probabilistic models to generate multiple hypotheses. However, these methods aim to improve the diversity of predictions at the expense of the more important accuracy. In this paper, we propose a novel probabilistic segmentation model, called Joint Probabilistic U-net, which successfully achieves flexible control over the two abstract conceptions of diversity and accuracy. Specifically, we (i) model the joint distribution of images and annotations to learn a latent space, which is used to decouple diversity and accuracy, and (ii) transform the Gaussian distribution in the latent space to a complex distribution to improve model's expressiveness. In addition, we explore two strategies for preventing the latent space collapse, which are effective in improving the model's performance on datasets with limited annotation. We demonstrate the effectiveness of the proposed model on two medical image datasets, i.e. LIDC-IDRI and ISBI 2016, and achieved state-of-the-art results on several metrics.

查看原文本刊更多论文

一种控制模糊医学图像分割多样性和准确性的概率模型

医学图像分割任务由于其固有的模糊性，通常对给定的输入图像有多个合理的注释。为单个图像生成多个合理的预测是医学关键应用的兴趣。许多方法通过建立概率模型来产生多个假设来估计标注空间的分布。然而，这些方法旨在提高预测的多样性，而牺牲了更重要的准确性。本文提出了一种新的概率分割模型——联合概率U-net，该模型成功地实现了对多样性和准确性两个抽象概念的灵活控制。具体而言，我们(i)对图像和注释的联合分布进行建模，学习潜在空间，用于解耦多样性和准确性;(ii)将潜在空间中的高斯分布转换为复分布，以提高模型的表达性。此外，我们还探索了两种防止潜在空间崩溃的策略，这两种策略有效地提高了模型在有限注释数据集上的性能。我们在两个医学图像数据集(即LIDC-IDRI和ISBI 2016)上证明了所提出模型的有效性，并在几个指标上取得了最先进的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 30th ACM International Conference on Multimedia

自引率

0.00%

发文量