Weakly supervised commissioning of externally developed auto-segmentation models and applied to male pelvis MR auto-segmentation

IF 2.7 3区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Physica Medica-European Journal of Medical Physics Pub Date : 2025-08-01 DOI:10.1016/j.ejmp.2025.105057

Bastiaan W.K. Schipaanboord , Peter J. Koopmans , Erik van der Bijl , Charlotte L. Brouwer , Tomas Janssen

{"title":"Weakly supervised commissioning of externally developed auto-segmentation models and applied to male pelvis MR auto-segmentation","authors":"Bastiaan W.K. Schipaanboord , Peter J. Koopmans , Erik van der Bijl , Charlotte L. Brouwer , Tomas Janssen","doi":"10.1016/j.ejmp.2025.105057","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction:</h3><div>When introducing an auto-segmentation model into clinical practice, assessing the quality of the predicted segmentations and the robustness of the model over a wide range of anatomical variation and/or image quality is difficult. Especially, when the model is provided by an external party and the institution introducing the model does not possess a high-quality dataset to commission the model on.</div></div><div><h3>Materials & Methods:</h3><div>Assuming that a model is more likely to fail for an atypical case as opposed to a more average one, we propose a methodology that selects cases for commissioning using unsupervised anomaly detection. For this, the model supplier provides a set of image/shape features that correlate with model performance on the training data. Next, the receiving hospital can use these features to train an unsupervised anomaly detector on a large dataset of unlabeled cases and use the anomaly scores to select representative cases for commissioning of the model. Since the anomaly detector is trained on unlabeled data, a large, high-quality, curated dataset is not required on the receiving hospital side.</div></div><div><h3>Results:</h3><div>Using the proposed approach, the likelihood of selecting atypical edge cases with low segmentation performance was increased, as compared to random selection. For a selection of 20 cases, an increase of 22% was observed.</div></div><div><h3>Conclusions:</h3><div>The increased performance spread provides a more representative range of expected performance in clinical practice. This approach could be used for model commissioning to increase the confidence that the model performs well over a wide range of expected anatomical variation.</div></div>","PeriodicalId":56092,"journal":{"name":"Physica Medica-European Journal of Medical Physics","volume":"136 ","pages":"Article 105057"},"PeriodicalIF":2.7000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physica Medica-European Journal of Medical Physics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S112017972500167X","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction:

When introducing an auto-segmentation model into clinical practice, assessing the quality of the predicted segmentations and the robustness of the model over a wide range of anatomical variation and/or image quality is difficult. Especially, when the model is provided by an external party and the institution introducing the model does not possess a high-quality dataset to commission the model on.

Materials & Methods:

Assuming that a model is more likely to fail for an atypical case as opposed to a more average one, we propose a methodology that selects cases for commissioning using unsupervised anomaly detection. For this, the model supplier provides a set of image/shape features that correlate with model performance on the training data. Next, the receiving hospital can use these features to train an unsupervised anomaly detector on a large dataset of unlabeled cases and use the anomaly scores to select representative cases for commissioning of the model. Since the anomaly detector is trained on unlabeled data, a large, high-quality, curated dataset is not required on the receiving hospital side.

Results:

Using the proposed approach, the likelihood of selecting atypical edge cases with low segmentation performance was increased, as compared to random selection. For a selection of 20 cases, an increase of 22% was observed.

Conclusions:

The increased performance spread provides a more representative range of expected performance in clinical practice. This approach could be used for model commissioning to increase the confidence that the model performs well over a wide range of expected anatomical variation.

查看原文本刊更多论文

弱监督调试外部开发的自动分割模型，并应用于男性骨盆MR自动分割。

当将自动分割模型引入临床实践时，评估预测分割的质量和模型在大范围解剖变异和/或图像质量上的鲁棒性是困难的。特别是当模型是由外部方提供的，而引入模型的机构没有高质量的数据集来委托模型时。材料和方法：假设模型在非典型情况下比在一般情况下更有可能失败，我们提出了一种方法，该方法使用无监督异常检测来选择进行调试的情况。为此，模型供应商提供了一组与训练数据上的模型性能相关的图像/形状特征。接下来，接收医院可以使用这些特征在未标记病例的大型数据集上训练无监督异常检测器，并使用异常分数选择具有代表性的病例进行模型调试。由于异常检测器是在未标记的数据上进行训练的，因此接收医院方面不需要一个大的、高质量的、精心策划的数据集。结果：与随机选择相比，使用所提出的方法，选择具有低分割性能的非典型边缘案例的可能性增加。选择20个病例，观察到增加了22%。结论：在临床实践中，增加的表现范围提供了更有代表性的预期表现范围。这种方法可用于模型调试，以增加模型在广泛的预期解剖变异范围内表现良好的信心。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Physica Medica-European Journal of Medical Physics 生物-生物物理

CiteScore

6.80

自引率

14.70%

发文量

493

审稿时长

78 days

期刊介绍： Physica Medica, European Journal of Medical Physics, publishing with Elsevier from 2007, provides an international forum for research and reviews on the following main topics: Medical Imaging Radiation Therapy Radiation Protection Measuring Systems and Signal Processing Education and training in Medical Physics Professional issues in Medical Physics.