Bastiaan W.K. Schipaanboord , Peter J. Koopmans , Erik van der Bijl , Charlotte L. Brouwer , Tomas Janssen
{"title":"Weakly supervised commissioning of externally developed auto-segmentation models and applied to male pelvis MR auto-segmentation","authors":"Bastiaan W.K. Schipaanboord , Peter J. Koopmans , Erik van der Bijl , Charlotte L. Brouwer , Tomas Janssen","doi":"10.1016/j.ejmp.2025.105057","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction:</h3><div>When introducing an auto-segmentation model into clinical practice, assessing the quality of the predicted segmentations and the robustness of the model over a wide range of anatomical variation and/or image quality is difficult. Especially, when the model is provided by an external party and the institution introducing the model does not possess a high-quality dataset to commission the model on.</div></div><div><h3>Materials & Methods:</h3><div>Assuming that a model is more likely to fail for an atypical case as opposed to a more average one, we propose a methodology that selects cases for commissioning using unsupervised anomaly detection. For this, the model supplier provides a set of image/shape features that correlate with model performance on the training data. Next, the receiving hospital can use these features to train an unsupervised anomaly detector on a large dataset of unlabeled cases and use the anomaly scores to select representative cases for commissioning of the model. Since the anomaly detector is trained on unlabeled data, a large, high-quality, curated dataset is not required on the receiving hospital side.</div></div><div><h3>Results:</h3><div>Using the proposed approach, the likelihood of selecting atypical edge cases with low segmentation performance was increased, as compared to random selection. For a selection of 20 cases, an increase of 22% was observed.</div></div><div><h3>Conclusions:</h3><div>The increased performance spread provides a more representative range of expected performance in clinical practice. This approach could be used for model commissioning to increase the confidence that the model performs well over a wide range of expected anatomical variation.</div></div>","PeriodicalId":56092,"journal":{"name":"Physica Medica-European Journal of Medical Physics","volume":"136 ","pages":"Article 105057"},"PeriodicalIF":2.7000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physica Medica-European Journal of Medical Physics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S112017972500167X","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction:
When introducing an auto-segmentation model into clinical practice, assessing the quality of the predicted segmentations and the robustness of the model over a wide range of anatomical variation and/or image quality is difficult. Especially, when the model is provided by an external party and the institution introducing the model does not possess a high-quality dataset to commission the model on.
Materials & Methods:
Assuming that a model is more likely to fail for an atypical case as opposed to a more average one, we propose a methodology that selects cases for commissioning using unsupervised anomaly detection. For this, the model supplier provides a set of image/shape features that correlate with model performance on the training data. Next, the receiving hospital can use these features to train an unsupervised anomaly detector on a large dataset of unlabeled cases and use the anomaly scores to select representative cases for commissioning of the model. Since the anomaly detector is trained on unlabeled data, a large, high-quality, curated dataset is not required on the receiving hospital side.
Results:
Using the proposed approach, the likelihood of selecting atypical edge cases with low segmentation performance was increased, as compared to random selection. For a selection of 20 cases, an increase of 22% was observed.
Conclusions:
The increased performance spread provides a more representative range of expected performance in clinical practice. This approach could be used for model commissioning to increase the confidence that the model performs well over a wide range of expected anatomical variation.
期刊介绍:
Physica Medica, European Journal of Medical Physics, publishing with Elsevier from 2007, provides an international forum for research and reviews on the following main topics:
Medical Imaging
Radiation Therapy
Radiation Protection
Measuring Systems and Signal Processing
Education and training in Medical Physics
Professional issues in Medical Physics.