SegQC: a segmentation network-based framework for multi-metric segmentation quality control and segmentation error detection in volumetric medical images

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-05-08 DOI:10.1016/j.media.2025.103638

Bella Specktor-Fadida , Liat Ben-Sira , Dafna Ben-Bashat , Leo Joskowicz

{"title":"SegQC: a segmentation network-based framework for multi-metric segmentation quality control and segmentation error detection in volumetric medical images","authors":"Bella Specktor-Fadida , Liat Ben-Sira , Dafna Ben-Bashat , Leo Joskowicz","doi":"10.1016/j.media.2025.103638","DOIUrl":null,"url":null,"abstract":"<div><div>Quality control (QC) of structures segmentation in volumetric medical images is important for identifying segmentation errors in clinical practice and for facilitating model development by enhancing network performance in semi-supervised and active learning scenarios. This paper introduces SegQC, a novel framework for segmentation quality estimation and segmentation error detection. SegQC computes an estimate measure of the quality of a segmentation in volumetric scans and in their individual slices and identifies possible segmentation error regions within a slice. The key components of SegQC include: 1) SegQC<img>Net, a deep network that inputs a scan and its segmentation mask and outputs segmentation error probabilities for each voxel in the scan; 2) three new segmentation quality metrics computed from the segmentation error probabilities; 3) a new method for detecting possible segmentation errors in scan slices computed from the segmentation error probabilities. We introduce a novel evaluation scheme to measure segmentation error discrepancies based on an expert radiologist’s corrections of automatically produced segmentations that yields smaller observer variability and is closer to actual segmentation errors. We demonstrate SegQC on three fetal structures in 198 fetal MRI scans – fetal brain, fetal body and the placenta. To assess the benefits of SegQC, we compare it to the unsupervised Test Time Augmentation (TTA)-based QC and to supervised autoencoder (AE)-based QC. Our studies indicate that SegQC outperforms TTA-based quality estimation for whole scans and individual slices in terms of Pearson correlation and MAE for fetal body and fetal brain structures segmentation as well as for volumetric overlap metrics estimation of the placenta structure. Compared to both unsupervised TTA and supervised AE methods, SegQC achieves lower MAE for both 3D and 2D Dice estimates and higher Pearson correlation for volumetric Dice. Our segmentation error detection method achieved recall and precision rates of 0.77 and 0.48 for fetal body, and 0.74 and 0.55 for fetal brain segmentation error detection, respectively. Ranking derived from metrics estimation surpasses rankings based on entropy and sum for TTA and SegQC<img>Net estimations, respectively. SegQC provides high-quality metrics estimation for both 2D and 3D medical images as well as error localization within slices, offering important improvements to segmentation QC.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"103 ","pages":"Article 103638"},"PeriodicalIF":11.8000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525001859","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Quality control (QC) of structures segmentation in volumetric medical images is important for identifying segmentation errors in clinical practice and for facilitating model development by enhancing network performance in semi-supervised and active learning scenarios. This paper introduces SegQC, a novel framework for segmentation quality estimation and segmentation error detection. SegQC computes an estimate measure of the quality of a segmentation in volumetric scans and in their individual slices and identifies possible segmentation error regions within a slice. The key components of SegQC include: 1) SegQCNet, a deep network that inputs a scan and its segmentation mask and outputs segmentation error probabilities for each voxel in the scan; 2) three new segmentation quality metrics computed from the segmentation error probabilities; 3) a new method for detecting possible segmentation errors in scan slices computed from the segmentation error probabilities. We introduce a novel evaluation scheme to measure segmentation error discrepancies based on an expert radiologist’s corrections of automatically produced segmentations that yields smaller observer variability and is closer to actual segmentation errors. We demonstrate SegQC on three fetal structures in 198 fetal MRI scans – fetal brain, fetal body and the placenta. To assess the benefits of SegQC, we compare it to the unsupervised Test Time Augmentation (TTA)-based QC and to supervised autoencoder (AE)-based QC. Our studies indicate that SegQC outperforms TTA-based quality estimation for whole scans and individual slices in terms of Pearson correlation and MAE for fetal body and fetal brain structures segmentation as well as for volumetric overlap metrics estimation of the placenta structure. Compared to both unsupervised TTA and supervised AE methods, SegQC achieves lower MAE for both 3D and 2D Dice estimates and higher Pearson correlation for volumetric Dice. Our segmentation error detection method achieved recall and precision rates of 0.77 and 0.48 for fetal body, and 0.74 and 0.55 for fetal brain segmentation error detection, respectively. Ranking derived from metrics estimation surpasses rankings based on entropy and sum for TTA and SegQCNet estimations, respectively. SegQC provides high-quality metrics estimation for both 2D and 3D medical images as well as error localization within slices, offering important improvements to segmentation QC.

Abstract Image

查看原文本刊更多论文

SegQC：一个基于分割网络的框架，用于体积医学图像的多度量分割质量控制和分割误差检测

体积医学图像中结构分割的质量控制（QC）对于识别临床实践中的分割错误以及通过提高半监督和主动学习场景下的网络性能来促进模型开发具有重要意义。本文介绍了一种新的分割质量估计和分割误差检测框架SegQC。SegQC计算体积扫描和其单独切片中分割质量的估计度量，并识别切片内可能的分割错误区域。SegQC的关键组件包括：1)SegQCNet，这是一个深度网络，它输入扫描及其分割掩码，并输出扫描中每个体素的分割错误概率；2)根据分割误差概率计算出三种新的分割质量度量；3)一种基于分割错误概率的扫描切片分割错误检测方法。我们引入了一种新的评估方案来测量分割误差差异，该方案基于放射科专家对自动生成的分割的修正，产生更小的观察者可变性，更接近实际的分割误差。我们在198个胎儿MRI扫描中展示了三个胎儿结构的SegQC -胎儿脑，胎儿体和胎盘。为了评估SegQC的好处，我们将其与基于无监督测试时间增强（TTA）的QC和基于监督自编码器（AE）的QC进行了比较。我们的研究表明，SegQC在整个扫描和单个切片的Pearson相关性方面优于基于ta的质量估计，在胎儿身体和胎儿大脑结构分割以及胎盘结构的体积重叠度量估计方面优于基于ta的质量估计。与无监督TTA和监督AE方法相比，SegQC对3D和2D Dice估计的MAE都较低，对体积Dice的Pearson相关性较高。我们的分割错误检测方法对胎儿身体的查全率和查准率分别为0.77和0.48，对胎儿大脑的分割错误检测的查全率和查准率分别为0.74和0.55。对于TTA和SegQCNet估计，来自度量估计的排名分别优于基于熵和总和的排名。SegQC为2D和3D医学图像以及切片内的错误定位提供了高质量的度量估计，为分割QC提供了重要的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.