Evaluation of state-of-the-art deep learning models in the segmentation of the left and right ventricles in parasternal short-axis echocardiograms.

IF 1.9 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Journal of Medical Imaging Pub Date : 2025-03-01 Epub Date: 2025-03-26 DOI:10.1117/1.JMI.12.2.024002

Julian R Cuellar, Vu Dinh, Manjula Burri, Julie Roelandts, James Wendling, Jon D Klingensmith

{"title":"Evaluation of state-of-the-art deep learning models in the segmentation of the left and right ventricles in parasternal short-axis echocardiograms.","authors":"Julian R Cuellar, Vu Dinh, Manjula Burri, Julie Roelandts, James Wendling, Jon D Klingensmith","doi":"10.1117/1.JMI.12.2.024002","DOIUrl":null,"url":null,"abstract":"Purpose: Previous studies on echocardiogram segmentation are focused on the left ventricle in parasternal long-axis views. Deep-learning models were evaluated on the segmentation of the ventricles in parasternal short-axis echocardiograms (PSAX-echo). Segmentation of the ventricles in complementary echocardiogram views will allow the computation of important metrics with the potential to aid in diagnosing cardio-pulmonary diseases and other cardiomyopathies. Evaluating state-of-the-art models with small datasets can reveal if they improve performance on limited data.Approach: PSAX-echo was performed on 33 volunteer women. An experienced cardiologist identified end-diastole and end-systole frames from 387 scans, and expert observers manually traced the contours of the cardiac structures. Traced frames were pre-processed and used to create labels to train two domain-specific (Unet-Resnet101 and Unet-ResNet50), and four general-domain [three segment anything (SAM) variants, and the Detectron2] deep-learning models. The performance of the models was evaluated using the Dice similarity coefficient (DSC), Hausdorff distance (HD), and difference in cross-sectional area (DCSA).Results: The Unet-Resnet101 model provided superior performance in the segmentation of the ventricles with 0.83, 4.93 pixels, and <math><mrow><mn>106</mn> <mtext> </mtext> <msup><mrow><mtext>pixel</mtext></mrow> <mrow><mn>2</mn></mrow> </msup> </mrow> </math> on average for DSC, HD, and DCSA, respectively. A fine-tuned MedSAM model provided a performance of 0.82, 6.66 pixels, and <math><mrow><mn>1252</mn> <mtext> </mtext> <msup><mrow><mtext>pixel</mtext></mrow> <mrow><mn>2</mn></mrow> </msup> </mrow> </math> , whereas the Detectron2 model provided 0.78, 2.12 pixels, and <math><mrow><mn>116</mn> <mtext> </mtext> <msup><mrow><mtext>pixel</mtext></mrow> <mrow><mn>2</mn></mrow> </msup> </mrow> </math> for the same metrics, respectively.Conclusions: Deep-learning models are suitable for the segmentation of the left and right ventricles in PSAX-echo. We demonstrated that domain-specific trained models such as Unet-ResNet provide higher accuracy for echo segmentation than general-domain segmentation models when working with small and locally acquired datasets.","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 2","pages":"024002"},"PeriodicalIF":1.9000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11943840/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.2.024002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/26 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Previous studies on echocardiogram segmentation are focused on the left ventricle in parasternal long-axis views. Deep-learning models were evaluated on the segmentation of the ventricles in parasternal short-axis echocardiograms (PSAX-echo). Segmentation of the ventricles in complementary echocardiogram views will allow the computation of important metrics with the potential to aid in diagnosing cardio-pulmonary diseases and other cardiomyopathies. Evaluating state-of-the-art models with small datasets can reveal if they improve performance on limited data.

Approach: PSAX-echo was performed on 33 volunteer women. An experienced cardiologist identified end-diastole and end-systole frames from 387 scans, and expert observers manually traced the contours of the cardiac structures. Traced frames were pre-processed and used to create labels to train two domain-specific (Unet-Resnet101 and Unet-ResNet50), and four general-domain [three segment anything (SAM) variants, and the Detectron2] deep-learning models. The performance of the models was evaluated using the Dice similarity coefficient (DSC), Hausdorff distance (HD), and difference in cross-sectional area (DCSA).

Results: The Unet-Resnet101 model provided superior performance in the segmentation of the ventricles with 0.83, 4.93 pixels, and $106 {pixel}^{2}$ on average for DSC, HD, and DCSA, respectively. A fine-tuned MedSAM model provided a performance of 0.82, 6.66 pixels, and $1252 {pixel}^{2}$ , whereas the Detectron2 model provided 0.78, 2.12 pixels, and $116 {pixel}^{2}$ for the same metrics, respectively.

Conclusions: Deep-learning models are suitable for the segmentation of the left and right ventricles in PSAX-echo. We demonstrated that domain-specific trained models such as Unet-ResNet provide higher accuracy for echo segmentation than general-domain segmentation models when working with small and locally acquired datasets.

查看原文本刊更多论文

评价最先进的深度学习模型在胸骨旁短轴超声心动图左心室和右心室的分割。

目的：以往的超声心动图分割研究主要集中在胸骨旁长轴位的左心室。在胸骨旁短轴超声心动图（PSAX-echo）上对脑室分割进行深度学习模型评估。在补充超声心动图视图中分割心室将允许计算重要指标，有助于诊断心肺疾病和其他心肌病。用小数据集评估最先进的模型可以揭示它们是否在有限的数据上提高了性能。方法：对33名女性志愿者进行psax超声检查。一位经验丰富的心脏病专家从387次扫描中确定了舒张末期和收缩末期帧，专家观察者手动追踪心脏结构的轮廓。跟踪帧经过预处理并用于创建标签，以训练两个特定领域（Unet-Resnet101和Unet-ResNet50）和四个通用领域[三段任意（SAM）变体，以及Detectron2]深度学习模型。采用Dice相似系数（DSC）、Hausdorff距离（HD）和横截面积差（DCSA）对模型的性能进行评价。结果：Unet-Resnet101模型在脑室分割方面表现优异，DSC、HD和DCSA的平均分割像素分别为0.83、4.93和106像素2。经过微调的MedSAM模型的性能分别为0.82、6.66像素和1252像素2，而Detectron2模型的性能分别为0.78、2.12像素和116像素2。结论：深度学习模型适用于PSAX-echo左、右心室分割。我们证明了特定领域的训练模型，如Unet-ResNet，在处理小型和本地获取的数据集时，比一般领域分割模型提供更高的回声分割精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.10

自引率

4.20%

发文量

期刊介绍： JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.