Diagnosis of congenital heart diseases in children from 2D and 3D sonography using convolutional neural networks: A scoping literature review

WFUMB Ultrasound Open Pub Date : 2025-10-08 DOI:10.1016/j.wfumbo.2025.100096

Ayiga Majid , Johnes Obungoloch , Alfred Enywaku , Obeti Francis , Denis Jjuuko , Eugene Bizimana , Biryomumeisho Joshua , Wasswa William

{"title":"Diagnosis of congenital heart diseases in children from 2D and 3D sonography using convolutional neural networks: A scoping literature review","authors":"Ayiga Majid , Johnes Obungoloch , Alfred Enywaku , Obeti Francis , Denis Jjuuko , Eugene Bizimana , Biryomumeisho Joshua , Wasswa William","doi":"10.1016/j.wfumbo.2025.100096","DOIUrl":null,"url":null,"abstract":"<div><div>Congenital heart diseases (CHDs) are the commonest congenital anomalies and a leading cause of neonatal morbidity and mortality worldwide. They affect approximately 1 % of live births globally. Prenatal detection with fetal echocardiography allows for timely referral and intervention. However, screening performance remains uneven because of operator dependence and resource limits. Convolutional neural networks (CNNs) offer automated image interpretation and have been widely applied to 2D and 3D fetal ultrasound (US) in recent years.</div><div>This scoping review maps CNN applications for prenatal CHD across 2010–September 2025. In adherence to the PRISMA-ScR methodology, we screened 845 records from PubMed, IEEE Xplore, Google Scholar, ScienceDirect and preprint servers and included 29 studies for synthesis. The volume of articles published on the topic rose sharply after 2020, with most datasets drawn from Asia and North America and no representation from sub-Saharan Africa. The most commonly reported architectures were DenseNet variants, U-Net families for segmentation, YOLO (You Only Look Once) variants for real-time detection, and ensemble hybrids for classification.</div><div>Reported internal performance is high in many studies, with top models achieving near-expert discrimination with AUCs (Area Under the Curve) up to 0.99. In our cohort, the mean AUC across studies reporting that metric was 0.911 (Standard Deviation-SD = 0.09; 95 % CI: 0.86–0.96; n = 11). Average sensitivity and specificity across studies reporting these metrics were 0.92 (SD = 0.04; n = 13) and 0.91 (SD = 0.04; n = 7), respectively. However, external and multi-centre validation remains limited, and performance commonly falls when models are tested on unseen centers or different scanner types. Methodological gaps include small or imbalanced lesion classes, inconsistent patient-level splitting, sparse reporting of deployment constraints, and limited fairness analyses.</div><div>Recent work shows promising directions such as multimodal fusion of B-mode and Doppler, lightweight networks, on-device optimization, federated and privacy-preserving training, and prospective protocols for multi-centre evaluation. To translate CNNs into clinical screening, we recommend coordinated efforts to build diverse, well-annotated repositories, adopt strict patient-level validation and external testing, report deployment metrics, and run prospective, pragmatic trials that measure clinical outcomes and health-economic impact. These steps can move CNNs from high-performance demonstrations to dependable tools that improve prenatal CHD detection equitably.</div></div>","PeriodicalId":101281,"journal":{"name":"WFUMB Ultrasound Open","volume":"3 2","pages":"Article 100096"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"WFUMB Ultrasound Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949668325000187","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Congenital heart diseases (CHDs) are the commonest congenital anomalies and a leading cause of neonatal morbidity and mortality worldwide. They affect approximately 1 % of live births globally. Prenatal detection with fetal echocardiography allows for timely referral and intervention. However, screening performance remains uneven because of operator dependence and resource limits. Convolutional neural networks (CNNs) offer automated image interpretation and have been widely applied to 2D and 3D fetal ultrasound (US) in recent years.

This scoping review maps CNN applications for prenatal CHD across 2010–September 2025. In adherence to the PRISMA-ScR methodology, we screened 845 records from PubMed, IEEE Xplore, Google Scholar, ScienceDirect and preprint servers and included 29 studies for synthesis. The volume of articles published on the topic rose sharply after 2020, with most datasets drawn from Asia and North America and no representation from sub-Saharan Africa. The most commonly reported architectures were DenseNet variants, U-Net families for segmentation, YOLO (You Only Look Once) variants for real-time detection, and ensemble hybrids for classification.

Reported internal performance is high in many studies, with top models achieving near-expert discrimination with AUCs (Area Under the Curve) up to 0.99. In our cohort, the mean AUC across studies reporting that metric was 0.911 (Standard Deviation-SD = 0.09; 95 % CI: 0.86–0.96; n = 11). Average sensitivity and specificity across studies reporting these metrics were 0.92 (SD = 0.04; n = 13) and 0.91 (SD = 0.04; n = 7), respectively. However, external and multi-centre validation remains limited, and performance commonly falls when models are tested on unseen centers or different scanner types. Methodological gaps include small or imbalanced lesion classes, inconsistent patient-level splitting, sparse reporting of deployment constraints, and limited fairness analyses.

Recent work shows promising directions such as multimodal fusion of B-mode and Doppler, lightweight networks, on-device optimization, federated and privacy-preserving training, and prospective protocols for multi-centre evaluation. To translate CNNs into clinical screening, we recommend coordinated efforts to build diverse, well-annotated repositories, adopt strict patient-level validation and external testing, report deployment metrics, and run prospective, pragmatic trials that measure clinical outcomes and health-economic impact. These steps can move CNNs from high-performance demonstrations to dependable tools that improve prenatal CHD detection equitably.

查看原文本刊更多论文

使用卷积神经网络从二维和三维超声诊断儿童先天性心脏病：范围文献综述

先天性心脏病（CHDs）是最常见的先天性异常，也是全世界新生儿发病率和死亡率的主要原因。它们影响了全球约1%的活产。产前检测胎儿超声心动图允许及时转诊和干预。然而，由于操作者的依赖性和资源限制，筛选效果仍然参差不齐。卷积神经网络（cnn）提供自动图像解释，近年来已广泛应用于二维和三维胎儿超声（US）。这一范围综述绘制了2010年至2025年9月期间CNN在产前冠心病中的应用。按照PRISMA-ScR方法，我们从PubMed、IEEE explore、b谷歌Scholar、ScienceDirect和预印服务器中筛选了845条记录，并纳入了29项研究用于合成。在2020年之后，发表的关于该主题的文章数量急剧增加，大多数数据集来自亚洲和北美，没有来自撒哈拉以南非洲的代表。最常见的报告架构是DenseNet变体，用于分割的U-Net系列，用于实时检测的YOLO （You Only Look Once）变体，以及用于分类的集成混合。在许多研究中，报告的内部绩效很高，顶级模型的auc（曲线下面积）达到了0.99的接近专家的区分。在我们的队列中，报告度量的研究的平均AUC为0.911（标准差= 0.09;95% CI: 0.86-0.96; n = 11）。报告这些指标的研究的平均灵敏度和特异性分别为0.92 （SD = 0.04, n = 13）和0.91 （SD = 0.04, n = 7）。然而，外部和多中心验证仍然有限，并且当模型在未见过的中心或不同的扫描仪类型上进行测试时，性能通常会下降。方法上的差距包括小的或不平衡的病变类别、不一致的患者水平划分、部署约束的稀疏报告和有限的公平性分析。最近的工作显示了有希望的方向，如b模式和多普勒的多模态融合，轻量级网络，设备上优化，联合和隐私保护培训，以及多中心评估的前瞻性协议。为了将cnn转化为临床筛查，我们建议协调努力，建立多样化、注释良好的知识库，采用严格的患者级验证和外部测试，报告部署指标，并进行前瞻性、务实的试验，以衡量临床结果和健康经济影响。这些步骤可以将cnn从高性能演示转变为可靠的工具，从而公平地改善产前冠心病检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

WFUMB Ultrasound Open

自引率

0.00%

发文量