产前先天性心脏病（CHD）筛查的深度学习模型可以应用于社区背景下的回顾性成像，在注释良好的队列中优于初始临床检测。

IF 6.3 1区医学 Q1 ACOUSTICS

Ultrasound in Obstetrics & Gynecology Pub Date : 2024-01-01 DOI:10.1002/uog.27503

C Athalye, A van Nisselrooij, S Rizvi, M C Haak, A J Moon-Grady, R Arnaout

{"title":"产前先天性心脏病（CHD）筛查的深度学习模型可以应用于社区背景下的回顾性成像，在注释良好的队列中优于初始临床检测。","authors":"C Athalye, A van Nisselrooij, S Rizvi, M C Haak, A J Moon-Grady, R Arnaout","doi":"10.1002/uog.27503","DOIUrl":null,"url":null,"abstract":"Objectives: Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.Methods: All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.Results: A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).Conclusions: A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.","PeriodicalId":23454,"journal":{"name":"Ultrasound in Obstetrics & Gynecology","volume":" ","pages":"44-52"},"PeriodicalIF":6.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10841849/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep-learning model for prenatal congenital heart disease screening generalizes to community setting and outperforms clinical detection.\",\"authors\":\"C Athalye, A van Nisselrooij, S Rizvi, M C Haak, A J Moon-Grady, R Arnaout\",\"doi\":\"10.1002/uog.27503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives: Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.Methods: All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.Results: A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).Conclusions: A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.\",\"PeriodicalId\":23454,\"journal\":{\"name\":\"Ultrasound in Obstetrics & Gynecology\",\"volume\":\" \",\"pages\":\"44-52\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10841849/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ultrasound in Obstetrics & Gynecology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/uog.27503\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasound in Obstetrics & Gynecology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/uog.27503","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

摘要

目的：尽管产前超声筛查项目几乎普及，但先天性心脏病（CHD）仍然被遗漏，这可能导致严重的发病率甚至死亡。深度机器学习（DL）可以实现超声波图像识别的自动化。本研究的主要目的是将先前开发的基于三级中心图像训练的DL模型应用于低风险人群中孕中期标准异常扫描期间获得的胎儿超声图像。第二个目的是比较最初的筛查诊断，即在护理点使用实时成像，与临床医生仅评估存储的图像。方法：对2015年至2016年间荷兰西北地区所有具有可用存储图像的孤立性严重CHD妊娠进行评估，并对同一地区的正常胎儿检查样本进行评估。我们比较了仅访问存储图像（如模型）的盲法人类专家的初始临床诊断准确性（通过实时访问实时成像）、模型准确性和性能。我们根据研究特征分析了表现，如持续时间、质量（由研究人员独立评分）、存储图像的数量和筛查视图的可用性。结果：对42例正常胎儿和66例先天性CHD进行了分析。在异常病例中，31例漏诊，35例在临床解剖扫描时被发现（敏感性53%）。模型的敏感性和特异性分别为91%和78%。失明的人类专家（n=3）的敏感性和特异性分别为55±10%（范围47-67%）和71±13%（范围57-83%）。专家评分的图像质量在模型正确性方面存在统计学显著差异（p=0.03）。异常病例包括19个模型在训练中没有遇到的损伤；模型的性能（16/19正确）在以前遇到的病变和从未见过的病变上没有统计学上的显著差异（p=0.41）。结论：在一个超过50%的CHD病例最初在临床上被遗漏的队列中，以前训练的DL算法在检测CHD方面比最初的临床评估具有更高的灵敏度。值得注意的是，DL算法在低风险人群中的社区获取图像上表现良好，包括之前没有接触过的病变。此外，当模型和失明的人类专家都可以单独访问存储的图像，而不是临床医生在实时扫描中可获得的全部图像时，该模型的表现优于专家。总之，这些发现支持了DL模型的使用可以改善CHD的产前检测的主张。这篇文章受版权保护。保留所有权利。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep-learning model for prenatal congenital heart disease screening generalizes to community setting and outperforms clinical detection.

Objectives: Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.

Methods: All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.

Results: A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).

Conclusions: A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ultrasound in Obstetrics & Gynecology 医学-妇产科学

CiteScore

12.30

自引率

14.10%

发文量

891

审稿时长

1 months

期刊介绍： Ultrasound in Obstetrics & Gynecology (UOG) is the official journal of the International Society of Ultrasound in Obstetrics and Gynecology (ISUOG) and is considered the foremost international peer-reviewed journal in the field. It publishes cutting-edge research that is highly relevant to clinical practice, which includes guidelines, expert commentaries, consensus statements, original articles, and systematic reviews. UOG is widely recognized and included in prominent abstract and indexing databases such as Index Medicus and Current Contents.