产前先天性心脏病(CHD)筛查的深度学习模型可以应用于社区背景下的回顾性成像,在注释良好的队列中优于初始临床检测。

IF 6.3 1区 医学 Q1 ACOUSTICS
C Athalye, A van Nisselrooij, S Rizvi, M C Haak, A J Moon-Grady, R Arnaout
{"title":"产前先天性心脏病(CHD)筛查的深度学习模型可以应用于社区背景下的回顾性成像,在注释良好的队列中优于初始临床检测。","authors":"C Athalye, A van Nisselrooij, S Rizvi, M C Haak, A J Moon-Grady, R Arnaout","doi":"10.1002/uog.27503","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.</p><p><strong>Methods: </strong>All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.</p><p><strong>Results: </strong>A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).</p><p><strong>Conclusions: </strong>A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.</p>","PeriodicalId":23454,"journal":{"name":"Ultrasound in Obstetrics & Gynecology","volume":" ","pages":"44-52"},"PeriodicalIF":6.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10841849/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep-learning model for prenatal congenital heart disease screening generalizes to community setting and outperforms clinical detection.\",\"authors\":\"C Athalye, A van Nisselrooij, S Rizvi, M C Haak, A J Moon-Grady, R Arnaout\",\"doi\":\"10.1002/uog.27503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.</p><p><strong>Methods: </strong>All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.</p><p><strong>Results: </strong>A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).</p><p><strong>Conclusions: </strong>A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.</p>\",\"PeriodicalId\":23454,\"journal\":{\"name\":\"Ultrasound in Obstetrics & Gynecology\",\"volume\":\" \",\"pages\":\"44-52\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10841849/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ultrasound in Obstetrics & Gynecology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/uog.27503\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasound in Obstetrics & Gynecology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/uog.27503","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

摘要

目的:尽管产前超声筛查项目几乎普及,但先天性心脏病(CHD)仍然被遗漏,这可能导致严重的发病率甚至死亡。深度机器学习(DL)可以实现超声波图像识别的自动化。本研究的主要目的是将先前开发的基于三级中心图像训练的DL模型应用于低风险人群中孕中期标准异常扫描期间获得的胎儿超声图像。第二个目的是比较最初的筛查诊断,即在护理点使用实时成像,与临床医生仅评估存储的图像。方法:对2015年至2016年间荷兰西北地区所有具有可用存储图像的孤立性严重CHD妊娠进行评估,并对同一地区的正常胎儿检查样本进行评估。我们比较了仅访问存储图像(如模型)的盲法人类专家的初始临床诊断准确性(通过实时访问实时成像)、模型准确性和性能。我们根据研究特征分析了表现,如持续时间、质量(由研究人员独立评分)、存储图像的数量和筛查视图的可用性。结果:对42例正常胎儿和66例先天性CHD进行了分析。在异常病例中,31例漏诊,35例在临床解剖扫描时被发现(敏感性53%)。模型的敏感性和特异性分别为91%和78%。失明的人类专家(n=3)的敏感性和特异性分别为55±10%(范围47-67%)和71±13%(范围57-83%)。专家评分的图像质量在模型正确性方面存在统计学显著差异(p=0.03)。异常病例包括19个模型在训练中没有遇到的损伤;模型的性能(16/19正确)在以前遇到的病变和从未见过的病变上没有统计学上的显著差异(p=0.41)。结论:在一个超过50%的CHD病例最初在临床上被遗漏的队列中,以前训练的DL算法在检测CHD方面比最初的临床评估具有更高的灵敏度。值得注意的是,DL算法在低风险人群中的社区获取图像上表现良好,包括之前没有接触过的病变。此外,当模型和失明的人类专家都可以单独访问存储的图像,而不是临床医生在实时扫描中可获得的全部图像时,该模型的表现优于专家。总之,这些发现支持了DL模型的使用可以改善CHD的产前检测的主张。这篇文章受版权保护。保留所有权利。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Deep-learning model for prenatal congenital heart disease screening generalizes to community setting and outperforms clinical detection.

Objectives: Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.

Methods: All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.

Results: A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).

Conclusions: A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
12.30
自引率
14.10%
发文量
891
审稿时长
1 months
期刊介绍: Ultrasound in Obstetrics & Gynecology (UOG) is the official journal of the International Society of Ultrasound in Obstetrics and Gynecology (ISUOG) and is considered the foremost international peer-reviewed journal in the field. It publishes cutting-edge research that is highly relevant to clinical practice, which includes guidelines, expert commentaries, consensus statements, original articles, and systematic reviews. UOG is widely recognized and included in prominent abstract and indexing databases such as Index Medicus and Current Contents.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信