基于深度学习的颅内出血检测模型的外部验证和性能分析。

IF 1.3 Q4 NEUROIMAGING
Ayman Nada, Alaa A Sayed, Mourad Hamouda, Mohamed Tantawi, Amna Khan, Addison Alt, Heidi Hassanein, Burak C Sevim, Talissa Altes, Ayman Gaballah
{"title":"基于深度学习的颅内出血检测模型的外部验证和性能分析。","authors":"Ayman Nada, Alaa A Sayed, Mourad Hamouda, Mohamed Tantawi, Amna Khan, Addison Alt, Heidi Hassanein, Burak C Sevim, Talissa Altes, Ayman Gaballah","doi":"10.1177/19714009241303078","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>We aimed to investigate the external validation and performance of an FDA-approved deep learning model in labeling intracranial hemorrhage (ICH) cases on a real-world heterogeneous clinical dataset. Furthermore, we delved deeper into evaluating how patients' risk factors influenced the model's performance and gathered feedback on satisfaction from radiologists of varying ranks.</p><p><strong>Methods: </strong>This prospective IRB approved study included 5600 non-contrast CT scans of the head in various clinical settings, that is, emergency, inpatient, and outpatient units. The patients' risk factors were collected and tested for impacting the performance of DL model utilizing univariate and multivariate regression analyses. The performance of DL model was contrasted to the radiologists' interpretation to determine the presence or absence of ICH with subsequent classification into subcategories of ICH. Key metrics, including accuracy, sensitivity, specificity, positive predictive value, and negative predictive value, were calculated. Receiver operating characteristics curve, along with the area under the curve, were determined. Additionally, a questionnaire was conducted with radiologists of varying ranks to assess their experience with the model.</p><p><strong>Results: </strong>The model exhibited outstanding performance, achieving a high sensitivity of 89% and specificity of 96%. Additional performance metrics, including positive predictive value (82%), negative predictive value (97%), and overall accuracy (94%), underscore its robust capabilities. The area under the ROC curve further demonstrated the model's efficacy, reaching 0.954. Multivariate logistic regression revealed statistical significance for age, sex, history of trauma, operative intervention, HTN, and smoking.</p><p><strong>Conclusion: </strong>Our study highlights the satisfactory performance of the DL model on a diverse real-world dataset, garnering positive feedback from radiology trainees.</p>","PeriodicalId":47358,"journal":{"name":"Neuroradiology Journal","volume":" ","pages":"19714009241303078"},"PeriodicalIF":1.3000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11603421/pdf/","citationCount":"0","resultStr":"{\"title\":\"External validation and performance analysis of a deep learning-based model for the detection of intracranial hemorrhage.\",\"authors\":\"Ayman Nada, Alaa A Sayed, Mourad Hamouda, Mohamed Tantawi, Amna Khan, Addison Alt, Heidi Hassanein, Burak C Sevim, Talissa Altes, Ayman Gaballah\",\"doi\":\"10.1177/19714009241303078\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>We aimed to investigate the external validation and performance of an FDA-approved deep learning model in labeling intracranial hemorrhage (ICH) cases on a real-world heterogeneous clinical dataset. Furthermore, we delved deeper into evaluating how patients' risk factors influenced the model's performance and gathered feedback on satisfaction from radiologists of varying ranks.</p><p><strong>Methods: </strong>This prospective IRB approved study included 5600 non-contrast CT scans of the head in various clinical settings, that is, emergency, inpatient, and outpatient units. The patients' risk factors were collected and tested for impacting the performance of DL model utilizing univariate and multivariate regression analyses. The performance of DL model was contrasted to the radiologists' interpretation to determine the presence or absence of ICH with subsequent classification into subcategories of ICH. Key metrics, including accuracy, sensitivity, specificity, positive predictive value, and negative predictive value, were calculated. Receiver operating characteristics curve, along with the area under the curve, were determined. Additionally, a questionnaire was conducted with radiologists of varying ranks to assess their experience with the model.</p><p><strong>Results: </strong>The model exhibited outstanding performance, achieving a high sensitivity of 89% and specificity of 96%. Additional performance metrics, including positive predictive value (82%), negative predictive value (97%), and overall accuracy (94%), underscore its robust capabilities. The area under the ROC curve further demonstrated the model's efficacy, reaching 0.954. Multivariate logistic regression revealed statistical significance for age, sex, history of trauma, operative intervention, HTN, and smoking.</p><p><strong>Conclusion: </strong>Our study highlights the satisfactory performance of the DL model on a diverse real-world dataset, garnering positive feedback from radiology trainees.</p>\",\"PeriodicalId\":47358,\"journal\":{\"name\":\"Neuroradiology Journal\",\"volume\":\" \",\"pages\":\"19714009241303078\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11603421/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neuroradiology Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/19714009241303078\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"NEUROIMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroradiology Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/19714009241303078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"NEUROIMAGING","Score":null,"Total":0}
引用次数: 0

摘要

目的:我们旨在研究美国食品药品管理局(FDA)批准的深度学习模型在真实世界异构临床数据集上标记颅内出血(ICH)病例的外部验证和性能。此外,我们还深入评估了患者的风险因素对模型性能的影响,并收集了不同级别放射科医生的满意度反馈:这项经 IRB 批准的前瞻性研究包括在不同临床环境(即急诊室、住院部和门诊部)中进行的 5600 次头部非对比 CT 扫描。通过单变量和多变量回归分析,收集并测试了患者的风险因素对 DL 模型性能的影响。将 DL 模型的性能与放射科医生的判读进行对比,以确定是否存在 ICH,并将其划分为 ICH 子类别。计算的关键指标包括准确性、灵敏度、特异性、阳性预测值和阴性预测值。还确定了接收者操作特征曲线和曲线下面积。此外,还对不同级别的放射科医生进行了问卷调查,以评估他们使用该模型的经验:该模型表现出色,灵敏度高达 89%,特异性高达 96%。其他性能指标,包括阳性预测值(82%)、阴性预测值(97%)和总体准确率(94%),都凸显了该模型的强大功能。ROC 曲线下面积进一步证明了该模型的有效性,达到 0.954。多变量逻辑回归显示,年龄、性别、外伤史、手术干预、高血压和吸烟在统计学上具有显著性:我们的研究强调了 DL 模型在不同的真实数据集上令人满意的表现,获得了放射科学员的积极反馈。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
External validation and performance analysis of a deep learning-based model for the detection of intracranial hemorrhage.

Purpose: We aimed to investigate the external validation and performance of an FDA-approved deep learning model in labeling intracranial hemorrhage (ICH) cases on a real-world heterogeneous clinical dataset. Furthermore, we delved deeper into evaluating how patients' risk factors influenced the model's performance and gathered feedback on satisfaction from radiologists of varying ranks.

Methods: This prospective IRB approved study included 5600 non-contrast CT scans of the head in various clinical settings, that is, emergency, inpatient, and outpatient units. The patients' risk factors were collected and tested for impacting the performance of DL model utilizing univariate and multivariate regression analyses. The performance of DL model was contrasted to the radiologists' interpretation to determine the presence or absence of ICH with subsequent classification into subcategories of ICH. Key metrics, including accuracy, sensitivity, specificity, positive predictive value, and negative predictive value, were calculated. Receiver operating characteristics curve, along with the area under the curve, were determined. Additionally, a questionnaire was conducted with radiologists of varying ranks to assess their experience with the model.

Results: The model exhibited outstanding performance, achieving a high sensitivity of 89% and specificity of 96%. Additional performance metrics, including positive predictive value (82%), negative predictive value (97%), and overall accuracy (94%), underscore its robust capabilities. The area under the ROC curve further demonstrated the model's efficacy, reaching 0.954. Multivariate logistic regression revealed statistical significance for age, sex, history of trauma, operative intervention, HTN, and smoking.

Conclusion: Our study highlights the satisfactory performance of the DL model on a diverse real-world dataset, garnering positive feedback from radiology trainees.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Neuroradiology Journal
Neuroradiology Journal NEUROIMAGING-
CiteScore
2.50
自引率
0.00%
发文量
101
期刊介绍: NRJ - The Neuroradiology Journal (formerly Rivista di Neuroradiologia) is the official journal of the Italian Association of Neuroradiology and of the several Scientific Societies from all over the world. Founded in 1988 as Rivista di Neuroradiologia, of June 2006 evolved in NRJ - The Neuroradiology Journal. It is published bimonthly.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信