External validation and performance analysis of a deep learning-based model for the detection of intracranial hemorrhage.

IF 0.8 Q4 NEUROIMAGING

Neuroradiology Journal Pub Date : 2025-06-01 Epub Date: 2024-11-27 DOI:10.1177/19714009241303078

Ayman Nada, Alaa A Sayed, Mourad Hamouda, Mohamed Tantawi, Amna Khan, Addison Alt, Heidi Hassanein, Burak C Sevim, Talissa Altes, Ayman Gaballah

{"title":"External validation and performance analysis of a deep learning-based model for the detection of intracranial hemorrhage.","authors":"Ayman Nada, Alaa A Sayed, Mourad Hamouda, Mohamed Tantawi, Amna Khan, Addison Alt, Heidi Hassanein, Burak C Sevim, Talissa Altes, Ayman Gaballah","doi":"10.1177/19714009241303078","DOIUrl":null,"url":null,"abstract":"<p><p>PurposeWe aimed to investigate the external validation and performance of an FDA-approved deep learning model in labeling intracranial hemorrhage (ICH) cases on a real-world heterogeneous clinical dataset. Furthermore, we delved deeper into evaluating how patients' risk factors influenced the model's performance and gathered feedback on satisfaction from radiologists of varying ranks.MethodsThis prospective IRB approved study included 5600 non-contrast CT scans of the head in various clinical settings, that is, emergency, inpatient, and outpatient units. The patients' risk factors were collected and tested for impacting the performance of DL model utilizing univariate and multivariate regression analyses. The performance of DL model was contrasted to the radiologists' interpretation to determine the presence or absence of ICH with subsequent classification into subcategories of ICH. Key metrics, including accuracy, sensitivity, specificity, positive predictive value, and negative predictive value, were calculated. Receiver operating characteristics curve, along with the area under the curve, were determined. Additionally, a questionnaire was conducted with radiologists of varying ranks to assess their experience with the model.ResultsThe model exhibited outstanding performance, achieving a high sensitivity of 89% and specificity of 96%. Additional performance metrics, including positive predictive value (82%), negative predictive value (97%), and overall accuracy (94%), underscore its robust capabilities. The area under the ROC curve further demonstrated the model's efficacy, reaching 0.954. Multivariate logistic regression revealed statistical significance for age, sex, history of trauma, operative intervention, HTN, and smoking.ConclusionOur study highlights the satisfactory performance of the DL model on a diverse real-world dataset, garnering positive feedback from radiology trainees.</p>","PeriodicalId":47358,"journal":{"name":"Neuroradiology Journal","volume":" ","pages":"312-321"},"PeriodicalIF":0.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11603421/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroradiology Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/19714009241303078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/27 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"NEUROIMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

PurposeWe aimed to investigate the external validation and performance of an FDA-approved deep learning model in labeling intracranial hemorrhage (ICH) cases on a real-world heterogeneous clinical dataset. Furthermore, we delved deeper into evaluating how patients' risk factors influenced the model's performance and gathered feedback on satisfaction from radiologists of varying ranks.MethodsThis prospective IRB approved study included 5600 non-contrast CT scans of the head in various clinical settings, that is, emergency, inpatient, and outpatient units. The patients' risk factors were collected and tested for impacting the performance of DL model utilizing univariate and multivariate regression analyses. The performance of DL model was contrasted to the radiologists' interpretation to determine the presence or absence of ICH with subsequent classification into subcategories of ICH. Key metrics, including accuracy, sensitivity, specificity, positive predictive value, and negative predictive value, were calculated. Receiver operating characteristics curve, along with the area under the curve, were determined. Additionally, a questionnaire was conducted with radiologists of varying ranks to assess their experience with the model.ResultsThe model exhibited outstanding performance, achieving a high sensitivity of 89% and specificity of 96%. Additional performance metrics, including positive predictive value (82%), negative predictive value (97%), and overall accuracy (94%), underscore its robust capabilities. The area under the ROC curve further demonstrated the model's efficacy, reaching 0.954. Multivariate logistic regression revealed statistical significance for age, sex, history of trauma, operative intervention, HTN, and smoking.ConclusionOur study highlights the satisfactory performance of the DL model on a diverse real-world dataset, garnering positive feedback from radiology trainees.

查看原文本刊更多论文

基于深度学习的颅内出血检测模型的外部验证和性能分析。

目的：我们旨在研究美国食品药品管理局（FDA）批准的深度学习模型在真实世界异构临床数据集上标记颅内出血（ICH）病例的外部验证和性能。此外，我们还深入评估了患者的风险因素对模型性能的影响，并收集了不同级别放射科医生的满意度反馈：这项经 IRB 批准的前瞻性研究包括在不同临床环境（即急诊室、住院部和门诊部）中进行的 5600 次头部非对比 CT 扫描。通过单变量和多变量回归分析，收集并测试了患者的风险因素对 DL 模型性能的影响。将 DL 模型的性能与放射科医生的判读进行对比，以确定是否存在 ICH，并将其划分为 ICH 子类别。计算的关键指标包括准确性、灵敏度、特异性、阳性预测值和阴性预测值。还确定了接收者操作特征曲线和曲线下面积。此外，还对不同级别的放射科医生进行了问卷调查，以评估他们使用该模型的经验：该模型表现出色，灵敏度高达 89%，特异性高达 96%。其他性能指标，包括阳性预测值（82%）、阴性预测值（97%）和总体准确率（94%），都凸显了该模型的强大功能。ROC 曲线下面积进一步证明了该模型的有效性，达到 0.954。多变量逻辑回归显示，年龄、性别、外伤史、手术干预、高血压和吸烟在统计学上具有显著性：我们的研究强调了 DL 模型在不同的真实数据集上令人满意的表现，获得了放射科学员的积极反馈。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neuroradiology Journal NEUROIMAGING-

CiteScore

2.50

自引率

0.00%

发文量

101

期刊介绍： NRJ - The Neuroradiology Journal (formerly Rivista di Neuroradiologia) is the official journal of the Italian Association of Neuroradiology and of the several Scientific Societies from all over the world. Founded in 1988 as Rivista di Neuroradiologia, of June 2006 evolved in NRJ - The Neuroradiology Journal. It is published bimonthly.