S Ambrogio, I Verdon, B Laureano, K V Ramnarine, F Fedele, D Vilic, I Honey, E Barton, C Goncalves, Sze Mun Mak, H Shuaib, A Jacques
{"title":"Independent Evaluation of a Commercial AI Software for Incidental Findings of Pulmonary Embolism (IPE) on a Large Hospital Retrospective Dataset.","authors":"S Ambrogio, I Verdon, B Laureano, K V Ramnarine, F Fedele, D Vilic, I Honey, E Barton, C Goncalves, Sze Mun Mak, H Shuaib, A Jacques","doi":"10.1155/rrp/9091895","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background:</b> Early treatment of pulmonary embolism is associated with better outcomes, yet incidental PE (IPE) is frequently missed. This retrospective study aims to provide an independent assessment an artificial intelligence (AI) software, developed for flagging IPEs on CT scans. <b>Methods:</b> The study included consecutive CT examinations of 5042 unique patients (8 scanners and 3 protocols) acquired at a large NHS Trust between 01 January 2022 and 30 September 2022. Two radiologists blindly and independently reviewed the AI \"positive\" and a random selection of \"negative\" cases to establish the reference standard (<i>n</i> = 200). Discrepancies were adjudicated by a third radiologist. The clinical reports of the 200 cases were reviewed for comparison. Performance metrics for the software were calculated for the full (<i>n</i> = 5042) and reviewed (<i>n</i> = 200) cohorts separately. <b>Results:</b> Based on the reference standard, the IPE prevalence was 1.6% (81/5041). Across the reviewed cohort, the algorithm detected PE with a sensitivity of 96.4%, a specificity of 89.7%, a PPV of 87.1%, an NPV of 97.2%, and an accuracy of 92.5%. Across the full cohort, the algorithm exhibited a sensitivity of 96.4%, a specificity of 99.8%, a PPV of 87.1%, an NPV of 99.9%, and an accuracy of 99.7%. A review of the original clinical reports indicated that 11 cases of IPE were initially unreported. A total of 34 examinations were rejected by the software. While the scanner performed consistently across patient sexes and ethnicities, discrepancies were found among CT scanners. <b>Conclusions:</b> The AI software detected IPE with a high diagnostic accuracy on a large NHS dataset, showing that AI-supported reporting could improve diagnostic accuracy and reduce times to diagnosis.</p>","PeriodicalId":51864,"journal":{"name":"Radiology Research and Practice","volume":"2025 ","pages":"9091895"},"PeriodicalIF":2.2000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11991795/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiology Research and Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/rrp/9091895","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Early treatment of pulmonary embolism is associated with better outcomes, yet incidental PE (IPE) is frequently missed. This retrospective study aims to provide an independent assessment an artificial intelligence (AI) software, developed for flagging IPEs on CT scans. Methods: The study included consecutive CT examinations of 5042 unique patients (8 scanners and 3 protocols) acquired at a large NHS Trust between 01 January 2022 and 30 September 2022. Two radiologists blindly and independently reviewed the AI "positive" and a random selection of "negative" cases to establish the reference standard (n = 200). Discrepancies were adjudicated by a third radiologist. The clinical reports of the 200 cases were reviewed for comparison. Performance metrics for the software were calculated for the full (n = 5042) and reviewed (n = 200) cohorts separately. Results: Based on the reference standard, the IPE prevalence was 1.6% (81/5041). Across the reviewed cohort, the algorithm detected PE with a sensitivity of 96.4%, a specificity of 89.7%, a PPV of 87.1%, an NPV of 97.2%, and an accuracy of 92.5%. Across the full cohort, the algorithm exhibited a sensitivity of 96.4%, a specificity of 99.8%, a PPV of 87.1%, an NPV of 99.9%, and an accuracy of 99.7%. A review of the original clinical reports indicated that 11 cases of IPE were initially unreported. A total of 34 examinations were rejected by the software. While the scanner performed consistently across patient sexes and ethnicities, discrepancies were found among CT scanners. Conclusions: The AI software detected IPE with a high diagnostic accuracy on a large NHS dataset, showing that AI-supported reporting could improve diagnostic accuracy and reduce times to diagnosis.
期刊介绍:
Radiology Research and Practice is a peer-reviewed, Open Access journal that publishes articles on all areas of medical imaging. The journal promotes evidence-based radiology practice though the publication of original research, reviews, and clinical studies for a multidisciplinary audience. Radiology Research and Practice is archived in Portico, which provides permanent archiving for electronic scholarly journals, as well as via the LOCKSS initiative. It operates a fully open access publishing model which allows open global access to its published content. This model is supported through Article Processing Charges. For more information on Article Processing charges in gen