Real-life performance of AI-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures in appendicular skeletal trauma
IF 3.3 3区 医学Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Amandine Crombé , Alexandre Ben Cheikh , Mylène Seux , Eric Stéphant , Julien May , Olivier Preteseille , Adrien Vague , Frédéric Nativel , Matthieu Daniel , Johan Etievant , Jessica Aristizabal , Antoine Perrey , Guillaume Gorincour
{"title":"Real-life performance of AI-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures in appendicular skeletal trauma","authors":"Amandine Crombé , Alexandre Ben Cheikh , Mylène Seux , Eric Stéphant , Julien May , Olivier Preteseille , Adrien Vague , Frédéric Nativel , Matthieu Daniel , Johan Etievant , Jessica Aristizabal , Antoine Perrey , Guillaume Gorincour","doi":"10.1016/j.ejrad.2025.112469","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>To compare the performance of artificial intelligence (AI)-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures.</div></div><div><h3>Materials and Methods</h3><div>Consecutive patients treated at two centres for appendicular skeletal traumatic injury between January and April 2021 who underwent X-ray imaging and whose initial conclusions were available and prospectively encoded by emergency physicians, were also prospectively analysed via two AI solutions (BoneView and SmartUrgence) available for the real-life interpretation of AI-aided radiologists. The ground truth was retrospectively assessed by 5 senior musculoskeletal radiologists who were aware of all the clinical, radiological and AI data. Numbers of suspected fractures, true positives and false positives per AI were compared. Diagnostic performance metrics (sensitivity, specificity, positive and negative predictive values and accuracy with 95% confidence intervals) for detecting fractures were estimated for each interpretation (emergency physician, BoneView, SmartUrgence, AI-aided radiologist).</div></div><div><h3>Results</h3><div>969 patients with 1049 radiography sets were included, 287 of whom had fractures (27.4 %). The average number of any fracture and true positive fractures were greater with BoneView than with SmartUrgence (P = 0.0469 and P = 0.0022, respectively). The real-life sensitivity, specificity and accuracy for detecting fracture in the entire cohort were 93 %, 99 % and 97.6 % for AI-aided radiologists; 80.8 %, 97.6 % and 93 % for emergency physicians; 89.5 %, 93.8 % and 92.7 % for BoneView; and 85.7 %, 94.6 % and 92.2 % for SmartUrgence.</div></div><div><h3>Conclusion</h3><div>In a real-life emergency setting, the performance of AI-aided radiologists in diagnosing bone fractures was excellent, and these radiologists outperformed AI solutions alone regardless of age and location.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"194 ","pages":"Article 112469"},"PeriodicalIF":3.3000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X25005558","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
To compare the performance of artificial intelligence (AI)-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures.
Materials and Methods
Consecutive patients treated at two centres for appendicular skeletal traumatic injury between January and April 2021 who underwent X-ray imaging and whose initial conclusions were available and prospectively encoded by emergency physicians, were also prospectively analysed via two AI solutions (BoneView and SmartUrgence) available for the real-life interpretation of AI-aided radiologists. The ground truth was retrospectively assessed by 5 senior musculoskeletal radiologists who were aware of all the clinical, radiological and AI data. Numbers of suspected fractures, true positives and false positives per AI were compared. Diagnostic performance metrics (sensitivity, specificity, positive and negative predictive values and accuracy with 95% confidence intervals) for detecting fractures were estimated for each interpretation (emergency physician, BoneView, SmartUrgence, AI-aided radiologist).
Results
969 patients with 1049 radiography sets were included, 287 of whom had fractures (27.4 %). The average number of any fracture and true positive fractures were greater with BoneView than with SmartUrgence (P = 0.0469 and P = 0.0022, respectively). The real-life sensitivity, specificity and accuracy for detecting fracture in the entire cohort were 93 %, 99 % and 97.6 % for AI-aided radiologists; 80.8 %, 97.6 % and 93 % for emergency physicians; 89.5 %, 93.8 % and 92.7 % for BoneView; and 85.7 %, 94.6 % and 92.2 % for SmartUrgence.
Conclusion
In a real-life emergency setting, the performance of AI-aided radiologists in diagnosing bone fractures was excellent, and these radiologists outperformed AI solutions alone regardless of age and location.
期刊介绍:
European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field.
Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.