Real-life performance of AI-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures in appendicular skeletal trauma

IF 3.3 3区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Amandine Crombé , Alexandre Ben Cheikh , Mylène Seux , Eric Stéphant , Julien May , Olivier Preteseille , Adrien Vague , Frédéric Nativel , Matthieu Daniel , Johan Etievant , Jessica Aristizabal , Antoine Perrey , Guillaume Gorincour
{"title":"Real-life performance of AI-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures in appendicular skeletal trauma","authors":"Amandine Crombé ,&nbsp;Alexandre Ben Cheikh ,&nbsp;Mylène Seux ,&nbsp;Eric Stéphant ,&nbsp;Julien May ,&nbsp;Olivier Preteseille ,&nbsp;Adrien Vague ,&nbsp;Frédéric Nativel ,&nbsp;Matthieu Daniel ,&nbsp;Johan Etievant ,&nbsp;Jessica Aristizabal ,&nbsp;Antoine Perrey ,&nbsp;Guillaume Gorincour","doi":"10.1016/j.ejrad.2025.112469","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>To compare the performance of artificial intelligence (AI)-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures.</div></div><div><h3>Materials and Methods</h3><div>Consecutive patients treated at two centres for appendicular skeletal traumatic injury between January and April 2021 who underwent X-ray imaging and whose initial conclusions were available and prospectively encoded by emergency physicians, were also prospectively analysed via two AI solutions (BoneView and SmartUrgence) available for the real-life interpretation of AI-aided radiologists. The ground truth was retrospectively assessed by 5 senior musculoskeletal radiologists who were aware of all the clinical, radiological and AI data. Numbers of suspected fractures, true positives and false positives per AI were compared. Diagnostic performance metrics (sensitivity, specificity, positive and negative predictive values and accuracy with 95% confidence intervals) for detecting fractures were estimated for each interpretation (emergency physician, BoneView, SmartUrgence, AI-aided radiologist).</div></div><div><h3>Results</h3><div>969 patients with 1049 radiography sets were included, 287 of whom had fractures (27.4 %). The average number of any fracture and true positive fractures were greater with BoneView than with SmartUrgence (P = 0.0469 and P = 0.0022, respectively). The real-life sensitivity, specificity and accuracy for detecting fracture in the entire cohort were 93 %, 99 % and 97.6 % for AI-aided radiologists; 80.8 %, 97.6 % and 93 % for emergency physicians; 89.5 %, 93.8 % and 92.7 % for BoneView; and 85.7 %, 94.6 % and 92.2 % for SmartUrgence.</div></div><div><h3>Conclusion</h3><div>In a real-life emergency setting, the performance of AI-aided radiologists in diagnosing bone fractures was excellent, and these radiologists outperformed AI solutions alone regardless of age and location.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"194 ","pages":"Article 112469"},"PeriodicalIF":3.3000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X25005558","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives

To compare the performance of artificial intelligence (AI)-aided radiologists, emergency physicians and two AI solutions for diagnosing bone fractures.

Materials and Methods

Consecutive patients treated at two centres for appendicular skeletal traumatic injury between January and April 2021 who underwent X-ray imaging and whose initial conclusions were available and prospectively encoded by emergency physicians, were also prospectively analysed via two AI solutions (BoneView and SmartUrgence) available for the real-life interpretation of AI-aided radiologists. The ground truth was retrospectively assessed by 5 senior musculoskeletal radiologists who were aware of all the clinical, radiological and AI data. Numbers of suspected fractures, true positives and false positives per AI were compared. Diagnostic performance metrics (sensitivity, specificity, positive and negative predictive values and accuracy with 95% confidence intervals) for detecting fractures were estimated for each interpretation (emergency physician, BoneView, SmartUrgence, AI-aided radiologist).

Results

969 patients with 1049 radiography sets were included, 287 of whom had fractures (27.4 %). The average number of any fracture and true positive fractures were greater with BoneView than with SmartUrgence (P = 0.0469 and P = 0.0022, respectively). The real-life sensitivity, specificity and accuracy for detecting fracture in the entire cohort were 93 %, 99 % and 97.6 % for AI-aided radiologists; 80.8 %, 97.6 % and 93 % for emergency physicians; 89.5 %, 93.8 % and 92.7 % for BoneView; and 85.7 %, 94.6 % and 92.2 % for SmartUrgence.

Conclusion

In a real-life emergency setting, the performance of AI-aided radiologists in diagnosing bone fractures was excellent, and these radiologists outperformed AI solutions alone regardless of age and location.

Abstract Image

人工智能辅助放射科医生、急诊医生和两种人工智能解决方案诊断阑尾骨骼创伤骨折的真实表现。
目的:比较人工智能(AI)辅助放射科医生、急诊医生和两种人工智能解决方案在骨折诊断中的表现。材料和方法:2021年1月至4月期间在两个中心连续治疗的阑尾骨骼创伤性损伤患者接受了x射线成像,急诊医生可获得初步结论并对其进行前瞻性编码,也可通过两种人工智能解决方案(BoneView和smarturence)进行前瞻性分析,用于人工智能辅助放射科医生的现实解释。5名了解所有临床、放射学和人工智能数据的高级肌肉骨骼放射科医生对基本事实进行了回顾性评估。比较每个AI的疑似骨折数量、真阳性和假阳性。评估每种解释(急诊医生、BoneView、smarturence、ai辅助放射科医生)检测骨折的诊断性能指标(敏感性、特异性、阳性和阴性预测值以及95%置信区间的准确性)。结果:纳入969例患者1049组x线片,其中骨折287例(27.4%)。BoneView的平均任意骨折数和真阳性骨折数均大于smarturence (P = 0.0469和P = 0.0022)。人工智能辅助放射科医师在整个队列中检测骨折的实际敏感性、特异性和准确性分别为93%、99%和97.6%;急诊医师占80.8%、97.6%和93%;BoneView分别为89.5%、93.8%和92.7%;smarturence分别占85.7%、94.6%和92.2%。结论:在现实急诊环境中,人工智能辅助放射科医生诊断骨折的表现非常出色,无论年龄和位置如何,这些放射科医生的表现都优于单独使用人工智能解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.70
自引率
3.00%
发文量
398
审稿时长
42 days
期刊介绍: European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field. Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信