Elbow trauma in children: development and evaluation of radiological artificial intelligence models

Clémence ROZWAG , Franck VALENTINI , Anne COTTEN , Xavier DEMONDION , Philippe PREUX , Thibaut JACQUES
{"title":"Elbow trauma in children: development and evaluation of radiological artificial intelligence models","authors":"Clémence ROZWAG ,&nbsp;Franck VALENTINI ,&nbsp;Anne COTTEN ,&nbsp;Xavier DEMONDION ,&nbsp;Philippe PREUX ,&nbsp;Thibaut JACQUES","doi":"10.1016/j.redii.2023.100029","DOIUrl":null,"url":null,"abstract":"<div><h3>Rationale and Objectives</h3><p>To develop a model using artificial intelligence (A.I.) able to detect post-traumatic injuries on pediatric elbow X-rays then to evaluate its performances in silico and its impact on radiologists’ interpretation in clinical practice.</p></div><div><h3>Material and Methods</h3><p>A total of 1956 pediatric elbow radiographs performed following a trauma were retrospectively collected from 935 patients aged between 0 and 18 years. Deep convolutional neural networks were trained on these X-rays. The two best models were selected then evaluated on an external test set involving 120 patients, whose X-rays were performed on a different radiological equipment in another time period. Eight radiologists interpreted this external test set without then with the help of the A.I. models .</p></div><div><h3>Results</h3><p>Two models stood out: model 1 had an accuracy of 95.8% and an AUROC of 0.983 and model 2 had an accuracy of 90.5% and an AUROC of 0.975. On the external test set, model 1 kept a good accuracy of 82.5% and AUROC of 0.916 while model 2 had a loss of accuracy down to 69.2% and of AUROC to 0.793. Model 1 significantly improved radiologist's sensitivity (0.82 to 0.88, <em>P</em> = 0.016) and accuracy (0.86 to 0.88, <em>P</em> = 0,047) while model 2 significantly decreased specificity of readers (0.86 to 0.83, <em>P</em> = 0.031).</p></div><div><h3>Conclusion</h3><p>End-to-end development of a deep learning model to assess post-traumatic injuries on elbow X-ray in children was feasible and showed that models with close metrics in silico can unpredictably lead radiologists to either improve or lower their performances in clinical settings.</p></div>","PeriodicalId":74676,"journal":{"name":"Research in diagnostic and interventional imaging","volume":"6 ","pages":"Article 100029"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in diagnostic and interventional imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277265252300008X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Rationale and Objectives

To develop a model using artificial intelligence (A.I.) able to detect post-traumatic injuries on pediatric elbow X-rays then to evaluate its performances in silico and its impact on radiologists’ interpretation in clinical practice.

Material and Methods

A total of 1956 pediatric elbow radiographs performed following a trauma were retrospectively collected from 935 patients aged between 0 and 18 years. Deep convolutional neural networks were trained on these X-rays. The two best models were selected then evaluated on an external test set involving 120 patients, whose X-rays were performed on a different radiological equipment in another time period. Eight radiologists interpreted this external test set without then with the help of the A.I. models .

Results

Two models stood out: model 1 had an accuracy of 95.8% and an AUROC of 0.983 and model 2 had an accuracy of 90.5% and an AUROC of 0.975. On the external test set, model 1 kept a good accuracy of 82.5% and AUROC of 0.916 while model 2 had a loss of accuracy down to 69.2% and of AUROC to 0.793. Model 1 significantly improved radiologist's sensitivity (0.82 to 0.88, P = 0.016) and accuracy (0.86 to 0.88, P = 0,047) while model 2 significantly decreased specificity of readers (0.86 to 0.83, P = 0.031).

Conclusion

End-to-end development of a deep learning model to assess post-traumatic injuries on elbow X-ray in children was feasible and showed that models with close metrics in silico can unpredictably lead radiologists to either improve or lower their performances in clinical settings.

儿童肘部创伤:放射学人工智能模型的建立与评价
原理和目的使用人工智能(a.I.)开发一个模型,该模型能够在儿童肘部X光片上检测创伤后损伤,然后评估其在计算机上的表现及其对放射科医生在临床实践中的解释的影响。材料和方法回顾性收集935名年龄在0至18岁之间的儿童在创伤后进行的1956张肘部x线片。在这些X射线上训练了深度卷积神经网络。选择两个最佳模型,然后在一个涉及120名患者的外部测试集上进行评估,这些患者在另一个时间段内在不同的放射设备上进行X光检查。8名放射科医生在人工智能模型的帮助下对该外部测试集进行了解释。在外部测试集上,模型1保持了82.5%的良好精度和0.916的AUROC,而模型2的精度下降到69.2%,AUROC下降到0.793。模型1显著提高了放射科医生的敏感性(0.82~0.88,P=0.016)和准确性(0.86~0.88,P=0.047),而模型2显著降低了读者的特异性(0.86~ 0.83,P=0.031)不可预测地导致放射科医生在临床环境中提高或降低他们的表现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信