Ghee Rye Lee, Adam E Flanders, Tyler Richards, Felipe Kitamura, Errol Colak, Hui Ming Lin, Robyn L Ball, Jason Talbott, Luciano M Prevedello
下载PDF
{"title":"Performance of the Winning Algorithms of the RSNA 2022 Cervical Spine Fracture Detection Challenge.","authors":"Ghee Rye Lee, Adam E Flanders, Tyler Richards, Felipe Kitamura, Errol Colak, Hui Ming Lin, Robyn L Ball, Jason Talbott, Luciano M Prevedello","doi":"10.1148/ryai.230256","DOIUrl":null,"url":null,"abstract":"<p><p>Purpose To evaluate and report the performance of the winning algorithms of the Radiological Society of North America Cervical Spine Fracture AI Challenge. Materials and Methods The competition was open to the public on Kaggle from July 28 to October 27, 2022. A sample of 3112 CT scans with and without cervical spine fractures (CSFx) were assembled from multiple sites (12 institutions across six continents) and prepared for the competition. The test set had 1093 scans (private test set: <i>n</i> = 789; mean age, 53.40 years ± 22.86 [SD]; 509 males; public test set: <i>n</i> = 304; mean age, 52.51 years ± 20.73; 189 males) and 847 fractures. The eight top-performing artificial intelligence (AI) algorithms were retrospectively evaluated, and the area under the receiver operating characteristic curve (AUC) value, F1 score, sensitivity, and specificity were calculated. Results A total of 1108 contestants composing 883 teams worldwide participated in the competition. The top eight AI models showed high performance, with a mean AUC value of 0.96 (95% CI: 0.95, 0.96), mean F1 score of 90% (95% CI: 90%, 91%), mean sensitivity of 88% (95% Cl: 86%, 90%), and mean specificity of 94% (95% CI: 93%, 96%). The highest values reported for previous models were an AUC of 0.85, F1 score of 81%, sensitivity of 76%, and specificity of 97%. Conclusion The competition successfully facilitated the development of AI models that could detect and localize CSFx on CT scans with high performance outcomes, which appear to exceed known values of previously reported models. Further study is needed to evaluate the generalizability of these models in a clinical environment. <b>Keywords:</b> Cervical Spine, Fracture Detection, Machine Learning, Artificial Intelligence Algorithms, CT, Head/Neck <i>Supplemental material is available for this article.</i> © RSNA, 2024.</p>","PeriodicalId":29787,"journal":{"name":"Radiology-Artificial Intelligence","volume":" ","pages":"e230256"},"PeriodicalIF":8.1000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10831508/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiology-Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1148/ryai.230256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
引用
批量引用
Abstract
Purpose To evaluate and report the performance of the winning algorithms of the Radiological Society of North America Cervical Spine Fracture AI Challenge. Materials and Methods The competition was open to the public on Kaggle from July 28 to October 27, 2022. A sample of 3112 CT scans with and without cervical spine fractures (CSFx) were assembled from multiple sites (12 institutions across six continents) and prepared for the competition. The test set had 1093 scans (private test set: n = 789; mean age, 53.40 years ± 22.86 [SD]; 509 males; public test set: n = 304; mean age, 52.51 years ± 20.73; 189 males) and 847 fractures. The eight top-performing artificial intelligence (AI) algorithms were retrospectively evaluated, and the area under the receiver operating characteristic curve (AUC) value, F1 score, sensitivity, and specificity were calculated. Results A total of 1108 contestants composing 883 teams worldwide participated in the competition. The top eight AI models showed high performance, with a mean AUC value of 0.96 (95% CI: 0.95, 0.96), mean F1 score of 90% (95% CI: 90%, 91%), mean sensitivity of 88% (95% Cl: 86%, 90%), and mean specificity of 94% (95% CI: 93%, 96%). The highest values reported for previous models were an AUC of 0.85, F1 score of 81%, sensitivity of 76%, and specificity of 97%. Conclusion The competition successfully facilitated the development of AI models that could detect and localize CSFx on CT scans with high performance outcomes, which appear to exceed known values of previously reported models. Further study is needed to evaluate the generalizability of these models in a clinical environment. Keywords: Cervical Spine, Fracture Detection, Machine Learning, Artificial Intelligence Algorithms, CT, Head/Neck Supplemental material is available for this article. © RSNA, 2024.
RSNA 2022 年颈椎骨折检测挑战赛获奖算法的性能。
"刚刚接受 "的论文经过同行评审,已被接受在《放射学》上发表:人工智能》上发表。这篇文章在以最终版本发表之前,还将经过校对、排版和校对审核。请注意,在制作最终校对稿的过程中,可能会发现影响内容的错误。目的 评估并报告北美放射学会颈椎骨折人工智能(AI)挑战赛获奖算法的性能。材料与方法 比赛于 2022 年 7 月 28 日至 10 月 27 日在 Kaggle 上向公众开放。从多个地点(横跨 6 大洲的 12 家机构)收集了 3,112 份有无颈椎骨折的 CT 扫描图像,并为比赛做好了准备。测试集有 1,093 份扫描(私人测试集:n= 789;平均年龄 53.40 ± [SD] 22.86 岁;509 名男性;公共测试集:n = 304;平均年龄 52.51 ± 20.73 岁;189 名男性)和 847 处骨折。对表现最好的 8 种算法进行了回顾性评估,并报告了接收者操作特征曲线下面积(AUC)值、F1 分数、灵敏度和特异性。结果 全球共有 883 个团队的 1 108 名选手参加了比赛。前 8 名的人工智能模型表现出较高的平均水平:AUC值为0.96(95% CI为0.95-0.96);F1得分率为90%(95% CI为90%-91%);灵敏度为88%(95% Cl为86%-90%),特异度为94%(95% CI为93%-96%)。以往模型的 AUC 为 0.85,F1 得分为 81%,灵敏度为 76%,特异性为 97%。结论 本次竞赛成功地促进了人工智能模型的开发,这些模型可以在 CT 上检测和定位颈椎骨折,并具有较高的性能结果,似乎超过了以前报告的模型的已知值。需要进一步研究以评估其在临床环境中的通用性。©RSNA,2024。
本文章由计算机程序翻译,如有差异,请以英文原文为准。