迈向可信赖的肌肉骨骼医学人工智能:不确定性量化的叙述性回顾。

IF 5 2区 医学 Q1 ORTHOPEDICS
Amir M. Vahdani, Moein Shariatnia, Pranav Rajpurkar, Ayoosh Pareek
{"title":"迈向可信赖的肌肉骨骼医学人工智能:不确定性量化的叙述性回顾。","authors":"Amir M. Vahdani,&nbsp;Moein Shariatnia,&nbsp;Pranav Rajpurkar,&nbsp;Ayoosh Pareek","doi":"10.1002/ksa.12737","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Introduction</h3>\n \n <p>Deep learning (DL) models have achieved remarkable performance in musculoskeletal (MSK) medical imaging research, yet their clinical integration remains hindered by their black-box nature and the absence of reliable confidence measures. Uncertainty quantification (UQ) seeks to bridge this gap by providing each DL prediction with a calibrated estimate of uncertainty, thereby fostering clinician trust and safer deployment.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We conducted a targeted narrative review, performing expert-driven searches in PubMed, Scopus, and arXiv and mining references from relevant publications in MSK imaging utilizing UQ, and a thematic synthesis was used to derive a cohesive taxonomy of UQ methodologies.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>UQ approaches encompass multi-pass methods (e.g., test-time augmentation, Monte Carlo dropout, and model ensembling) that infer uncertainty from variability across repeated inferences; single-pass methods (e.g., conformal prediction, and evidential deep learning) that augment each individual prediction with uncertainty metrics; and other techniques that leverage auxiliary information, such as inter-rater variability, hidden-layer activations, or generative reconstruction errors, to estimate confidence. Applications in MSK imaging, include highlighting uncertain areas in cartilage segmentation and identifying uncertain predictions in joint implant design detections; downstream applications include enhanced clinical utility and more efficient data annotation pipelines.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Embedding UQ into DL workflows is essential for translating high-performance models into clinical practice. Future research should prioritize robust out-of-distribution handling, computational efficiency, and standardized evaluation metrics to accelerate the adoption of trustworthy AI in MSK medicine.</p>\n </section>\n \n <section>\n \n <h3> Level of Evidence</h3>\n \n <p>Not applicable.</p>\n </section>\n </div>","PeriodicalId":17880,"journal":{"name":"Knee Surgery, Sports Traumatology, Arthroscopy","volume":"33 9","pages":"3418-3437"},"PeriodicalIF":5.0000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards trustworthy artificial intelligence in musculoskeletal medicine: A narrative review on uncertainty quantification\",\"authors\":\"Amir M. Vahdani,&nbsp;Moein Shariatnia,&nbsp;Pranav Rajpurkar,&nbsp;Ayoosh Pareek\",\"doi\":\"10.1002/ksa.12737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Introduction</h3>\\n \\n <p>Deep learning (DL) models have achieved remarkable performance in musculoskeletal (MSK) medical imaging research, yet their clinical integration remains hindered by their black-box nature and the absence of reliable confidence measures. Uncertainty quantification (UQ) seeks to bridge this gap by providing each DL prediction with a calibrated estimate of uncertainty, thereby fostering clinician trust and safer deployment.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>We conducted a targeted narrative review, performing expert-driven searches in PubMed, Scopus, and arXiv and mining references from relevant publications in MSK imaging utilizing UQ, and a thematic synthesis was used to derive a cohesive taxonomy of UQ methodologies.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>UQ approaches encompass multi-pass methods (e.g., test-time augmentation, Monte Carlo dropout, and model ensembling) that infer uncertainty from variability across repeated inferences; single-pass methods (e.g., conformal prediction, and evidential deep learning) that augment each individual prediction with uncertainty metrics; and other techniques that leverage auxiliary information, such as inter-rater variability, hidden-layer activations, or generative reconstruction errors, to estimate confidence. Applications in MSK imaging, include highlighting uncertain areas in cartilage segmentation and identifying uncertain predictions in joint implant design detections; downstream applications include enhanced clinical utility and more efficient data annotation pipelines.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>Embedding UQ into DL workflows is essential for translating high-performance models into clinical practice. Future research should prioritize robust out-of-distribution handling, computational efficiency, and standardized evaluation metrics to accelerate the adoption of trustworthy AI in MSK medicine.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Level of Evidence</h3>\\n \\n <p>Not applicable.</p>\\n </section>\\n </div>\",\"PeriodicalId\":17880,\"journal\":{\"name\":\"Knee Surgery, Sports Traumatology, Arthroscopy\",\"volume\":\"33 9\",\"pages\":\"3418-3437\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-07-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knee Surgery, Sports Traumatology, Arthroscopy\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://esskajournals.onlinelibrary.wiley.com/doi/10.1002/ksa.12737\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ORTHOPEDICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knee Surgery, Sports Traumatology, Arthroscopy","FirstCategoryId":"3","ListUrlMain":"https://esskajournals.onlinelibrary.wiley.com/doi/10.1002/ksa.12737","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0

摘要

深度学习(DL)模型在肌肉骨骼(MSK)医学成像研究中取得了显著的成绩,但由于其黑箱性质和缺乏可靠的置信度措施,它们的临床整合仍然受到阻碍。不确定性量化(UQ)旨在通过为每个深度学习预测提供校准的不确定性估计来弥合这一差距,从而促进临床医生的信任和更安全的部署。方法:我们进行了有针对性的叙述性回顾,在PubMed、Scopus和arXiv中进行专家驱动的搜索,并利用UQ从MSK成像的相关出版物中挖掘参考文献,并使用主题综合来获得UQ方法的内聚分类法。结果:UQ方法包括多通道方法(例如,测试时间增强,蒙特卡罗退出和模型集成),从反复推断的可变性中推断不确定性;用不确定性度量增强每个单独预测的单次方法(例如,保形预测和证据深度学习);以及其他利用辅助信息的技术,如评级间变异性、隐藏层激活或生成重建误差,以估计置信度。MSK成像的应用包括突出软骨分割中的不确定区域和识别关节植入物设计检测中的不确定预测;下游应用程序包括增强的临床实用程序和更高效的数据注释管道。结论:将UQ嵌入到DL工作流中对于将高性能模型转化为临床实践至关重要。未来的研究应优先考虑健壮的配送外处理、计算效率和标准化评估指标,以加速在MSK医学中采用可信赖的人工智能。证据等级:不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Towards trustworthy artificial intelligence in musculoskeletal medicine: A narrative review on uncertainty quantification

Towards trustworthy artificial intelligence in musculoskeletal medicine: A narrative review on uncertainty quantification

Towards trustworthy artificial intelligence in musculoskeletal medicine: A narrative review on uncertainty quantification

Towards trustworthy artificial intelligence in musculoskeletal medicine: A narrative review on uncertainty quantification

Towards trustworthy artificial intelligence in musculoskeletal medicine: A narrative review on uncertainty quantification

Introduction

Deep learning (DL) models have achieved remarkable performance in musculoskeletal (MSK) medical imaging research, yet their clinical integration remains hindered by their black-box nature and the absence of reliable confidence measures. Uncertainty quantification (UQ) seeks to bridge this gap by providing each DL prediction with a calibrated estimate of uncertainty, thereby fostering clinician trust and safer deployment.

Methods

We conducted a targeted narrative review, performing expert-driven searches in PubMed, Scopus, and arXiv and mining references from relevant publications in MSK imaging utilizing UQ, and a thematic synthesis was used to derive a cohesive taxonomy of UQ methodologies.

Results

UQ approaches encompass multi-pass methods (e.g., test-time augmentation, Monte Carlo dropout, and model ensembling) that infer uncertainty from variability across repeated inferences; single-pass methods (e.g., conformal prediction, and evidential deep learning) that augment each individual prediction with uncertainty metrics; and other techniques that leverage auxiliary information, such as inter-rater variability, hidden-layer activations, or generative reconstruction errors, to estimate confidence. Applications in MSK imaging, include highlighting uncertain areas in cartilage segmentation and identifying uncertain predictions in joint implant design detections; downstream applications include enhanced clinical utility and more efficient data annotation pipelines.

Conclusion

Embedding UQ into DL workflows is essential for translating high-performance models into clinical practice. Future research should prioritize robust out-of-distribution handling, computational efficiency, and standardized evaluation metrics to accelerate the adoption of trustworthy AI in MSK medicine.

Level of Evidence

Not applicable.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.10
自引率
18.40%
发文量
418
审稿时长
2 months
期刊介绍: Few other areas of orthopedic surgery and traumatology have undergone such a dramatic evolution in the last 10 years as knee surgery, arthroscopy and sports traumatology. Ranked among the top 33% of journals in both Orthopedics and Sports Sciences, the goal of this European journal is to publish papers about innovative knee surgery, sports trauma surgery and arthroscopy. Each issue features a series of peer-reviewed articles that deal with diagnosis and management and with basic research. Each issue also contains at least one review article about an important clinical problem. Case presentations or short notes about technical innovations are also accepted for publication. The articles cover all aspects of knee surgery and all types of sports trauma; in addition, epidemiology, diagnosis, treatment and prevention, and all types of arthroscopy (not only the knee but also the shoulder, elbow, wrist, hip, ankle, etc.) are addressed. Articles on new diagnostic techniques such as MRI and ultrasound and high-quality articles about the biomechanics of joints, muscles and tendons are included. Although this is largely a clinical journal, it is also open to basic research with clinical relevance. Because the journal is supported by a distinguished European Editorial Board, assisted by an international Advisory Board, you can be assured that the journal maintains the highest standards. Official Clinical Journal of the European Society of Sports Traumatology, Knee Surgery and Arthroscopy (ESSKA).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信