A Deep Learning-Based Clinical Classification System for the Differential Diagnosis of Hip Prosthesis Failures Using Radiographs: A Multicenter Study.

Limin Wu,Biao Wang,Bin Lin,Mingyang Li,Yuangang Wu,Haibo Si,Yi Zeng,Liangji Lu,Lulu Gao,Zheting Chen,Risheng Yu,Liang Zhao,Yong Nie,Kang Li,Bin Shen
{"title":"A Deep Learning-Based Clinical Classification System for the Differential Diagnosis of Hip Prosthesis Failures Using Radiographs: A Multicenter Study.","authors":"Limin Wu,Biao Wang,Bin Lin,Mingyang Li,Yuangang Wu,Haibo Si,Yi Zeng,Liangji Lu,Lulu Gao,Zheting Chen,Risheng Yu,Liang Zhao,Yong Nie,Kang Li,Bin Shen","doi":"10.2106/jbjs.24.01601","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nAccurate and timely differential diagnosis of hip prosthesis failures remains a major clinical challenge. Radiographic examination remains the most cost-effective and common first-line imaging modality for hip prostheses, and integrating deep learning has the potential to improve its diagnostic accuracy and efficiency.\r\n\r\nMETHODS\r\nA deep learning-based clinical classification system (Hip-Net) was developed to classify multiple causes of total hip arthroplasty failure, including periprosthetic joint infection (PJI), aseptic loosening, dislocation, periprosthetic fracture, and polyethylene wear. Hip-Net employed a dual-channel ensemble of 4 deep learning models trained on 2,908 routine dual-view (anteroposterior and lateral) radiographs for 1,454 patients (Asian) across 3 medical centers. An interpretive subnetwork generated spatially resolved disease probability maps. Discrimination performance and interpretability were tested in external and prospective cohorts, respectively. The correlation between model-generated individual PJI risk and inflammatory biomarkers was assessed.\r\n\r\nRESULTS\r\nHip-Net demonstrated strong generalizability across different settings, effectively distinguishing between 5 common types of hip prosthesis failures with an accuracy of 0.904 (95% confidence interval [CI], 0.894 to 0.914) and an area under the receiver operating characteristic curve (AUC) of 0.937 (95% CI, 0.925 to 0.948) in the external cohort. The spatially resolved disease-probability maps for PJI closely aligned with intraoperative and pathological findings. The model-generated individual PJI risk scores exhibited a positive correlation with the C-reactive protein (CRP) level and erythrocyte sedimentation rate (ESR).\r\n\r\nCONCLUSIONS\r\nHip-Net provided a clinically applicable strategy for accurately classifying and characterizing multiple etiologies of hip prosthesis failure. Such an approach is highly beneficial for providing interpretable, pathology-aligned probability maps that enhance the understanding of PJI. Its integration into clinical workflows may streamline decision-making and improve patient outcomes.\r\n\r\nLEVEL OF EVIDENCE\r\nPrognostic Level III. See Instructions for Authors for a complete description of levels of evidence.","PeriodicalId":22625,"journal":{"name":"The Journal of Bone & Joint Surgery","volume":"13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Bone & Joint Surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2106/jbjs.24.01601","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

BACKGROUND Accurate and timely differential diagnosis of hip prosthesis failures remains a major clinical challenge. Radiographic examination remains the most cost-effective and common first-line imaging modality for hip prostheses, and integrating deep learning has the potential to improve its diagnostic accuracy and efficiency. METHODS A deep learning-based clinical classification system (Hip-Net) was developed to classify multiple causes of total hip arthroplasty failure, including periprosthetic joint infection (PJI), aseptic loosening, dislocation, periprosthetic fracture, and polyethylene wear. Hip-Net employed a dual-channel ensemble of 4 deep learning models trained on 2,908 routine dual-view (anteroposterior and lateral) radiographs for 1,454 patients (Asian) across 3 medical centers. An interpretive subnetwork generated spatially resolved disease probability maps. Discrimination performance and interpretability were tested in external and prospective cohorts, respectively. The correlation between model-generated individual PJI risk and inflammatory biomarkers was assessed. RESULTS Hip-Net demonstrated strong generalizability across different settings, effectively distinguishing between 5 common types of hip prosthesis failures with an accuracy of 0.904 (95% confidence interval [CI], 0.894 to 0.914) and an area under the receiver operating characteristic curve (AUC) of 0.937 (95% CI, 0.925 to 0.948) in the external cohort. The spatially resolved disease-probability maps for PJI closely aligned with intraoperative and pathological findings. The model-generated individual PJI risk scores exhibited a positive correlation with the C-reactive protein (CRP) level and erythrocyte sedimentation rate (ESR). CONCLUSIONS Hip-Net provided a clinically applicable strategy for accurately classifying and characterizing multiple etiologies of hip prosthesis failure. Such an approach is highly beneficial for providing interpretable, pathology-aligned probability maps that enhance the understanding of PJI. Its integration into clinical workflows may streamline decision-making and improve patient outcomes. LEVEL OF EVIDENCE Prognostic Level III. See Instructions for Authors for a complete description of levels of evidence.
基于深度学习的临床分类系统用于髋关节假体失效的x线鉴别诊断:一项多中心研究。
背景准确和及时的鉴别诊断髋关节假体失败仍然是一个主要的临床挑战。放射学检查仍然是髋关节假体最具成本效益和最常见的一线成像方式,整合深度学习有可能提高其诊断的准确性和效率。方法采用基于深度学习的临床分类系统(hip - net)对全髋关节置换术失败的多种原因进行分类,包括假体周围关节感染(PJI)、无菌性松动、脱位、假体周围骨折和聚乙烯磨损。Hip-Net采用了由4个深度学习模型组成的双通道集合,对3个医疗中心的1454名患者(亚洲)的2,908张常规双视图(正位和侧位)x线片进行了训练。一个解释子网络生成空间分解的疾病概率图。分别在外部和前瞻性队列中测试了歧视表现和可解释性。评估模型生成的个体PJI风险与炎症生物标志物之间的相关性。结果ship - net在不同情况下表现出很强的泛化性,在外部队列中有效区分5种常见的髋关节假体失败类型,准确率为0.904(95%可信区间[CI], 0.894 ~ 0.914),受者工作特征曲线下面积(AUC)为0.937 (95% CI, 0.925 ~ 0.948)。PJI的空间分辨率疾病概率图与术中和病理结果密切相关。模型生成的个体PJI风险评分与c反应蛋白(CRP)水平和红细胞沉降率(ESR)呈正相关。结论ship - net为准确分类和描述髋关节假体失效的多种病因提供了一种临床适用的策略。这种方法非常有利于提供可解释的、病理一致的概率图,从而增强对PJI的理解。将其整合到临床工作流程中可以简化决策并改善患者的预后。证据水平:预后III级。有关证据水平的完整描述,请参见作者说明。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信