基于深度学习的临床分类系统用于髋关节假体失效的x线鉴别诊断：一项多中心研究。

The Journal of Bone & Joint Surgery Pub Date : 2025-06-18 DOI:10.2106/jbjs.24.01601

Limin Wu,Biao Wang,Bin Lin,Mingyang Li,Yuangang Wu,Haibo Si,Yi Zeng,Liangji Lu,Lulu Gao,Zheting Chen,Risheng Yu,Liang Zhao,Yong Nie,Kang Li,Bin Shen

{"title":"基于深度学习的临床分类系统用于髋关节假体失效的x线鉴别诊断：一项多中心研究。","authors":"Limin Wu,Biao Wang,Bin Lin,Mingyang Li,Yuangang Wu,Haibo Si,Yi Zeng,Liangji Lu,Lulu Gao,Zheting Chen,Risheng Yu,Liang Zhao,Yong Nie,Kang Li,Bin Shen","doi":"10.2106/jbjs.24.01601","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nAccurate and timely differential diagnosis of hip prosthesis failures remains a major clinical challenge. Radiographic examination remains the most cost-effective and common first-line imaging modality for hip prostheses, and integrating deep learning has the potential to improve its diagnostic accuracy and efficiency.\r\n\r\nMETHODS\r\nA deep learning-based clinical classification system (Hip-Net) was developed to classify multiple causes of total hip arthroplasty failure, including periprosthetic joint infection (PJI), aseptic loosening, dislocation, periprosthetic fracture, and polyethylene wear. Hip-Net employed a dual-channel ensemble of 4 deep learning models trained on 2,908 routine dual-view (anteroposterior and lateral) radiographs for 1,454 patients (Asian) across 3 medical centers. An interpretive subnetwork generated spatially resolved disease probability maps. Discrimination performance and interpretability were tested in external and prospective cohorts, respectively. The correlation between model-generated individual PJI risk and inflammatory biomarkers was assessed.\r\n\r\nRESULTS\r\nHip-Net demonstrated strong generalizability across different settings, effectively distinguishing between 5 common types of hip prosthesis failures with an accuracy of 0.904 (95% confidence interval [CI], 0.894 to 0.914) and an area under the receiver operating characteristic curve (AUC) of 0.937 (95% CI, 0.925 to 0.948) in the external cohort. The spatially resolved disease-probability maps for PJI closely aligned with intraoperative and pathological findings. The model-generated individual PJI risk scores exhibited a positive correlation with the C-reactive protein (CRP) level and erythrocyte sedimentation rate (ESR).\r\n\r\nCONCLUSIONS\r\nHip-Net provided a clinically applicable strategy for accurately classifying and characterizing multiple etiologies of hip prosthesis failure. Such an approach is highly beneficial for providing interpretable, pathology-aligned probability maps that enhance the understanding of PJI. Its integration into clinical workflows may streamline decision-making and improve patient outcomes.\r\n\r\nLEVEL OF EVIDENCE\r\nPrognostic Level III. See Instructions for Authors for a complete description of levels of evidence.","PeriodicalId":22625,"journal":{"name":"The Journal of Bone & Joint Surgery","volume":"13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Deep Learning-Based Clinical Classification System for the Differential Diagnosis of Hip Prosthesis Failures Using Radiographs: A Multicenter Study.\",\"authors\":\"Limin Wu,Biao Wang,Bin Lin,Mingyang Li,Yuangang Wu,Haibo Si,Yi Zeng,Liangji Lu,Lulu Gao,Zheting Chen,Risheng Yu,Liang Zhao,Yong Nie,Kang Li,Bin Shen\",\"doi\":\"10.2106/jbjs.24.01601\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\r\\nAccurate and timely differential diagnosis of hip prosthesis failures remains a major clinical challenge. Radiographic examination remains the most cost-effective and common first-line imaging modality for hip prostheses, and integrating deep learning has the potential to improve its diagnostic accuracy and efficiency.\\r\\n\\r\\nMETHODS\\r\\nA deep learning-based clinical classification system (Hip-Net) was developed to classify multiple causes of total hip arthroplasty failure, including periprosthetic joint infection (PJI), aseptic loosening, dislocation, periprosthetic fracture, and polyethylene wear. Hip-Net employed a dual-channel ensemble of 4 deep learning models trained on 2,908 routine dual-view (anteroposterior and lateral) radiographs for 1,454 patients (Asian) across 3 medical centers. An interpretive subnetwork generated spatially resolved disease probability maps. Discrimination performance and interpretability were tested in external and prospective cohorts, respectively. The correlation between model-generated individual PJI risk and inflammatory biomarkers was assessed.\\r\\n\\r\\nRESULTS\\r\\nHip-Net demonstrated strong generalizability across different settings, effectively distinguishing between 5 common types of hip prosthesis failures with an accuracy of 0.904 (95% confidence interval [CI], 0.894 to 0.914) and an area under the receiver operating characteristic curve (AUC) of 0.937 (95% CI, 0.925 to 0.948) in the external cohort. The spatially resolved disease-probability maps for PJI closely aligned with intraoperative and pathological findings. The model-generated individual PJI risk scores exhibited a positive correlation with the C-reactive protein (CRP) level and erythrocyte sedimentation rate (ESR).\\r\\n\\r\\nCONCLUSIONS\\r\\nHip-Net provided a clinically applicable strategy for accurately classifying and characterizing multiple etiologies of hip prosthesis failure. Such an approach is highly beneficial for providing interpretable, pathology-aligned probability maps that enhance the understanding of PJI. Its integration into clinical workflows may streamline decision-making and improve patient outcomes.\\r\\n\\r\\nLEVEL OF EVIDENCE\\r\\nPrognostic Level III. See Instructions for Authors for a complete description of levels of evidence.\",\"PeriodicalId\":22625,\"journal\":{\"name\":\"The Journal of Bone & Joint Surgery\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Bone & Joint Surgery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2106/jbjs.24.01601\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Bone & Joint Surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2106/jbjs.24.01601","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

背景准确和及时的鉴别诊断髋关节假体失败仍然是一个主要的临床挑战。放射学检查仍然是髋关节假体最具成本效益和最常见的一线成像方式，整合深度学习有可能提高其诊断的准确性和效率。方法采用基于深度学习的临床分类系统（hip - net）对全髋关节置换术失败的多种原因进行分类，包括假体周围关节感染（PJI）、无菌性松动、脱位、假体周围骨折和聚乙烯磨损。Hip-Net采用了由4个深度学习模型组成的双通道集合，对3个医疗中心的1454名患者（亚洲）的2,908张常规双视图（正位和侧位）x线片进行了训练。一个解释子网络生成空间分解的疾病概率图。分别在外部和前瞻性队列中测试了歧视表现和可解释性。评估模型生成的个体PJI风险与炎症生物标志物之间的相关性。结果ship - net在不同情况下表现出很强的泛化性，在外部队列中有效区分5种常见的髋关节假体失败类型，准确率为0.904(95%可信区间[CI]， 0.894 ~ 0.914)，受者工作特征曲线下面积（AUC）为0.937 （95% CI, 0.925 ~ 0.948）。PJI的空间分辨率疾病概率图与术中和病理结果密切相关。模型生成的个体PJI风险评分与c反应蛋白（CRP）水平和红细胞沉降率（ESR）呈正相关。结论ship - net为准确分类和描述髋关节假体失效的多种病因提供了一种临床适用的策略。这种方法非常有利于提供可解释的、病理一致的概率图，从而增强对PJI的理解。将其整合到临床工作流程中可以简化决策并改善患者的预后。证据水平：预后III级。有关证据水平的完整描述，请参见作者说明。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Deep Learning-Based Clinical Classification System for the Differential Diagnosis of Hip Prosthesis Failures Using Radiographs: A Multicenter Study.

BACKGROUND Accurate and timely differential diagnosis of hip prosthesis failures remains a major clinical challenge. Radiographic examination remains the most cost-effective and common first-line imaging modality for hip prostheses, and integrating deep learning has the potential to improve its diagnostic accuracy and efficiency. METHODS A deep learning-based clinical classification system (Hip-Net) was developed to classify multiple causes of total hip arthroplasty failure, including periprosthetic joint infection (PJI), aseptic loosening, dislocation, periprosthetic fracture, and polyethylene wear. Hip-Net employed a dual-channel ensemble of 4 deep learning models trained on 2,908 routine dual-view (anteroposterior and lateral) radiographs for 1,454 patients (Asian) across 3 medical centers. An interpretive subnetwork generated spatially resolved disease probability maps. Discrimination performance and interpretability were tested in external and prospective cohorts, respectively. The correlation between model-generated individual PJI risk and inflammatory biomarkers was assessed. RESULTS Hip-Net demonstrated strong generalizability across different settings, effectively distinguishing between 5 common types of hip prosthesis failures with an accuracy of 0.904 (95% confidence interval [CI], 0.894 to 0.914) and an area under the receiver operating characteristic curve (AUC) of 0.937 (95% CI, 0.925 to 0.948) in the external cohort. The spatially resolved disease-probability maps for PJI closely aligned with intraoperative and pathological findings. The model-generated individual PJI risk scores exhibited a positive correlation with the C-reactive protein (CRP) level and erythrocyte sedimentation rate (ESR). CONCLUSIONS Hip-Net provided a clinically applicable strategy for accurately classifying and characterizing multiple etiologies of hip prosthesis failure. Such an approach is highly beneficial for providing interpretable, pathology-aligned probability maps that enhance the understanding of PJI. Its integration into clinical workflows may streamline decision-making and improve patient outcomes. LEVEL OF EVIDENCE Prognostic Level III. See Instructions for Authors for a complete description of levels of evidence.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Journal of Bone & Joint Surgery

自引率

0.00%

发文量