PREDICTING KNEE OSTEOARTHRITIS PROGRESSION USING EXPLAINABLE MACHINE LEARNING AND CLINICAL IMAGING DATA

Osteoarthritis imaging Pub Date : 2025-01-01 DOI:10.1016/j.ostima.2025.100348

R.E. Harari , J. Collins , S.E. Smith , S. Wells , J. Duryea

{"title":"PREDICTING KNEE OSTEOARTHRITIS PROGRESSION USING EXPLAINABLE MACHINE LEARNING AND CLINICAL IMAGING DATA","authors":"R.E. Harari , J. Collins , S.E. Smith , S. Wells , J. Duryea","doi":"10.1016/j.ostima.2025.100348","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>Accurate prediction of knee osteoarthritis (KOA) progression remains a clinical challenge due to its heterogeneous nature and discordance between structural and symptomatic outcomes. Integrated imaging and machine learning (ML) approaches may enhance prognostic modeling but often suffer from limited interpretability or reliance on static features.</div></div><div><h3>OBJECTIVE</h3><div>We aim to develop explainable ML models for predicting KOA progression using baseline and longitudinal imaging and clinical features. This study also aims to identify key imaging biomarkers associated with structural and symptomatic progression.</div></div><div><h3>METHODS</h3><div>Data and 3T MRI measurements from 600 participants in the FNIH OA Biomarkers Consortium were analyzed. Participants were grouped into four progression categories based on 48-month joint space narrowing and WOMAC pain: (1) radiographic + pain progressors, (2) radiographic-only, (3) pain-only, and (4) non-progressors. Two binary classification frameworks were defined: (1) radiographic + pain vs. all others (primary), and (2) all radiographic progressors vs. pain-only + non-progressors (secondary). ML models included Random Forest, XGBoost, logistic regression, decision tree, and multilayer perceptron (MLP). The model used demographic information and imaging features from semi-automated segmentation software. We measured the volume of medial compartment femur cartilage (Cart), bone marrow lesion (BML) in the MF, LF, MT, LT, patella, and trochlea, osteophytes (Ost) in the MF, LF, MT, and LT, Hoffa’s synovitis (HS), and effusion/synovitis (ES). Longitudinal delta values were computed over 24 months. Performance was assessed via 10-fold stratified cross-validation (AUC, F1-score). Explainability tools included SHAP, Gini importance, coefficients, and permutation importance.</div></div><div><h3>RESULTS</h3><div>In the cross-sectional setting, the Random Forest classifier achieved the highest discrimination performance, with AUC values of 0.672 for the primary task (radiographic + pain progressors vs. others) and 0.791 for the secondary task (all radiographic progressors vs. others). The MLP model showed similar results in the secondary task (AUC = 0.743). AUC performance metrics for all models are shown in Table 1. Model performance improved notably when incorporating 24-month changes in imaging features. In the longitudinal analysis, Random Forest again performed best in the secondary task (AUC = 0.873), followed by XGBoost and MLP. The strongest predictors in these models were changes in medial femoral cartilage thickness, medial tibial bone marrow lesions, and osteophyte scores. To better understand the basis of model predictions, we applied four feature ranking methods. Among them, the SHAP method produced the most consistent and clinically interpretable results. As an example, shown in Figure 1 which show top 15 important features, SHAP highlighted 24-month reductions in cartilage thickness and increases in bone marrow lesion scores as the most influential variables, especially in the medial compartment.</div></div><div><h3>CONCLUSION</h3><div>Explainable ML models can identify individuals at risk of KOA progression using multimodal data. Longitudinal imaging features enhanced predictive power, and transparent interpretation techniques revealed important markers of joint deterioration.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100348"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

INTRODUCTION

Accurate prediction of knee osteoarthritis (KOA) progression remains a clinical challenge due to its heterogeneous nature and discordance between structural and symptomatic outcomes. Integrated imaging and machine learning (ML) approaches may enhance prognostic modeling but often suffer from limited interpretability or reliance on static features.

OBJECTIVE

We aim to develop explainable ML models for predicting KOA progression using baseline and longitudinal imaging and clinical features. This study also aims to identify key imaging biomarkers associated with structural and symptomatic progression.

METHODS

Data and 3T MRI measurements from 600 participants in the FNIH OA Biomarkers Consortium were analyzed. Participants were grouped into four progression categories based on 48-month joint space narrowing and WOMAC pain: (1) radiographic + pain progressors, (2) radiographic-only, (3) pain-only, and (4) non-progressors. Two binary classification frameworks were defined: (1) radiographic + pain vs. all others (primary), and (2) all radiographic progressors vs. pain-only + non-progressors (secondary). ML models included Random Forest, XGBoost, logistic regression, decision tree, and multilayer perceptron (MLP). The model used demographic information and imaging features from semi-automated segmentation software. We measured the volume of medial compartment femur cartilage (Cart), bone marrow lesion (BML) in the MF, LF, MT, LT, patella, and trochlea, osteophytes (Ost) in the MF, LF, MT, and LT, Hoffa’s synovitis (HS), and effusion/synovitis (ES). Longitudinal delta values were computed over 24 months. Performance was assessed via 10-fold stratified cross-validation (AUC, F1-score). Explainability tools included SHAP, Gini importance, coefficients, and permutation importance.

RESULTS

In the cross-sectional setting, the Random Forest classifier achieved the highest discrimination performance, with AUC values of 0.672 for the primary task (radiographic + pain progressors vs. others) and 0.791 for the secondary task (all radiographic progressors vs. others). The MLP model showed similar results in the secondary task (AUC = 0.743). AUC performance metrics for all models are shown in Table 1. Model performance improved notably when incorporating 24-month changes in imaging features. In the longitudinal analysis, Random Forest again performed best in the secondary task (AUC = 0.873), followed by XGBoost and MLP. The strongest predictors in these models were changes in medial femoral cartilage thickness, medial tibial bone marrow lesions, and osteophyte scores. To better understand the basis of model predictions, we applied four feature ranking methods. Among them, the SHAP method produced the most consistent and clinically interpretable results. As an example, shown in Figure 1 which show top 15 important features, SHAP highlighted 24-month reductions in cartilage thickness and increases in bone marrow lesion scores as the most influential variables, especially in the medial compartment.

CONCLUSION

Explainable ML models can identify individuals at risk of KOA progression using multimodal data. Longitudinal imaging features enhanced predictive power, and transparent interpretation techniques revealed important markers of joint deterioration.

查看原文本刊更多论文

使用可解释的机器学习和临床影像数据预测膝关节骨关节炎的进展

准确预测膝骨关节炎（KOA）的进展仍然是一个临床挑战，因为它的异质性和结构和症状结果之间的不一致。集成成像和机器学习（ML）方法可以增强预后建模，但通常存在可解释性有限或依赖静态特征的问题。目的：建立可解释的ML模型，利用基线、纵向成像和临床特征预测KOA的进展。本研究还旨在确定与结构和症状进展相关的关键成像生物标志物。方法分析FNIH OA生物标志物联盟600名参与者的数据和3T MRI测量结果。参与者根据48个月关节间隙狭窄和WOMAC疼痛分为四个进展类别：(1)放射学 + 疼痛进展者，(2)单纯放射学，(3)单纯疼痛，(4)无进展者。定义了两个二元分类框架：(1)放射学 + 疼痛与所有其他（主要），(2)所有放射学进展者与仅疼痛 + 非进展者（次要）。机器学习模型包括随机森林、XGBoost、逻辑回归、决策树和多层感知器（MLP）。该模型使用了半自动分割软件的人口统计信息和图像特征。我们测量了内侧室股骨软骨（Cart）的体积，MF、LF、MT、LT、髌骨和滑车的骨髓病变（BML）， MF、LF、MT和LT的骨赘（Ost）， Hoffa滑膜炎（HS）和积液/滑膜炎（ES）。在24个月内计算纵向delta值。通过10倍分层交叉验证（AUC， f1评分）评估绩效。可解释性工具包括SHAP、基尼重要性、系数和排列重要性。结果在横断面设置中，随机森林分类器取得了最高的识别性能，主要任务（放射学 + 疼痛进展者与其他）的AUC值为0.672，次要任务（所有放射学进展者与其他）的AUC值为0.791。MLP模型在次要任务上也有类似的结果（AUC = 0.743）。表1显示了所有模型的AUC性能指标。当纳入24个月的成像特征变化时，模型性能显著提高。在纵向分析中，Random Forest在次要任务中仍然表现最好（AUC = 0.873），其次是XGBoost和MLP。这些模型中最强的预测因子是股骨内侧软骨厚度、胫骨内侧骨髓病变和骨赘评分的变化。为了更好地理解模型预测的基础，我们应用了四种特征排序方法。其中，SHAP方法的结果最一致，且具有临床可解释性。例如，如图1所示，它显示了前15个重要特征，SHAP突出了24个月软骨厚度的减少和骨髓病变评分的增加，这是最具影响力的变量，特别是在内侧室。结论可解释的ML模型可以使用多模态数据识别有KOA进展风险的个体。纵向成像特征增强了预测能力，透明的解释技术揭示了关节恶化的重要标志。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Osteoarthritis imaging Radiology and Imaging

自引率

0.00%

发文量