M.A. van den Berg , F. Boel , M.M.A. van Buuren , N.S. Riedstra , J. Tang , H. Ahedi , N. Arden , S.M.A. Bierma-Zeinstra , C.G. Boer , F.M. Cicuttini , T.F. Cootes , K.M. Crossley , D.T. Felson , W.P. Gielis , J.J. Heerey , G. Jones , S. Kluzek , N.E. Lane , C. Lindner , J.A. Lynch , R. Agricola
{"title":"ADVANCING HIP OSTEOARTHRITIS PREDICTION: INSIGHTS FROM MULTI-MODAL PREDICTIVE MODELING WITH INDIVIDUAL PARTICIPANT DATA OF THE WORLD COACH CONSORTIUM","authors":"M.A. van den Berg , F. Boel , M.M.A. van Buuren , N.S. Riedstra , J. Tang , H. Ahedi , N. Arden , S.M.A. Bierma-Zeinstra , C.G. Boer , F.M. Cicuttini , T.F. Cootes , K.M. Crossley , D.T. Felson , W.P. Gielis , J.J. Heerey , G. Jones , S. Kluzek , N.E. Lane , C. Lindner , J.A. Lynch , R. Agricola","doi":"10.1016/j.ostima.2025.100343","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>Radiographic hip osteoarthritis (RHOA) is a multifactorial disease, making early detection of individuals at risk challenging yet essential for timely intervention and evaluation of preventive strategies. Integrating information on multiple different data modalities using individual participant data from diverse cohorts may enhance predictive modeling in the early stages of RHOA. A focus on model interpretability may further enable the identification of clinically relevant patient subgroups and potential intervention targets.</div></div><div><h3>OBJECTIVE</h3><div>Creating a multi-modal prediction model for improving the performance of RHOA incidence prediction models compared to clinical features alone, and investigating the estimated predictor effects and the generalizability of the models to similar populations.</div></div><div><h3>METHODS</h3><div>We pooled individual participant data from nine prospective cohort studies within the Worldwide Collaboration on OsteoArthritis prediCtion for the Hip (World COACH consortium). All studies included standardized anteroposterior pelvic, long-limb, and/or hip radiographs, assessed for RHOA at baseline and after 4–8 years of follow-up. Incident RHOA was defined as the development of RHOA (grade ≥2) in hips without definite RHOA at baseline (grade <2). The original cohort values of clinical predictors including age, birth-assigned sex, body mass index (BMI), smoking status, diabetes, and hip pain were harmonized into one consistent dataset. X-ray-derived predictors describing the hip morphology, the alpha angle and the lateral center edge angle, were automatically and uniformly determined using automated landmark points placed with Bonefinder®. Additionally, the values of 13 shape modes explaining 85% of the variation from a landmark-based statistical shape model were included. This SSM was built on all baseline RHOA grade <2 hips within World COACH. Risk prediction models were built with generalized linear mixed effects models (GLMM) and Random Forest (RF) models while adjusting for correlations within cohorts and individuals. The discriminative performance (AUC) of different model configurations and the linear versus non-linear approaches were compared through stratified 5-fold cross-validation. For each model configuration, predictions were made with and without cohort labels to assess heterogeneity between cohorts.</div></div><div><h3>RESULTS</h3><div>In total, 29,110 hips without definite RHOA at baseline were included of which 5.0% developed RHOA within 4-8 years (mean age 63.7 (8.6) years, 75.5% female, mean BMI 27.5 (4.7) kg/m<sup>2</sup>). When comparing our uni-modal prediction model using only the clinical predictors (Model 1) to those with X-ray information added (Table 1), we observed a higher discriminative performance for the multi-modal models. Overall, including cohort information significantly improved model performance (p < 0.05), and the RF models have a slightly but not significantly better performance than the GLMMs. Comparing the average effects of the significant predictors of the models including all predictors on incident RHOA (Figure 1), showed most differences between the GLMM and RF estimated effects at the maximum and minimum predictor values.</div></div><div><h3>CONCLUSION</h3><div>By leveraging multi-modal data, we could improve our predictions of incident RHOA compared to clinical features alone. Our findings indicate that there would be a benefit for considering non-linear modeling approaches for this task in future work.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100343"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000832","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
INTRODUCTION
Radiographic hip osteoarthritis (RHOA) is a multifactorial disease, making early detection of individuals at risk challenging yet essential for timely intervention and evaluation of preventive strategies. Integrating information on multiple different data modalities using individual participant data from diverse cohorts may enhance predictive modeling in the early stages of RHOA. A focus on model interpretability may further enable the identification of clinically relevant patient subgroups and potential intervention targets.
OBJECTIVE
Creating a multi-modal prediction model for improving the performance of RHOA incidence prediction models compared to clinical features alone, and investigating the estimated predictor effects and the generalizability of the models to similar populations.
METHODS
We pooled individual participant data from nine prospective cohort studies within the Worldwide Collaboration on OsteoArthritis prediCtion for the Hip (World COACH consortium). All studies included standardized anteroposterior pelvic, long-limb, and/or hip radiographs, assessed for RHOA at baseline and after 4–8 years of follow-up. Incident RHOA was defined as the development of RHOA (grade ≥2) in hips without definite RHOA at baseline (grade <2). The original cohort values of clinical predictors including age, birth-assigned sex, body mass index (BMI), smoking status, diabetes, and hip pain were harmonized into one consistent dataset. X-ray-derived predictors describing the hip morphology, the alpha angle and the lateral center edge angle, were automatically and uniformly determined using automated landmark points placed with Bonefinder®. Additionally, the values of 13 shape modes explaining 85% of the variation from a landmark-based statistical shape model were included. This SSM was built on all baseline RHOA grade <2 hips within World COACH. Risk prediction models were built with generalized linear mixed effects models (GLMM) and Random Forest (RF) models while adjusting for correlations within cohorts and individuals. The discriminative performance (AUC) of different model configurations and the linear versus non-linear approaches were compared through stratified 5-fold cross-validation. For each model configuration, predictions were made with and without cohort labels to assess heterogeneity between cohorts.
RESULTS
In total, 29,110 hips without definite RHOA at baseline were included of which 5.0% developed RHOA within 4-8 years (mean age 63.7 (8.6) years, 75.5% female, mean BMI 27.5 (4.7) kg/m2). When comparing our uni-modal prediction model using only the clinical predictors (Model 1) to those with X-ray information added (Table 1), we observed a higher discriminative performance for the multi-modal models. Overall, including cohort information significantly improved model performance (p < 0.05), and the RF models have a slightly but not significantly better performance than the GLMMs. Comparing the average effects of the significant predictors of the models including all predictors on incident RHOA (Figure 1), showed most differences between the GLMM and RF estimated effects at the maximum and minimum predictor values.
CONCLUSION
By leveraging multi-modal data, we could improve our predictions of incident RHOA compared to clinical features alone. Our findings indicate that there would be a benefit for considering non-linear modeling approaches for this task in future work.