Khadija Mahmoud, M Abdulhadi Alagha, Zuzanna Nowinka, Gareth Jones
{"title":"Predicting total knee replacement at 2 and 5 years in osteoarthritis patients using machine learning.","authors":"Khadija Mahmoud, M Abdulhadi Alagha, Zuzanna Nowinka, Gareth Jones","doi":"10.1136/bmjsit-2022-000141","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Knee osteoarthritis is a major cause of physical disability and reduced quality of life, with end-stage disease often treated by total knee replacement (TKR). We set out to develop and externally validate a machine learning model capable of predicting the need for a TKR in 2 and 5 years time using routinely collected health data.</p><p><strong>Design: </strong>A prospective study using datasets Osteoarthritis Initiative (OAI) and the Multicentre Osteoarthritis Study (MOST). OAI data were used to train the models while MOST data formed the external test set. The data were preprocessed using feature selection to curate 45 candidate features including demographics, medical history, imaging assessments, history of intervention and outcome.</p><p><strong>Setting: </strong>The study was conducted using two multicentre USA-based datasets of participants with or at high risk of knee OA.</p><p><strong>Participants: </strong>The study excluded participants with at least one existing TKR. OAI dataset included participants aged 45-79 years of which 3234 were used for training and 809 for internal testing, while MOST involved participants aged 50-79 and 2248 were used for external testing.</p><p><strong>Main outcome measures: </strong>The primary outcome of this study was prediction of TKR onset at 2 and 5 years. Performance was evaluated using area under the curve (AUC) and F1-score and key predictors identified.</p><p><strong>Results: </strong>For the best performing model (gradient boosting machine), the AUC at 2 years was 0.913 (95% CI 0.876 to 0.951), and at 5 years 0.873 (95% CI 0.839 to 0.907). Radiographic-derived features, questionnaire-based assessments alongside the patient's educational attainment were key predictors for these models.</p><p><strong>Conclusions: </strong>Our approach suggests that routinely collected patient data are sufficient to drive a predictive model with a clinically acceptable level of accuracy (AUC>0.7) and is the first such tool to be externally validated. This level of accuracy is higher than previously published models utilising MRI data, which is not routinely collected.</p>","PeriodicalId":33349,"journal":{"name":"BMJ Surgery Interventions Health Technologies","volume":"5 1","pages":"e000141"},"PeriodicalIF":2.1000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/18/4d/bmjsit-2022-000141.PMC9933661.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Surgery Interventions Health Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjsit-2022-000141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Knee osteoarthritis is a major cause of physical disability and reduced quality of life, with end-stage disease often treated by total knee replacement (TKR). We set out to develop and externally validate a machine learning model capable of predicting the need for a TKR in 2 and 5 years time using routinely collected health data.
Design: A prospective study using datasets Osteoarthritis Initiative (OAI) and the Multicentre Osteoarthritis Study (MOST). OAI data were used to train the models while MOST data formed the external test set. The data were preprocessed using feature selection to curate 45 candidate features including demographics, medical history, imaging assessments, history of intervention and outcome.
Setting: The study was conducted using two multicentre USA-based datasets of participants with or at high risk of knee OA.
Participants: The study excluded participants with at least one existing TKR. OAI dataset included participants aged 45-79 years of which 3234 were used for training and 809 for internal testing, while MOST involved participants aged 50-79 and 2248 were used for external testing.
Main outcome measures: The primary outcome of this study was prediction of TKR onset at 2 and 5 years. Performance was evaluated using area under the curve (AUC) and F1-score and key predictors identified.
Results: For the best performing model (gradient boosting machine), the AUC at 2 years was 0.913 (95% CI 0.876 to 0.951), and at 5 years 0.873 (95% CI 0.839 to 0.907). Radiographic-derived features, questionnaire-based assessments alongside the patient's educational attainment were key predictors for these models.
Conclusions: Our approach suggests that routinely collected patient data are sufficient to drive a predictive model with a clinically acceptable level of accuracy (AUC>0.7) and is the first such tool to be externally validated. This level of accuracy is higher than previously published models utilising MRI data, which is not routinely collected.