Predicting total knee replacement at 2 and 5 years in osteoarthritis patients using machine learning.

Impact factor: 1.6; JCR: Q2 (Surgery)

BMJ Surgery Interventions Health Technologies. Pub Date: 2023-01-01. DOI: 10.1136/bmjsit-2022-000141

Khadija Mahmoud, M Abdulhadi Alagha, Zuzanna Nowinka, Gareth Jones

Volume 5, issue 1, article e000141. Publication type: Journal Article. Full text (PMC): https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/18/4d/bmjsit-2022-000141.PMC9933661.pdf

Citations: 0

Abstract

Objectives: Knee osteoarthritis is a major cause of physical disability and reduced quality of life, with end-stage disease often treated by total knee replacement (TKR). We set out to develop and externally validate a machine learning model capable of predicting the need for a TKR in 2 and 5 years' time using routinely collected health data.

Design: A prospective study using the Osteoarthritis Initiative (OAI) and Multicentre Osteoarthritis Study (MOST) datasets. OAI data were used to train the models, while MOST data formed the external test set. The data were preprocessed using feature selection to curate 45 candidate features covering demographics, medical history, imaging assessments, history of intervention and outcome.

Setting: The study was conducted using two multicentre USA-based datasets of participants with or at high risk of knee OA.

Participants: The study excluded participants with at least one existing TKR. The OAI dataset included participants aged 45-79 years, of whom 3234 were used for training and 809 for internal testing. The MOST dataset included participants aged 50-79 years, of whom 2248 were used for external testing.
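The cohort sizes above imply a particular split of the OAI data. The short sketch below only reproduces that arithmetic from the reported counts. The abstract does not state the intended split ratio or how participants were allocated, so the variable names and the "roughly 80/20" reading are assumptions made for illustration.

```python
# Cohort sizes as reported in the abstract.
oai_train = 3234           # OAI participants used for training
oai_internal_test = 809    # OAI participants held out for internal testing
most_external_test = 2248  # MOST participants used for external testing

# Total OAI participants that entered the modelling pipeline.
oai_total = oai_train + oai_internal_test

# Share of OAI data used for training (the ratio itself is inferred, not reported).
train_fraction = oai_train / oai_total

print(f"OAI participants used: {oai_total}")
print(f"Training fraction: {train_fraction:.3f}")
print(f"Internal test fraction: {1 - train_fraction:.3f}")
print(f"External test set (MOST): {most_external_test}")
```

Read this way, the internal test set is about one fifth of the OAI data, while the external MOST test set is more than half the size of the whole OAI sample.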

Main outcome measures: The primary outcome of this study was prediction of TKR onset at 2 and 5 years. Performance was evaluated using area under the curve (AUC) and F1-score, and key predictors were identified.
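For readers unfamiliar with the two metrics, the following is a minimal pure-Python sketch of how AUC and F1-score can be computed for a binary outcome such as "TKR within the horizon". It is not the authors' evaluation code. The four labels and risk scores are invented toy values, and the 0.5 decision threshold is an assumption, since the abstract does not report the threshold used for the F1-score.

```python
def auc_score(y_true, scores):
    """AUC as the probability that a random positive case is scored
    above a random negative case (ties count as half a win)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def f1_score(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy data (hypothetical): 1 = TKR occurred within the horizon, 0 = it did not.
y_true = [0, 0, 1, 1]
risk = [0.1, 0.4, 0.35, 0.8]   # hypothetical predicted risks
threshold = 0.5                 # assumed cut-off for turning risks into labels
y_pred = [1 if r >= threshold else 0 for r in risk]

auc = auc_score(y_true, risk)
f1 = f1_score(y_true, y_pred)
print(f"AUC = {auc:.3f}, F1 = {f1:.3f}")
```

AUC depends only on how the model ranks participants, whereas F1 depends on the chosen threshold, which is one reason the two are usually reported together when the outcome is uncommon.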

Results: For the best performing model (gradient boosting machine), the AUC at 2 years was 0.913 (95% CI 0.876 to 0.951), and at 5 years 0.873 (95% CI 0.839 to 0.907). Radiograph-derived features and questionnaire-based assessments, alongside the patient's educational attainment, were key predictors for these models.
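The reported intervals can be sanity-checked with simple arithmetic. The sketch below computes each interval's midpoint and half-width and, purely as an illustration, the standard error that a symmetric normal-approximation interval would imply. The abstract does not say how the confidence intervals were derived, so the implied standard error is an assumption-dependent back-calculation, not a reported result.

```python
Z_95 = 1.96  # normal quantile for a two-sided 95% interval (assumed method)

# (point estimate, lower bound, upper bound) as reported in the abstract.
reported = {
    "2 years": (0.913, 0.876, 0.951),
    "5 years": (0.873, 0.839, 0.907),
}


def summarise_interval(lower, upper, z=Z_95):
    """Return midpoint, half-width and the standard error implied by a
    symmetric interval of the form estimate +/- z * SE."""
    midpoint = (lower + upper) / 2
    half_width = (upper - lower) / 2
    return midpoint, half_width, half_width / z


summary = {}
for horizon, (auc, lower, upper) in reported.items():
    summary[horizon] = summarise_interval(lower, upper)
    midpoint, half_width, implied_se = summary[horizon]
    print(
        f"{horizon}: reported AUC {auc:.3f}, interval midpoint {midpoint:.4f}, "
        f"half-width {half_width:.4f}, implied SE {implied_se:.4f}"
    )
```

Both midpoints sit at or within rounding distance of the reported point estimates, which is what a symmetric interval would produce.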

Conclusions: Our approach suggests that routinely collected patient data are sufficient to drive a predictive model with a clinically acceptable level of accuracy (AUC>0.7), and the resulting model is the first such tool to be externally validated. This level of accuracy is higher than that of previously published models utilising MRI data, which are not routinely collected.


