Comparable performance of machine learning algorithms in predicting readmission and complications following total joint arthroplasty with external validation.

IF 2.3 4区医学 Q2 ORTHOPEDICS

Arthroplasty Pub Date : 2023-11-08 DOI:10.1186/s42836-023-00208-0

Hashim J F Shaikh, Mina Botros, Gabriel Ramirez, Caroline P Thirukumaran, Benjamin Ricciardi, Thomas G Myers

{"title":"Comparable performance of machine learning algorithms in predicting readmission and complications following total joint arthroplasty with external validation.","authors":"Hashim J F Shaikh, Mina Botros, Gabriel Ramirez, Caroline P Thirukumaran, Benjamin Ricciardi, Thomas G Myers","doi":"10.1186/s42836-023-00208-0","DOIUrl":null,"url":null,"abstract":"Background: The purpose of the study was to use Machine Learning (ML) to construct a risk calculator for patients who undergo Total Joint Arthroplasty (TJA) on the basis of New York State Statewide Planning and Research Cooperative System (SPARCS) data and externally validate the calculator on a single TJA center.Methods: Seven ML algorithms, i.e., logistic regression, adaptive boosting, gradient boosting (Xg Boost), random forest (RF) classifier, support vector machine, and single and a five-layered neural network were trained on the derivation cohort. Models were trained on 68% of data, validated on 15%, tested on 15%, and externally validated on 2% of the data from a single arthroplasty center.Results: Validation of the models showed that the RF classifier performed best in terms of 30-d mortality AUROC (Area Under the Receiver Operating Characteristic) 0.78, 30-d readmission (AUROC 0.61) and 90-d composite complications (AUROC 0.73) amongst the test set. Additionally, Xg Boost was found to be the best predicting model for 90-d readmission and 90-d composite complications (AUC 0.73). External validation demonstrated that models achieved similar AUROCs to the test set although variation occurred in top model performance for 90-d composite complications and readmissions between our test and external validation set.Conclusion: This was the first study to investigate the use of ML to create a predictive risk calculator from state-wide data and then externally validate it with data from a single arthroplasty center. Discrimination between best performing ML models and between the test set and the external validation set are comparable.Level of evidence: III.","PeriodicalId":52831,"journal":{"name":"Arthroplasty","volume":"5 1","pages":"58"},"PeriodicalIF":2.3000,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10631030/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arthroplasty","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s42836-023-00208-0","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ORTHOPEDICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The purpose of the study was to use Machine Learning (ML) to construct a risk calculator for patients who undergo Total Joint Arthroplasty (TJA) on the basis of New York State Statewide Planning and Research Cooperative System (SPARCS) data and externally validate the calculator on a single TJA center.

Methods: Seven ML algorithms, i.e., logistic regression, adaptive boosting, gradient boosting (Xg Boost), random forest (RF) classifier, support vector machine, and single and a five-layered neural network were trained on the derivation cohort. Models were trained on 68% of data, validated on 15%, tested on 15%, and externally validated on 2% of the data from a single arthroplasty center.

Results: Validation of the models showed that the RF classifier performed best in terms of 30-d mortality AUROC (Area Under the Receiver Operating Characteristic) 0.78, 30-d readmission (AUROC 0.61) and 90-d composite complications (AUROC 0.73) amongst the test set. Additionally, Xg Boost was found to be the best predicting model for 90-d readmission and 90-d composite complications (AUC 0.73). External validation demonstrated that models achieved similar AUROCs to the test set although variation occurred in top model performance for 90-d composite complications and readmissions between our test and external validation set.

Conclusion: This was the first study to investigate the use of ML to create a predictive risk calculator from state-wide data and then externally validate it with data from a single arthroplasty center. Discrimination between best performing ML models and between the test set and the external validation set are comparable.

Level of evidence: III.

查看原文本刊更多论文

机器学习算法在预测全关节置换术后再次入院和并发症方面的可比较性能与外部验证。

背景：本研究的目的是使用机器学习（ML），在纽约州规划和研究合作系统（SPARCS）数据的基础上，为接受全关节置换术（TJA）的患者构建一个风险计算器，并在单个TJA中心对该计算器进行外部验证。方法：在推导队列上训练七种ML算法，即逻辑回归、自适应增强、梯度增强（Xg-Boost）、随机森林（RF）分类器、支持向量机以及单层和五层神经网络。模型根据68%的数据进行训练，15%进行验证，15%进行测试，2%的数据来自单个关节成形术中心进行外部验证。结果：模型的验证表明，在测试集中，RF分类器在30天死亡率AUROC（受试者操作特征下面积）0.78、30天再次入院（AUROC 0.61）和90天复合并发症（AUROC0.73）方面表现最好。此外，Xg Boost被发现是90-d再入院和90-d复合并发症的最佳预测模型（AUC 0.73）。外部验证表明，尽管我们的测试和外部验证集之间90-d复合复杂性和再入院的顶级模型性能发生了变化，但模型实现了与测试集相似的AUROC。结论：这是第一项研究使用ML根据全州数据创建预测风险计算器，然后用单个关节成形术中心的数据进行外部验证。性能最好的ML模型之间以及测试集和外部验证集之间的区别是可比较的。证据级别：三。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊