四肢转移性疾病术前生存预测机器学习算法的发展和内部验证。

Clinical Orthopaedics & Related Research Pub Date : 2020-02-01 DOI:10.1097/CORR.0000000000000997

Q. Thio, A. Karhade, Paul T. Ogink, J. Bramer, M. Ferrone, S. Calderón, K. Raskin, J. Schwab

{"title":"四肢转移性疾病术前生存预测机器学习算法的发展和内部验证。","authors":"Q. Thio, A. Karhade, Paul T. Ogink, J. Bramer, M. Ferrone, S. Calderón, K. Raskin, J. Schwab","doi":"10.1097/CORR.0000000000000997","DOIUrl":null,"url":null,"abstract":"BACKGROUND\nA preoperative estimation of survival is critical for deciding on the operative management of metastatic bone disease of the extremities. Several tools have been developed for this purpose, but there is room for improvement. Machine learning is an increasingly popular and flexible method of prediction model building based on a data set. It raises some skepticism, however, because of the complex structure of these models.\n\n\nQUESTIONS/PURPOSES\nThe purposes of this study were (1) to develop machine learning algorithms for 90-day and 1-year survival in patients who received surgical treatment for a bone metastasis of the extremity, and (2) to use these algorithms to identify those clinical factors (demographic, treatment related, or surgical) that are most closely associated with survival after surgery in these patients.\n\n\nMETHODS\nAll 1090 patients who underwent surgical treatment for a long-bone metastasis at two institutions between 1999 and 2017 were included in this retrospective study. The median age of the patients in the cohort was 63 years (interquartile range [IQR] 54 to 72 years), 56% of patients (610 of 1090) were female, and the median BMI was 27 kg/m (IQR 23 to 30 kg/m). The most affected location was the femur (70%), followed by the humerus (22%). The most common primary tumors were breast (24%) and lung (23%). Intramedullary nailing was the most commonly performed type of surgery (58%), followed by endoprosthetic reconstruction (22%), and plate screw fixation (14%). Missing data were imputed using the missForest methods. Features were selected by random forest algorithms, and five different models were developed on the training set (80% of the data): stochastic gradient boosting, random forest, support vector machine, neural network, and penalized logistic regression. These models were chosen as a result of their classification capability in binary datasets. Model performance was assessed on both the training set and the validation set (20% of the data) by discrimination, calibration, and overall performance.\n\n\nRESULTS\nWe found no differences among the five models for discrimination, with an area under the curve ranging from 0.86 to 0.87. All models were well calibrated, with intercepts ranging from -0.03 to 0.08 and slopes ranging from 1.03 to 1.12. Brier scores ranged from 0.13 to 0.14. The stochastic gradient boosting model was chosen to be deployed as freely available web-based application and explanations on both a global and an individual level were provided. For 90-day survival, the three most important factors associated with poorer survivorship were lower albumin level, higher neutrophil-to-lymphocyte ratio, and rapid growth primary tumor. For 1-year survival, the three most important factors associated with poorer survivorship were lower albumin level, rapid growth primary tumor, and lower hemoglobin level.\n\n\nCONCLUSIONS\nAlthough the final models must be externally validated, the algorithms showed good performance on internal validation. The final models have been incorporated into a freely accessible web application that can be found at https://sorg-apps.shinyapps.io/extremitymetssurvival/. Pending external validation, clinicians may use this tool to predict survival for their individual patients to help in shared treatment decision making.\n\n\nLEVEL OF EVIDENCE\nLevel III, therapeutic study.","PeriodicalId":10465,"journal":{"name":"Clinical Orthopaedics & Related Research","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":"{\"title\":\"Development and Internal Validation of Machine Learning Algorithms for Preoperative Survival Prediction of Extremity Metastatic Disease.\",\"authors\":\"Q. Thio, A. Karhade, Paul T. Ogink, J. Bramer, M. Ferrone, S. Calderón, K. Raskin, J. Schwab\",\"doi\":\"10.1097/CORR.0000000000000997\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\nA preoperative estimation of survival is critical for deciding on the operative management of metastatic bone disease of the extremities. Several tools have been developed for this purpose, but there is room for improvement. Machine learning is an increasingly popular and flexible method of prediction model building based on a data set. It raises some skepticism, however, because of the complex structure of these models.\\n\\n\\nQUESTIONS/PURPOSES\\nThe purposes of this study were (1) to develop machine learning algorithms for 90-day and 1-year survival in patients who received surgical treatment for a bone metastasis of the extremity, and (2) to use these algorithms to identify those clinical factors (demographic, treatment related, or surgical) that are most closely associated with survival after surgery in these patients.\\n\\n\\nMETHODS\\nAll 1090 patients who underwent surgical treatment for a long-bone metastasis at two institutions between 1999 and 2017 were included in this retrospective study. The median age of the patients in the cohort was 63 years (interquartile range [IQR] 54 to 72 years), 56% of patients (610 of 1090) were female, and the median BMI was 27 kg/m (IQR 23 to 30 kg/m). The most affected location was the femur (70%), followed by the humerus (22%). The most common primary tumors were breast (24%) and lung (23%). Intramedullary nailing was the most commonly performed type of surgery (58%), followed by endoprosthetic reconstruction (22%), and plate screw fixation (14%). Missing data were imputed using the missForest methods. Features were selected by random forest algorithms, and five different models were developed on the training set (80% of the data): stochastic gradient boosting, random forest, support vector machine, neural network, and penalized logistic regression. These models were chosen as a result of their classification capability in binary datasets. Model performance was assessed on both the training set and the validation set (20% of the data) by discrimination, calibration, and overall performance.\\n\\n\\nRESULTS\\nWe found no differences among the five models for discrimination, with an area under the curve ranging from 0.86 to 0.87. All models were well calibrated, with intercepts ranging from -0.03 to 0.08 and slopes ranging from 1.03 to 1.12. Brier scores ranged from 0.13 to 0.14. The stochastic gradient boosting model was chosen to be deployed as freely available web-based application and explanations on both a global and an individual level were provided. For 90-day survival, the three most important factors associated with poorer survivorship were lower albumin level, higher neutrophil-to-lymphocyte ratio, and rapid growth primary tumor. For 1-year survival, the three most important factors associated with poorer survivorship were lower albumin level, rapid growth primary tumor, and lower hemoglobin level.\\n\\n\\nCONCLUSIONS\\nAlthough the final models must be externally validated, the algorithms showed good performance on internal validation. The final models have been incorporated into a freely accessible web application that can be found at https://sorg-apps.shinyapps.io/extremitymetssurvival/. Pending external validation, clinicians may use this tool to predict survival for their individual patients to help in shared treatment decision making.\\n\\n\\nLEVEL OF EVIDENCE\\nLevel III, therapeutic study.\",\"PeriodicalId\":10465,\"journal\":{\"name\":\"Clinical Orthopaedics & Related Research\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"44\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Orthopaedics & Related Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1097/CORR.0000000000000997\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Orthopaedics & Related Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/CORR.0000000000000997","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 44

摘要

背景:术前生存评估是决定四肢转移性骨病手术治疗的关键。为此目的已经开发了一些工具，但仍有改进的余地。机器学习是一种日益流行和灵活的基于数据集建立预测模型的方法。然而，由于这些模型的复杂结构，它引起了一些怀疑。问题/目的本研究的目的是:(1)开发机器学习算法，用于评估接受手术治疗的肢体骨转移患者90天和1年的生存，(2)使用这些算法识别与这些患者手术后生存最密切相关的临床因素(人口统计学、治疗相关或手术)。方法:本回顾性研究纳入1999年至2017年间在两家医院接受长骨转移手术治疗的1090例患者。队列中患者的中位年龄为63岁(四分位数范围[IQR] 54 ~ 72岁)，56%的患者(1090例中有610例)为女性，中位BMI为27 kg/m (IQR 23 ~ 30 kg/m)。受影响最大的部位是股骨(70%)，其次是肱骨(22%)。最常见的原发肿瘤是乳腺(24%)和肺部(23%)。髓内钉是最常见的手术类型(58%)，其次是假体内重建(22%)和钢板螺钉固定(14%)。使用misforest方法估算缺失数据。通过随机森林算法选择特征，并在训练集(80%的数据)上开发了五种不同的模型:随机梯度增强、随机森林、支持向量机、神经网络和惩罚逻辑回归。选择这些模型是因为它们在二进制数据集中的分类能力。模型性能在训练集和验证集(20%的数据)上通过区分、校准和总体性能进行评估。结果5种模型的鉴别效果无显著差异，曲线下面积在0.86 ~ 0.87之间。所有模型都经过了很好的校准，截距范围为-0.03至0.08，斜率范围为1.03至1.12。Brier评分范围为0.13 ~ 0.14。选择随机梯度增强模型作为免费的网络应用程序进行部署，并提供了全球和个人层面的解释。对于90天的生存期，与较差的生存期相关的三个最重要的因素是较低的白蛋白水平，较高的中性粒细胞与淋巴细胞比率和快速生长的原发肿瘤。对于1年生存率而言，与较差生存率相关的三个最重要因素是较低的白蛋白水平、快速生长的原发肿瘤和较低的血红蛋白水平。结论虽然最终模型需要外部验证，但算法在内部验证中表现出良好的性能。最终的模型已经被整合到一个免费访问的web应用程序中，可以在https://sorg-apps.shinyapps.io/extremitymetssurvival/上找到。等待外部验证，临床医生可以使用该工具来预测个体患者的生存，以帮助共同的治疗决策。证据等级:III级，治疗性研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Development and Internal Validation of Machine Learning Algorithms for Preoperative Survival Prediction of Extremity Metastatic Disease.

BACKGROUND A preoperative estimation of survival is critical for deciding on the operative management of metastatic bone disease of the extremities. Several tools have been developed for this purpose, but there is room for improvement. Machine learning is an increasingly popular and flexible method of prediction model building based on a data set. It raises some skepticism, however, because of the complex structure of these models. QUESTIONS/PURPOSES The purposes of this study were (1) to develop machine learning algorithms for 90-day and 1-year survival in patients who received surgical treatment for a bone metastasis of the extremity, and (2) to use these algorithms to identify those clinical factors (demographic, treatment related, or surgical) that are most closely associated with survival after surgery in these patients. METHODS All 1090 patients who underwent surgical treatment for a long-bone metastasis at two institutions between 1999 and 2017 were included in this retrospective study. The median age of the patients in the cohort was 63 years (interquartile range [IQR] 54 to 72 years), 56% of patients (610 of 1090) were female, and the median BMI was 27 kg/m (IQR 23 to 30 kg/m). The most affected location was the femur (70%), followed by the humerus (22%). The most common primary tumors were breast (24%) and lung (23%). Intramedullary nailing was the most commonly performed type of surgery (58%), followed by endoprosthetic reconstruction (22%), and plate screw fixation (14%). Missing data were imputed using the missForest methods. Features were selected by random forest algorithms, and five different models were developed on the training set (80% of the data): stochastic gradient boosting, random forest, support vector machine, neural network, and penalized logistic regression. These models were chosen as a result of their classification capability in binary datasets. Model performance was assessed on both the training set and the validation set (20% of the data) by discrimination, calibration, and overall performance. RESULTS We found no differences among the five models for discrimination, with an area under the curve ranging from 0.86 to 0.87. All models were well calibrated, with intercepts ranging from -0.03 to 0.08 and slopes ranging from 1.03 to 1.12. Brier scores ranged from 0.13 to 0.14. The stochastic gradient boosting model was chosen to be deployed as freely available web-based application and explanations on both a global and an individual level were provided. For 90-day survival, the three most important factors associated with poorer survivorship were lower albumin level, higher neutrophil-to-lymphocyte ratio, and rapid growth primary tumor. For 1-year survival, the three most important factors associated with poorer survivorship were lower albumin level, rapid growth primary tumor, and lower hemoglobin level. CONCLUSIONS Although the final models must be externally validated, the algorithms showed good performance on internal validation. The final models have been incorporated into a freely accessible web application that can be found at https://sorg-apps.shinyapps.io/extremitymetssurvival/. Pending external validation, clinicians may use this tool to predict survival for their individual patients to help in shared treatment decision making. LEVEL OF EVIDENCE Level III, therapeutic study.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Clinical Orthopaedics & Related Research

自引率

0.00%

发文量