Aneysis D Gonzalez-Suarez, Paymon G Rezaii, Daniel Herrick, Seth Stravers Tigchelaar, John K Ratliff, Mirabela Rusu, David Scheinker, Ikchan Jeon, Atman M Desai
{"title":"使用机器学习模型识别颈椎后路融合术后 30 天再入院的相关因素:纵向队列研究","authors":"Aneysis D Gonzalez-Suarez, Paymon G Rezaii, Daniel Herrick, Seth Stravers Tigchelaar, John K Ratliff, Mirabela Rusu, David Scheinker, Ikchan Jeon, Atman M Desai","doi":"10.14245/ns.2347340.670","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Readmission rates after posterior cervical fusion (PCF) significantly impact patients and healthcare, with complication rates at 15%-25% and up to 12% 90-day readmission rates. In this study, we aim to test whether machine learning (ML) models that capture interfactorial interactions outperform traditional logistic regression (LR) in identifying readmission-associated factors.</p><p><strong>Methods: </strong>The Optum Clinformatics Data Mart database was used to identify patients who underwent PCF between 2004-2017. To determine factors associated with 30-day readmissions, 5 ML models were generated and evaluated, including a multivariate LR (MLR) model. Then, the best-performing model, Gradient Boosting Machine (GBM), was compared to the LACE (Length patient stay in the hospital, Acuity of admission of patient in the hospital, Comorbidity, and Emergency visit) index regarding potential cost savings from algorithm implementation.</p><p><strong>Results: </strong>This study included 4,130 patients, 874 of which were readmitted within 30 days. When analyzed and scaled, we found that patient discharge status, comorbidities, and number of procedure codes were factors that influenced MLR, while patient discharge status, billed admission charge, and length of stay influenced the GBM model. The GBM model significantly outperformed MLR in predicting unplanned readmissions (mean area under the receiver operating characteristic curve, 0.846 vs. 0.829; p < 0.001), while also projecting an average cost savings of 50% more than the LACE index.</p><p><strong>Conclusion: </strong>Five models (GBM, XGBoost [extreme gradient boosting], RF [random forest], LASSO [least absolute shrinkage and selection operator], and MLR) were evaluated, among which, the GBM model exhibited superior predictive performance, robustness, and accuracy. Factors associated with readmissions impact LR and GBM models differently, suggesting that these models can be used complementarily. When analyzing PCF procedures, the GBM model resulted in greater predictive performance and was associated with higher theoretical cost savings for readmissions associated with PCF complications.</p>","PeriodicalId":19269,"journal":{"name":"Neurospine","volume":null,"pages":null},"PeriodicalIF":3.8000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11224744/pdf/","citationCount":"0","resultStr":"{\"title\":\"Using Machine Learning Models to Identify Factors Associated With 30-Day Readmissions After Posterior Cervical Fusions: A Longitudinal Cohort Study.\",\"authors\":\"Aneysis D Gonzalez-Suarez, Paymon G Rezaii, Daniel Herrick, Seth Stravers Tigchelaar, John K Ratliff, Mirabela Rusu, David Scheinker, Ikchan Jeon, Atman M Desai\",\"doi\":\"10.14245/ns.2347340.670\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Readmission rates after posterior cervical fusion (PCF) significantly impact patients and healthcare, with complication rates at 15%-25% and up to 12% 90-day readmission rates. In this study, we aim to test whether machine learning (ML) models that capture interfactorial interactions outperform traditional logistic regression (LR) in identifying readmission-associated factors.</p><p><strong>Methods: </strong>The Optum Clinformatics Data Mart database was used to identify patients who underwent PCF between 2004-2017. To determine factors associated with 30-day readmissions, 5 ML models were generated and evaluated, including a multivariate LR (MLR) model. Then, the best-performing model, Gradient Boosting Machine (GBM), was compared to the LACE (Length patient stay in the hospital, Acuity of admission of patient in the hospital, Comorbidity, and Emergency visit) index regarding potential cost savings from algorithm implementation.</p><p><strong>Results: </strong>This study included 4,130 patients, 874 of which were readmitted within 30 days. When analyzed and scaled, we found that patient discharge status, comorbidities, and number of procedure codes were factors that influenced MLR, while patient discharge status, billed admission charge, and length of stay influenced the GBM model. The GBM model significantly outperformed MLR in predicting unplanned readmissions (mean area under the receiver operating characteristic curve, 0.846 vs. 0.829; p < 0.001), while also projecting an average cost savings of 50% more than the LACE index.</p><p><strong>Conclusion: </strong>Five models (GBM, XGBoost [extreme gradient boosting], RF [random forest], LASSO [least absolute shrinkage and selection operator], and MLR) were evaluated, among which, the GBM model exhibited superior predictive performance, robustness, and accuracy. Factors associated with readmissions impact LR and GBM models differently, suggesting that these models can be used complementarily. When analyzing PCF procedures, the GBM model resulted in greater predictive performance and was associated with higher theoretical cost savings for readmissions associated with PCF complications.</p>\",\"PeriodicalId\":19269,\"journal\":{\"name\":\"Neurospine\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11224744/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurospine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.14245/ns.2347340.670\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/5/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurospine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.14245/ns.2347340.670","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/20 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:颈椎后路融合术(PCF)后的再入院率对患者和医疗保健产生了重大影响,并发症发生率为 15%-5%,90 天再入院率高达 12%。在本研究中,我们旨在检验在识别再入院相关因素方面,捕捉因素间相互作用的机器学习(ML)模型是否优于传统的逻辑回归(LR):方法: Optum Clinformatics Data Mart 数据库用于识别 2004-2017 年间接受 PCF 的患者。为确定与 30 天再入院相关的因素,生成并评估了 5 个 ML 模型,包括一个多变量 LR (MLR) 模型。然后,将表现最佳的梯度提升机(GBM)模型与 LACE(患者住院时间、患者入院时的严重程度、合并症和急诊就诊)指数进行比较,以了解实施算法后可能节省的成本:这项研究包括 4,130 名患者,其中 874 人在 30 天内再次入院。经过分析和扩展,我们发现患者出院状态、合并症和手术代码数量是影响 MLR 的因素,而患者出院状态、收费入院费用和住院时间则影响 GBM 模型。在预测非计划再入院方面,GBM 模型的表现明显优于 MLR(接收者操作特征曲线下的平均面积为 0.846 vs. 0.829; p结论:评估了五种模型(GBM、XGBoost[极端梯度提升]、RF[随机森林]、LASSO[最小绝对收缩和选择算子]和 MLR),其中 GBM 模型在预测性能、稳健性和准确性方面都更胜一筹。与再入院相关的因素对 LR 模型和 GBM 模型的影响不同,这表明这些模型可以互补使用。在分析 PCF 程序时,GBM 模型具有更高的预测性能,而且与 PCF 并发症相关的再入院理论成本节约也更高。
Using Machine Learning Models to Identify Factors Associated With 30-Day Readmissions After Posterior Cervical Fusions: A Longitudinal Cohort Study.
Objective: Readmission rates after posterior cervical fusion (PCF) significantly impact patients and healthcare, with complication rates at 15%-25% and up to 12% 90-day readmission rates. In this study, we aim to test whether machine learning (ML) models that capture interfactorial interactions outperform traditional logistic regression (LR) in identifying readmission-associated factors.
Methods: The Optum Clinformatics Data Mart database was used to identify patients who underwent PCF between 2004-2017. To determine factors associated with 30-day readmissions, 5 ML models were generated and evaluated, including a multivariate LR (MLR) model. Then, the best-performing model, Gradient Boosting Machine (GBM), was compared to the LACE (Length patient stay in the hospital, Acuity of admission of patient in the hospital, Comorbidity, and Emergency visit) index regarding potential cost savings from algorithm implementation.
Results: This study included 4,130 patients, 874 of which were readmitted within 30 days. When analyzed and scaled, we found that patient discharge status, comorbidities, and number of procedure codes were factors that influenced MLR, while patient discharge status, billed admission charge, and length of stay influenced the GBM model. The GBM model significantly outperformed MLR in predicting unplanned readmissions (mean area under the receiver operating characteristic curve, 0.846 vs. 0.829; p < 0.001), while also projecting an average cost savings of 50% more than the LACE index.
Conclusion: Five models (GBM, XGBoost [extreme gradient boosting], RF [random forest], LASSO [least absolute shrinkage and selection operator], and MLR) were evaluated, among which, the GBM model exhibited superior predictive performance, robustness, and accuracy. Factors associated with readmissions impact LR and GBM models differently, suggesting that these models can be used complementarily. When analyzing PCF procedures, the GBM model resulted in greater predictive performance and was associated with higher theoretical cost savings for readmissions associated with PCF complications.