{"title":"Construction and validation of a readmission risk prediction model for elderly patients with coronary heart disease.","authors":"Hanyu Luo, Benlong Wang, Rui Cao, Jun Feng","doi":"10.3389/fcvm.2024.1497916","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>To investigate the risk factors for readmission of elderly patients with coronary artery disease, and to construct and validate a predictive model for readmission risk of elderly patients with coronary artery disease within 3 years by applying machine learning method.</p><p><strong>Methods: </strong>We selected 575 elderly patients with CHD admitted to the Affiliated Lu'an Hospital of Anhui Medical University from January 2020 to January 2023. Based on whether patients were readmitted within 3 years, they were divided into two groups: those readmitted within 3 years (215 patients) and those not readmitted within 3 years (360 patients). Lasso regression and multivariate logistic regression were used to compare the predictive value of these models. XGBoost, LR, RF, KNN and DT algorithms were used to build prediction models for readmission risk. ROC curves and calibration plots were used to evaluate the prediction performance of the model. For external validation, 143 patients who were admitted between February and June 2023 from a different associated hospital in Lu'an City were also used.</p><p><strong>Results: </strong>The XGBoost model demonstrated the most accurate prediction performance out of the five machine learning techniques. Diabetes, Red blood cell distribution width (RDW), and Triglyceride glucose-body mass index (TyG-BMI), as determined by Lasso regression and multivariate logistic regression. Calibration plot analysis demonstrated that the XGBoost model maintained strong calibration performance across both training and testing datasets, with calibration curves closely aligning with the ideal curve. This alignment signifies a high level of concordance between predicted probabilities and observed event rates. Additionally, decision curve analysis highlighted that both decision trees and XGBoost models achieved higher net benefits within the majority of threshold ranges, emphasizing their significant potential in clinical decision-making processes. The XGBoost model's area under the ROC curve (AUC) reached 0.903, while the external validation dataset yielded an AUC of 0.891, further validating the model's predictive accuracy and its ability to generalize across different datasets.</p><p><strong>Conclusion: </strong>TyG-BMI, RDW, and diabetes mellitus at the time of admission are the factors affecting readmission of elderly patients with coronary artery disease, and the model constructed based on the XGBoost algorithm for readmission risk prediction has good predictive efficacy, which can provide guidance for identifying high-risk patients and timely intervention strategies.</p>","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"11 ","pages":"1497916"},"PeriodicalIF":2.8000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11689274/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2024.1497916","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: To investigate the risk factors for readmission of elderly patients with coronary artery disease, and to construct and validate a predictive model for readmission risk of elderly patients with coronary artery disease within 3 years by applying machine learning method.
Methods: We selected 575 elderly patients with CHD admitted to the Affiliated Lu'an Hospital of Anhui Medical University from January 2020 to January 2023. Based on whether patients were readmitted within 3 years, they were divided into two groups: those readmitted within 3 years (215 patients) and those not readmitted within 3 years (360 patients). Lasso regression and multivariate logistic regression were used to compare the predictive value of these models. XGBoost, LR, RF, KNN and DT algorithms were used to build prediction models for readmission risk. ROC curves and calibration plots were used to evaluate the prediction performance of the model. For external validation, 143 patients who were admitted between February and June 2023 from a different associated hospital in Lu'an City were also used.
Results: The XGBoost model demonstrated the most accurate prediction performance out of the five machine learning techniques. Diabetes, Red blood cell distribution width (RDW), and Triglyceride glucose-body mass index (TyG-BMI), as determined by Lasso regression and multivariate logistic regression. Calibration plot analysis demonstrated that the XGBoost model maintained strong calibration performance across both training and testing datasets, with calibration curves closely aligning with the ideal curve. This alignment signifies a high level of concordance between predicted probabilities and observed event rates. Additionally, decision curve analysis highlighted that both decision trees and XGBoost models achieved higher net benefits within the majority of threshold ranges, emphasizing their significant potential in clinical decision-making processes. The XGBoost model's area under the ROC curve (AUC) reached 0.903, while the external validation dataset yielded an AUC of 0.891, further validating the model's predictive accuracy and its ability to generalize across different datasets.
Conclusion: TyG-BMI, RDW, and diabetes mellitus at the time of admission are the factors affecting readmission of elderly patients with coronary artery disease, and the model constructed based on the XGBoost algorithm for readmission risk prediction has good predictive efficacy, which can provide guidance for identifying high-risk patients and timely intervention strategies.
背景:探讨老年冠心病患者再入院的危险因素,应用机器学习方法构建并验证老年冠心病患者3年内再入院风险预测模型。方法:选择2020年1月至2023年1月安徽医科大学附属六安医院住院的老年冠心病患者575例。根据患者3年内是否再入院分为3年内再入院组(215例)和3年内未再入院组(360例)。采用Lasso回归和多元逻辑回归对模型的预测值进行比较。采用XGBoost、LR、RF、KNN和DT算法建立再入院风险预测模型。采用ROC曲线和校正图评价模型的预测性能。为了进行外部验证,还使用了2023年2月至6月期间从六安市另一家附属医院入院的143例患者。结果:XGBoost模型在五种机器学习技术中表现出最准确的预测性能。糖尿病、红细胞分布宽度(RDW)和甘油三酯葡萄糖-体重指数(TyG-BMI),由Lasso回归和多变量logistic回归确定。校准图分析表明,XGBoost模型在训练和测试数据集上都保持了较强的校准性能,校准曲线与理想曲线非常接近。这种对齐表示预测概率和观测到的事件率之间高度一致。此外,决策曲线分析强调,决策树和XGBoost模型在大多数阈值范围内都获得了更高的净效益,强调了它们在临床决策过程中的巨大潜力。XGBoost模型的ROC曲线下面积(area under ROC curve, AUC)达到0.903,而外部验证数据集的AUC为0.891,进一步验证了模型的预测精度及其跨不同数据集的泛化能力。结论:入院时TyG-BMI、RDW、糖尿病是影响老年冠心病患者再入院的因素,基于XGBoost算法构建的再入院风险预测模型具有较好的预测效果,可为识别高危患者和及时采取干预策略提供指导。
期刊介绍:
Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers?
At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.