Huixiu Hu , Yajie Zhao , Chao Sun , Quanying Wu , Ying Deng , Jie Liu
{"title":"Enhancing readmission prediction model in older stroke patients by integrating insight from readiness for hospital discharge: Prospective cohort study","authors":"Huixiu Hu , Yajie Zhao , Chao Sun , Quanying Wu , Ying Deng , Jie Liu","doi":"10.1016/j.ijmedinf.2025.105845","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The 30-day hospital readmission rate is a key indicator of healthcare quality and system efficiency. This study aimed to develop machine-learning (ML) models to predict unplanned 30-day readmissions in older patients with ischemic stroke (IS) using a prospective cohort design.</div></div><div><h3>Methods</h3><div>Patients were divided into two datasets: dataset I (January 2020–December 2021) for model development and dataset II (January 2022–December 2023) for validation. A diffusion model was applied to address data imbalance. Eleven machine-learning methods, including Random Forest (RF), Logistic Regression, CatBoost, eXtreme Gradient Boosting Light Gradient Boosting Machine, K-Nearest Neighbors Support Vector Machine, Multi-Layer Perceptron, and Gaussian Naive Bayes, and 2 ensemble learning models, were constructed to predict readmissions. Bayesian optimization was used to fine-tune the hyperparameters of these models. Model performance was primarily evaluated using the area under the receiver operating characteristic curve (AUC). Shapley Additive Explanations (SHAP) were utilized to identify and interpret the significance of predictive variables.</div></div><div><h3>Results</h3><div>Dataset I included 489 patients, while dataset II comprised 418 patients, with readmission rates of 15.3 % and 16.0 %, respectively. The RF model achieved the highest predictive performance (AUC = 0.9116, sensitivity = 0.8806, specificity = 0.7806). SHAP analysis identified readiness for hospital discharge as the most significant predictor of readmission.</div></div><div><h3>Conclusion</h3><div>The RF model shows promise for predicting unplanned 30-day readmissions in older patients with IS. Multi-center studies with larger sample sizes are needed to validate these findings.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"197 ","pages":"Article 105845"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625000620","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
The 30-day hospital readmission rate is a key indicator of healthcare quality and system efficiency. This study aimed to develop machine-learning (ML) models to predict unplanned 30-day readmissions in older patients with ischemic stroke (IS) using a prospective cohort design.
Methods
Patients were divided into two datasets: dataset I (January 2020–December 2021) for model development and dataset II (January 2022–December 2023) for validation. A diffusion model was applied to address data imbalance. Eleven machine-learning methods, including Random Forest (RF), Logistic Regression, CatBoost, eXtreme Gradient Boosting Light Gradient Boosting Machine, K-Nearest Neighbors Support Vector Machine, Multi-Layer Perceptron, and Gaussian Naive Bayes, and 2 ensemble learning models, were constructed to predict readmissions. Bayesian optimization was used to fine-tune the hyperparameters of these models. Model performance was primarily evaluated using the area under the receiver operating characteristic curve (AUC). Shapley Additive Explanations (SHAP) were utilized to identify and interpret the significance of predictive variables.
Results
Dataset I included 489 patients, while dataset II comprised 418 patients, with readmission rates of 15.3 % and 16.0 %, respectively. The RF model achieved the highest predictive performance (AUC = 0.9116, sensitivity = 0.8806, specificity = 0.7806). SHAP analysis identified readiness for hospital discharge as the most significant predictor of readmission.
Conclusion
The RF model shows promise for predicting unplanned 30-day readmissions in older patients with IS. Multi-center studies with larger sample sizes are needed to validate these findings.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.