Yu Seong Hwang , Seongheon Kim , Inhyeok Yim , Yukyoung Park , Seonguk Kang , Heui Sug Jo
{"title":"预测缺血性卒中患者再入院的可能性:使用通用数据模型数据的可解释的机器学习方法。","authors":"Yu Seong Hwang , Seongheon Kim , Inhyeok Yim , Yukyoung Park , Seonguk Kang , Heui Sug Jo","doi":"10.1016/j.ijmedinf.2024.105754","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Ischemic stroke affects 15 million people worldwide, causing five million deaths annually. Despite declining mortality rates, stroke incidence and readmission risks remain high, highlighting the need for preventing readmission to improve the quality of life of survivors. This study developed a machine-learning model to predict 90-day stroke readmission using electronic medical records converted to the common data model (CDM) from the Regional Accountable Care Hospital in Gangwon state in South Korea.</div></div><div><h3>Methods</h3><div>We retrospectively analyzed data from 1,136 patients with ischemic stroke admitted between August 2003 and August 2021 after excluding cases with missing blood test values. Demographics, blood test results, treatments, and comorbidities were used as key features. Six machine learning models and three deep learning models were used to predict 90-day readmission using the synthetic minority over-sampling technique to address class imbalance. Models were evaluated using threefold cross-validation, and SHapley Additive exPlanations (SHAP) values were calculated to interpret feature importance.</div></div><div><h3>Results</h3><div>Among 1,136 patients, 196 (17.2 %) were readmitted within 90 days. Male patients were significantly more likely to experience readmission (<em>p</em> = 0.02). LightGBM achieved an area under the curve of 0.94, demonstrating that analyzing stroke and stroke-related conditions provides greater predictive accuracy than predicting stroke alone or all-cause readmissions. SHAP analysis highlighted renal and metabolic variables, including creatinine, blood urea nitrogen, calcium, sodium, and potassium, as key predictors of readmission.</div></div><div><h3>Conclusion</h3><div>Machine-learning models using electronic health record-based CDM data demonstrated strong predictive performance for 90-day stroke readmission. These results support personalized post-discharge management and lay the groundwork for future multicenter studies.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"195 ","pages":"Article 105754"},"PeriodicalIF":3.7000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting the likelihood of readmission in patients with ischemic stroke: An explainable machine learning approach using common data model data\",\"authors\":\"Yu Seong Hwang , Seongheon Kim , Inhyeok Yim , Yukyoung Park , Seonguk Kang , Heui Sug Jo\",\"doi\":\"10.1016/j.ijmedinf.2024.105754\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Ischemic stroke affects 15 million people worldwide, causing five million deaths annually. Despite declining mortality rates, stroke incidence and readmission risks remain high, highlighting the need for preventing readmission to improve the quality of life of survivors. This study developed a machine-learning model to predict 90-day stroke readmission using electronic medical records converted to the common data model (CDM) from the Regional Accountable Care Hospital in Gangwon state in South Korea.</div></div><div><h3>Methods</h3><div>We retrospectively analyzed data from 1,136 patients with ischemic stroke admitted between August 2003 and August 2021 after excluding cases with missing blood test values. Demographics, blood test results, treatments, and comorbidities were used as key features. Six machine learning models and three deep learning models were used to predict 90-day readmission using the synthetic minority over-sampling technique to address class imbalance. Models were evaluated using threefold cross-validation, and SHapley Additive exPlanations (SHAP) values were calculated to interpret feature importance.</div></div><div><h3>Results</h3><div>Among 1,136 patients, 196 (17.2 %) were readmitted within 90 days. Male patients were significantly more likely to experience readmission (<em>p</em> = 0.02). LightGBM achieved an area under the curve of 0.94, demonstrating that analyzing stroke and stroke-related conditions provides greater predictive accuracy than predicting stroke alone or all-cause readmissions. SHAP analysis highlighted renal and metabolic variables, including creatinine, blood urea nitrogen, calcium, sodium, and potassium, as key predictors of readmission.</div></div><div><h3>Conclusion</h3><div>Machine-learning models using electronic health record-based CDM data demonstrated strong predictive performance for 90-day stroke readmission. These results support personalized post-discharge management and lay the groundwork for future multicenter studies.</div></div>\",\"PeriodicalId\":54950,\"journal\":{\"name\":\"International Journal of Medical Informatics\",\"volume\":\"195 \",\"pages\":\"Article 105754\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-12-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386505624004179\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624004179","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Predicting the likelihood of readmission in patients with ischemic stroke: An explainable machine learning approach using common data model data
Background
Ischemic stroke affects 15 million people worldwide, causing five million deaths annually. Despite declining mortality rates, stroke incidence and readmission risks remain high, highlighting the need for preventing readmission to improve the quality of life of survivors. This study developed a machine-learning model to predict 90-day stroke readmission using electronic medical records converted to the common data model (CDM) from the Regional Accountable Care Hospital in Gangwon state in South Korea.
Methods
We retrospectively analyzed data from 1,136 patients with ischemic stroke admitted between August 2003 and August 2021 after excluding cases with missing blood test values. Demographics, blood test results, treatments, and comorbidities were used as key features. Six machine learning models and three deep learning models were used to predict 90-day readmission using the synthetic minority over-sampling technique to address class imbalance. Models were evaluated using threefold cross-validation, and SHapley Additive exPlanations (SHAP) values were calculated to interpret feature importance.
Results
Among 1,136 patients, 196 (17.2 %) were readmitted within 90 days. Male patients were significantly more likely to experience readmission (p = 0.02). LightGBM achieved an area under the curve of 0.94, demonstrating that analyzing stroke and stroke-related conditions provides greater predictive accuracy than predicting stroke alone or all-cause readmissions. SHAP analysis highlighted renal and metabolic variables, including creatinine, blood urea nitrogen, calcium, sodium, and potassium, as key predictors of readmission.
Conclusion
Machine-learning models using electronic health record-based CDM data demonstrated strong predictive performance for 90-day stroke readmission. These results support personalized post-discharge management and lay the groundwork for future multicenter studies.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.