{"title":"基于机器学习的老年神经危重症患者28天死亡率预测模型","authors":"Jia Yuan , Jiong Xiong , Jinfeng Yang , Qi Dong , Yin Wang , Yumei Cheng , Xianjun Chen , Ying Liu , Chuan Xiao , Junlin Tao , Shuangzi Lizhang , Yangzi Liujiao , Qimin Chen , Feng Shen","doi":"10.1016/j.cmpb.2025.108589","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The growing population of elderly neurocritically ill patients highlights the need for effective prognosis prediction tools. This study aims to develop and validate machine learning (ML) models for predicting 28-day mortality in intensive care units (ICUs).</div></div><div><h3>Methods</h3><div>Data were extracted from the Medical Information Mart for Intensive Care IV(MIMIC-IV) database, focusing on elderly neurocritical ill patients with ICU stays ≥ 24 h. The cohort was split into 70 % for training and 30 % for internal validation. We analyzed 58 variables, including demographics, vital signs, medications, lab results, comorbidities, and medical scores, using Lasso regression to identify predictors of 28-day mortality. Seven ML algorithms were evaluated, and the best model was validated with data from Guizhou Medical University Affiliated Hospital. A log-rank test was used to assess survival differences in Kaplan-Meier curves. Shapley Additive Explanations (SHAP) were used to interpret the best model, while subgroup analysis identified variations in model performance across different populations.</div></div><div><h3>Results</h3><div>The study included 1,773 elderly neurocritically ill patients, with a 28-day mortality rate of 28.6 %. The Light Gradient Boosting Machine (LightGBM) outperformed other models, achieving an area under the curve (AUC) of 0.896 in internal validation and 0.812 in external validation. Kaplan-Meier analysis showed that higher LightGBM prediction scores correlated with lower survival probabilities. Key predictors identified through SHAP analysis included partial pressure of arterial carbon dioxide (PaCO<sub>2</sub>), Acute physiology and chronic health evaluation II (APACHE II), white blood cell count, age, and lactate. The LightGBM model demonstrated consistent performance across various subgroups.</div></div><div><h3>Conclusions</h3><div>The LightGBM model effectively predicts 28-day mortality risk in elderly neurocritically ill patients, aiding clinicians in management and resource allocation. Its reliable performance across diverse subgroups underscores its clinical utility.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108589"},"PeriodicalIF":4.9000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based 28-day mortality prediction model for elderly neurocritically Ill patients\",\"authors\":\"Jia Yuan , Jiong Xiong , Jinfeng Yang , Qi Dong , Yin Wang , Yumei Cheng , Xianjun Chen , Ying Liu , Chuan Xiao , Junlin Tao , Shuangzi Lizhang , Yangzi Liujiao , Qimin Chen , Feng Shen\",\"doi\":\"10.1016/j.cmpb.2025.108589\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>The growing population of elderly neurocritically ill patients highlights the need for effective prognosis prediction tools. This study aims to develop and validate machine learning (ML) models for predicting 28-day mortality in intensive care units (ICUs).</div></div><div><h3>Methods</h3><div>Data were extracted from the Medical Information Mart for Intensive Care IV(MIMIC-IV) database, focusing on elderly neurocritical ill patients with ICU stays ≥ 24 h. The cohort was split into 70 % for training and 30 % for internal validation. We analyzed 58 variables, including demographics, vital signs, medications, lab results, comorbidities, and medical scores, using Lasso regression to identify predictors of 28-day mortality. Seven ML algorithms were evaluated, and the best model was validated with data from Guizhou Medical University Affiliated Hospital. A log-rank test was used to assess survival differences in Kaplan-Meier curves. Shapley Additive Explanations (SHAP) were used to interpret the best model, while subgroup analysis identified variations in model performance across different populations.</div></div><div><h3>Results</h3><div>The study included 1,773 elderly neurocritically ill patients, with a 28-day mortality rate of 28.6 %. The Light Gradient Boosting Machine (LightGBM) outperformed other models, achieving an area under the curve (AUC) of 0.896 in internal validation and 0.812 in external validation. Kaplan-Meier analysis showed that higher LightGBM prediction scores correlated with lower survival probabilities. Key predictors identified through SHAP analysis included partial pressure of arterial carbon dioxide (PaCO<sub>2</sub>), Acute physiology and chronic health evaluation II (APACHE II), white blood cell count, age, and lactate. The LightGBM model demonstrated consistent performance across various subgroups.</div></div><div><h3>Conclusions</h3><div>The LightGBM model effectively predicts 28-day mortality risk in elderly neurocritically ill patients, aiding clinicians in management and resource allocation. Its reliable performance across diverse subgroups underscores its clinical utility.</div></div>\",\"PeriodicalId\":10624,\"journal\":{\"name\":\"Computer methods and programs in biomedicine\",\"volume\":\"260 \",\"pages\":\"Article 108589\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169260725000069\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725000069","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Machine learning-based 28-day mortality prediction model for elderly neurocritically Ill patients
Background
The growing population of elderly neurocritically ill patients highlights the need for effective prognosis prediction tools. This study aims to develop and validate machine learning (ML) models for predicting 28-day mortality in intensive care units (ICUs).
Methods
Data were extracted from the Medical Information Mart for Intensive Care IV(MIMIC-IV) database, focusing on elderly neurocritical ill patients with ICU stays ≥ 24 h. The cohort was split into 70 % for training and 30 % for internal validation. We analyzed 58 variables, including demographics, vital signs, medications, lab results, comorbidities, and medical scores, using Lasso regression to identify predictors of 28-day mortality. Seven ML algorithms were evaluated, and the best model was validated with data from Guizhou Medical University Affiliated Hospital. A log-rank test was used to assess survival differences in Kaplan-Meier curves. Shapley Additive Explanations (SHAP) were used to interpret the best model, while subgroup analysis identified variations in model performance across different populations.
Results
The study included 1,773 elderly neurocritically ill patients, with a 28-day mortality rate of 28.6 %. The Light Gradient Boosting Machine (LightGBM) outperformed other models, achieving an area under the curve (AUC) of 0.896 in internal validation and 0.812 in external validation. Kaplan-Meier analysis showed that higher LightGBM prediction scores correlated with lower survival probabilities. Key predictors identified through SHAP analysis included partial pressure of arterial carbon dioxide (PaCO2), Acute physiology and chronic health evaluation II (APACHE II), white blood cell count, age, and lactate. The LightGBM model demonstrated consistent performance across various subgroups.
Conclusions
The LightGBM model effectively predicts 28-day mortality risk in elderly neurocritically ill patients, aiding clinicians in management and resource allocation. Its reliable performance across diverse subgroups underscores its clinical utility.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.