Machine learning-based predictive tools and nomogram for in-hospital mortality in critically ill cancer patients: development and external validation using retrospective cohorts.
{"title":"Machine learning-based predictive tools and nomogram for in-hospital mortality in critically ill cancer patients: development and external validation using retrospective cohorts.","authors":"Kaier Gu, Saisai Lu","doi":"10.1186/s12911-025-03054-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The incidence of intensive care unit (ICU) admissions and the corresponding mortality rates among cancer patients are both high. However, the existing scoring systems all lack specificity. This research seeks to establish and validate a prediction model for early forecasting of in-hospital mortality in critically ill cancer patients.</p><p><strong>Methods: </strong>A retrospective analysis was conducted utilizing data from cancer patients obtained from the eICU and MIMIC-IV databases. The least absolute shrinkage and selection operator method was employed to screen predictive factors and construct six machine learning (ML) models. These models were mainly compared in terms of their predictive performance through area under the curve (AUC) and underwent external validation. Nomograms were developed using multivariate logistic regression (LR) analysis findings. The Shapley Additive exPlanations (SHAP) method was employed to explain the variables within the ML models.</p><p><strong>Results: </strong>Twelve predictive factors were chosen to develop the ML models. Among these models, the LR model and the eXtreme gradient boosting (XGB) model demonstrated the optimal efficacy. In the external validation cohort, their AUC values reached 0.751 [95% confidence interval (CI): 0.735 - 0.768] and 0.737 (95% CI: 0.720 - 0.754), respectively. Moreover, nomograms and SHAP were employed to explain the variables. Additionally, a user-friendly web-based calculator tool was created.</p><p><strong>Conclusions: </strong>The LR and XGB models were successfully developed to predict in-hospital mortality in critically ill cancer patients. Their robust predictive ability was demonstrated in the external validation cohorts. This model can assist physicians in clinical decision-making and timely intervention.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"251"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12231625/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03054-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The incidence of intensive care unit (ICU) admissions and the corresponding mortality rates among cancer patients are both high. However, the existing scoring systems all lack specificity. This research seeks to establish and validate a prediction model for early forecasting of in-hospital mortality in critically ill cancer patients.
Methods: A retrospective analysis was conducted utilizing data from cancer patients obtained from the eICU and MIMIC-IV databases. The least absolute shrinkage and selection operator method was employed to screen predictive factors and construct six machine learning (ML) models. These models were mainly compared in terms of their predictive performance through area under the curve (AUC) and underwent external validation. Nomograms were developed using multivariate logistic regression (LR) analysis findings. The Shapley Additive exPlanations (SHAP) method was employed to explain the variables within the ML models.
Results: Twelve predictive factors were chosen to develop the ML models. Among these models, the LR model and the eXtreme gradient boosting (XGB) model demonstrated the optimal efficacy. In the external validation cohort, their AUC values reached 0.751 [95% confidence interval (CI): 0.735 - 0.768] and 0.737 (95% CI: 0.720 - 0.754), respectively. Moreover, nomograms and SHAP were employed to explain the variables. Additionally, a user-friendly web-based calculator tool was created.
Conclusions: The LR and XGB models were successfully developed to predict in-hospital mortality in critically ill cancer patients. Their robust predictive ability was demonstrated in the external validation cohorts. This model can assist physicians in clinical decision-making and timely intervention.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.