Taghi Khaniyev, Efecan Cekic, Muhammet Abdullah Koc, Ilke Dogan, Sahin Hanalioglu
{"title":"评估机器学习模型在预测神经外科开颅患者重症监护病房出院中的作用:大数据分析。","authors":"Taghi Khaniyev, Efecan Cekic, Muhammet Abdullah Koc, Ilke Dogan, Sahin Hanalioglu","doi":"10.1007/s12028-025-02246-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Predicting intensive care unit (ICU) discharge for neurosurgical patients is crucial for optimizing bed sources, reducing costs, and improving outcomes. Our study aims to develop and validate machine learning (ML) models to predict ICU discharge within 24 h for patients undergoing craniotomy.</p><p><strong>Methods: </strong>The 2,742 patients undergoing craniotomy were identified from Medical Information Mart for Intensive Care dataset using diagnosis-related group and International Classification of Diseases codes. Demographic, clinical, laboratory, and radiological data were collected and preprocessed. Textual clinical examinations were converted into numerical scales. Data were split into training (70%), validation (15%), and test (15%) sets. Four ML models, logistic regression (LR), decision tree, random forest, and neural network (NN), were trained and evaluated. Model performance was assessed using area under the receiver operating characteristic curve (AUC), average precision (AP), accuracy, and F1 scores. Shapley Additive Explanations (SHAP) were used to analyze importance of features. Statistical analyses were performed using R (version 4.2.1) and ML analyses with Python (version 3.8), using scikit-learn, tensorflow, and shap packages.</p><p><strong>Results: </strong>Cohort included 2,742 patients (mean age 58.2 years; first and third quartiles 47-70 years), with 53.4% being male (n = 1,464). Total ICU stay was 15,645 bed days (mean length of stay 4.7 days), and total hospital stay was 32,008 bed days (mean length of stay 10.8 days). Random forest demonstrated highest performance (AUC 0.831, AP 0.561, accuracy 0.827, F1-score 0.339) on test set. NN achieved an AUC of 0.824, with an AP, accuracy, and F1-score of 0.558, 0.830, and 0.383, respectively. LR achieved an AUC of 0.821 and an accuracy of 0.829. The decision tree model showed lowest performance (AUC 0.813, accuracy 0.822). Key predictors of SHAP analysis included Glasgow Coma Scale, respiratory-related parameters (i.e., tidal volume, respiratory effort), intracranial pressure, arterial pH, and Richmond Agitation-Sedation Scale.</p><p><strong>Conclusions: </strong>Random forest and NN predict ICU discharge well, whereas LR is interpretable but less accurate. Numeric conversion of clinical data improved performance. This study offers framework for predictions using clinical, radiological, and demographic features, with SHAP enhancing transparency.</p>","PeriodicalId":19118,"journal":{"name":"Neurocritical Care","volume":" ","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating the Machine Learning Models in Predicting Intensive Care Unit Discharge for Neurosurgical Patients Undergoing Craniotomy: A Big Data Analysis.\",\"authors\":\"Taghi Khaniyev, Efecan Cekic, Muhammet Abdullah Koc, Ilke Dogan, Sahin Hanalioglu\",\"doi\":\"10.1007/s12028-025-02246-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Predicting intensive care unit (ICU) discharge for neurosurgical patients is crucial for optimizing bed sources, reducing costs, and improving outcomes. Our study aims to develop and validate machine learning (ML) models to predict ICU discharge within 24 h for patients undergoing craniotomy.</p><p><strong>Methods: </strong>The 2,742 patients undergoing craniotomy were identified from Medical Information Mart for Intensive Care dataset using diagnosis-related group and International Classification of Diseases codes. Demographic, clinical, laboratory, and radiological data were collected and preprocessed. Textual clinical examinations were converted into numerical scales. Data were split into training (70%), validation (15%), and test (15%) sets. Four ML models, logistic regression (LR), decision tree, random forest, and neural network (NN), were trained and evaluated. Model performance was assessed using area under the receiver operating characteristic curve (AUC), average precision (AP), accuracy, and F1 scores. Shapley Additive Explanations (SHAP) were used to analyze importance of features. Statistical analyses were performed using R (version 4.2.1) and ML analyses with Python (version 3.8), using scikit-learn, tensorflow, and shap packages.</p><p><strong>Results: </strong>Cohort included 2,742 patients (mean age 58.2 years; first and third quartiles 47-70 years), with 53.4% being male (n = 1,464). Total ICU stay was 15,645 bed days (mean length of stay 4.7 days), and total hospital stay was 32,008 bed days (mean length of stay 10.8 days). Random forest demonstrated highest performance (AUC 0.831, AP 0.561, accuracy 0.827, F1-score 0.339) on test set. NN achieved an AUC of 0.824, with an AP, accuracy, and F1-score of 0.558, 0.830, and 0.383, respectively. LR achieved an AUC of 0.821 and an accuracy of 0.829. The decision tree model showed lowest performance (AUC 0.813, accuracy 0.822). Key predictors of SHAP analysis included Glasgow Coma Scale, respiratory-related parameters (i.e., tidal volume, respiratory effort), intracranial pressure, arterial pH, and Richmond Agitation-Sedation Scale.</p><p><strong>Conclusions: </strong>Random forest and NN predict ICU discharge well, whereas LR is interpretable but less accurate. Numeric conversion of clinical data improved performance. This study offers framework for predictions using clinical, radiological, and demographic features, with SHAP enhancing transparency.</p>\",\"PeriodicalId\":19118,\"journal\":{\"name\":\"Neurocritical Care\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocritical Care\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s12028-025-02246-9\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocritical Care","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12028-025-02246-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景:预测神经外科患者的重症监护病房(ICU)出院对于优化床位来源、降低成本和改善预后至关重要。我们的研究旨在开发和验证机器学习(ML)模型,以预测开颅患者24小时内ICU出院情况。方法:使用诊断相关组和国际疾病分类代码从重症监护医学信息集市数据集中识别2742例开颅患者。收集人口统计学、临床、实验室和放射学数据并进行预处理。临床文本检查转换为数值量表。数据被分成训练集(70%)、验证集(15%)和测试集(15%)。四种机器学习模型,逻辑回归(LR),决策树,随机森林和神经网络(NN),进行了训练和评估。使用受试者工作特征曲线下面积(AUC)、平均精度(AP)、准确度和F1分数来评估模型的性能。采用Shapley加性解释(SHAP)分析特征的重要性。使用R(版本4.2.1)进行统计分析,使用Python(版本3.8)进行ML分析,使用scikit-learn, tensorflow和shape包。结果:队列纳入2742例患者(平均年龄58.2岁;第一和第三四分位数(47-70岁),其中53.4%为男性(n = 1464)。ICU总住院15645个床位日(平均住院时间4.7天),住院总32008个床位日(平均住院时间10.8天)。随机森林在测试集上表现出最高的性能(AUC 0.831, AP 0.561,准确率0.827,F1-score 0.339)。NN的AUC为0.824,AP、准确率和f1得分分别为0.558、0.830和0.383。LR的AUC为0.821,准确度为0.829。决策树模型表现最差(AUC 0.813,准确率0.822)。SHAP分析的关键预测指标包括格拉斯哥昏迷量表、呼吸相关参数(即潮气量、呼吸力)、颅内压、动脉pH和Richmond躁动-镇静量表。结论:随机森林和神经网络预测ICU出院较好,而LR可解释,但准确性较低。临床数据的数值转换提高了性能。这项研究为临床、放射学和人口统计学特征的预测提供了框架,并提高了SHAP的透明度。
Evaluating the Machine Learning Models in Predicting Intensive Care Unit Discharge for Neurosurgical Patients Undergoing Craniotomy: A Big Data Analysis.
Background: Predicting intensive care unit (ICU) discharge for neurosurgical patients is crucial for optimizing bed sources, reducing costs, and improving outcomes. Our study aims to develop and validate machine learning (ML) models to predict ICU discharge within 24 h for patients undergoing craniotomy.
Methods: The 2,742 patients undergoing craniotomy were identified from Medical Information Mart for Intensive Care dataset using diagnosis-related group and International Classification of Diseases codes. Demographic, clinical, laboratory, and radiological data were collected and preprocessed. Textual clinical examinations were converted into numerical scales. Data were split into training (70%), validation (15%), and test (15%) sets. Four ML models, logistic regression (LR), decision tree, random forest, and neural network (NN), were trained and evaluated. Model performance was assessed using area under the receiver operating characteristic curve (AUC), average precision (AP), accuracy, and F1 scores. Shapley Additive Explanations (SHAP) were used to analyze importance of features. Statistical analyses were performed using R (version 4.2.1) and ML analyses with Python (version 3.8), using scikit-learn, tensorflow, and shap packages.
Results: Cohort included 2,742 patients (mean age 58.2 years; first and third quartiles 47-70 years), with 53.4% being male (n = 1,464). Total ICU stay was 15,645 bed days (mean length of stay 4.7 days), and total hospital stay was 32,008 bed days (mean length of stay 10.8 days). Random forest demonstrated highest performance (AUC 0.831, AP 0.561, accuracy 0.827, F1-score 0.339) on test set. NN achieved an AUC of 0.824, with an AP, accuracy, and F1-score of 0.558, 0.830, and 0.383, respectively. LR achieved an AUC of 0.821 and an accuracy of 0.829. The decision tree model showed lowest performance (AUC 0.813, accuracy 0.822). Key predictors of SHAP analysis included Glasgow Coma Scale, respiratory-related parameters (i.e., tidal volume, respiratory effort), intracranial pressure, arterial pH, and Richmond Agitation-Sedation Scale.
Conclusions: Random forest and NN predict ICU discharge well, whereas LR is interpretable but less accurate. Numeric conversion of clinical data improved performance. This study offers framework for predictions using clinical, radiological, and demographic features, with SHAP enhancing transparency.
期刊介绍:
Neurocritical Care is a peer reviewed scientific publication whose major goal is to disseminate new knowledge on all aspects of acute neurological care. It is directed towards neurosurgeons, neuro-intensivists, neurologists, anesthesiologists, emergency physicians, and critical care nurses treating patients with urgent neurologic disorders. These are conditions that may potentially evolve rapidly and could need immediate medical or surgical intervention. Neurocritical Care provides a comprehensive overview of current developments in intensive care neurology, neurosurgery and neuroanesthesia and includes information about new therapeutic avenues and technological innovations. Neurocritical Care is the official journal of the Neurocritical Care Society.