Peijie Zhang, Shuo Yuan, Shuzhan Zhang, Zhiheng Yuan, Zi Ye, Lanxin Lv, Hongning Yang, Hui Peng, Haiquan Li, Ningjun Zhao
{"title":"在ARDS全球新定义的背景下:一种可解释的机器学习方法预测败血症合并ARDS患者28天ICU死亡率。","authors":"Peijie Zhang, Shuo Yuan, Shuzhan Zhang, Zhiheng Yuan, Zi Ye, Lanxin Lv, Hongning Yang, Hui Peng, Haiquan Li, Ningjun Zhao","doi":"10.3389/fphys.2025.1617196","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Acute respiratory distress syndrome (ARDS) is a prevalent clinical complication among patients with sepsis, characterized by high incidence and mortality rates. The definition of ARDS has evolved over time, with the new global definition introducing significant updates to its diagnosis and treatment. Our objective is to develop and validate an interpretable prediction model for the prognosis of sepsis patients complicated by ARDS, utilizing machine learning techniques in accordance with the new global definition.</p><p><strong>Methods: </strong>This study extracted data from the MIMIC database (version MIMIC-IV 2.2) to create the training set for our model. For external validation, this study used data from sepsis patients complicated by ARDS who met the new global definition of ARDS, sourced from the Affiliated Hospital of Xuzhou Medical University. Lasso regression with cross-validation was used to identify key predictors of patient prognosis. Subsequently, this study established models to predict the 28-day prognosis following ICU admission using various machine learning algorithms, including logistic regression, random forest, decision tree, support vector machine classifier, LightGBM, XGBoost, AdaBoost, and multi-layer perceptron (MLP). Model performance was assessed using ROC curves, clinical decision curves (DCA), and calibration curves, while SHAP values were utilized to interpret the machine learning models.</p><p><strong>Results: </strong>A total of 905 patients with sepsis complicated by ARDS were included in our analysis, leading to the selection of 15 key variables for model development. Based on the AUC of the ROC curve, as well as DCA and calibration curve results from the training set, the support vector classifier (SVC) model demonstrated strong performance, achieving an average AUC of 0.792 in the internal validation set and 0.816 in the external validation set.</p><p><strong>Conclusion: </strong>The application of machine learning methodologies to construct prognostic prediction models for sepsis patients complicated by ARDS, informed by the new global definition, proves to be reliable. This approach can assist clinicians in developing personalized treatment strategies for affected patients.</p>","PeriodicalId":12477,"journal":{"name":"Frontiers in Physiology","volume":"16 ","pages":"1617196"},"PeriodicalIF":3.2000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491239/pdf/","citationCount":"0","resultStr":"{\"title\":\"Under the background of the new global definition of ARDS: an interpretable machine learning approach for predicting 28-day ICU mortality in patients with sepsis complicated by ARDS.\",\"authors\":\"Peijie Zhang, Shuo Yuan, Shuzhan Zhang, Zhiheng Yuan, Zi Ye, Lanxin Lv, Hongning Yang, Hui Peng, Haiquan Li, Ningjun Zhao\",\"doi\":\"10.3389/fphys.2025.1617196\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Acute respiratory distress syndrome (ARDS) is a prevalent clinical complication among patients with sepsis, characterized by high incidence and mortality rates. The definition of ARDS has evolved over time, with the new global definition introducing significant updates to its diagnosis and treatment. Our objective is to develop and validate an interpretable prediction model for the prognosis of sepsis patients complicated by ARDS, utilizing machine learning techniques in accordance with the new global definition.</p><p><strong>Methods: </strong>This study extracted data from the MIMIC database (version MIMIC-IV 2.2) to create the training set for our model. For external validation, this study used data from sepsis patients complicated by ARDS who met the new global definition of ARDS, sourced from the Affiliated Hospital of Xuzhou Medical University. Lasso regression with cross-validation was used to identify key predictors of patient prognosis. Subsequently, this study established models to predict the 28-day prognosis following ICU admission using various machine learning algorithms, including logistic regression, random forest, decision tree, support vector machine classifier, LightGBM, XGBoost, AdaBoost, and multi-layer perceptron (MLP). Model performance was assessed using ROC curves, clinical decision curves (DCA), and calibration curves, while SHAP values were utilized to interpret the machine learning models.</p><p><strong>Results: </strong>A total of 905 patients with sepsis complicated by ARDS were included in our analysis, leading to the selection of 15 key variables for model development. Based on the AUC of the ROC curve, as well as DCA and calibration curve results from the training set, the support vector classifier (SVC) model demonstrated strong performance, achieving an average AUC of 0.792 in the internal validation set and 0.816 in the external validation set.</p><p><strong>Conclusion: </strong>The application of machine learning methodologies to construct prognostic prediction models for sepsis patients complicated by ARDS, informed by the new global definition, proves to be reliable. This approach can assist clinicians in developing personalized treatment strategies for affected patients.</p>\",\"PeriodicalId\":12477,\"journal\":{\"name\":\"Frontiers in Physiology\",\"volume\":\"16 \",\"pages\":\"1617196\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491239/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Physiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3389/fphys.2025.1617196\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"PHYSIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Physiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fphys.2025.1617196","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"PHYSIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景:急性呼吸窘迫综合征(Acute respiratory distress syndrome, ARDS)是脓毒症患者常见的临床并发症,具有高发病率和高死亡率的特点。随着时间的推移,ARDS的定义发生了变化,新的全球定义对其诊断和治疗进行了重大更新。我们的目标是根据新的全球定义,利用机器学习技术,开发和验证一种可解释的脓毒症合并ARDS患者预后预测模型。方法:本研究从MIMIC数据库(版本MIMIC- iv 2.2)中提取数据,为我们的模型创建训练集。为了进行外部验证,本研究使用了来自徐州医科大学附属医院的脓毒症合并ARDS患者的数据,这些患者符合ARDS的新全球定义。交叉验证的套索回归用于确定患者预后的关键预测因素。随后,本研究利用各种机器学习算法,包括逻辑回归、随机森林、决策树、支持向量机分类器、LightGBM、XGBoost、AdaBoost和多层感知器(MLP),建立了预测ICU入院后28天预后的模型。使用ROC曲线、临床决策曲线(DCA)和校准曲线评估模型的性能,而使用SHAP值来解释机器学习模型。结果:我们共纳入905例脓毒症合并ARDS患者,选择15个关键变量进行模型开发。基于ROC曲线的AUC以及训练集的DCA和校准曲线结果,支持向量分类器(SVC)模型表现出较强的性能,在内部验证集中实现了0.792的平均AUC,在外部验证集中实现了0.816的平均AUC。结论:根据新的全球定义,应用机器学习方法构建脓毒症合并ARDS患者的预后预测模型是可靠的。这种方法可以帮助临床医生为受影响的患者制定个性化的治疗策略。
Under the background of the new global definition of ARDS: an interpretable machine learning approach for predicting 28-day ICU mortality in patients with sepsis complicated by ARDS.
Background: Acute respiratory distress syndrome (ARDS) is a prevalent clinical complication among patients with sepsis, characterized by high incidence and mortality rates. The definition of ARDS has evolved over time, with the new global definition introducing significant updates to its diagnosis and treatment. Our objective is to develop and validate an interpretable prediction model for the prognosis of sepsis patients complicated by ARDS, utilizing machine learning techniques in accordance with the new global definition.
Methods: This study extracted data from the MIMIC database (version MIMIC-IV 2.2) to create the training set for our model. For external validation, this study used data from sepsis patients complicated by ARDS who met the new global definition of ARDS, sourced from the Affiliated Hospital of Xuzhou Medical University. Lasso regression with cross-validation was used to identify key predictors of patient prognosis. Subsequently, this study established models to predict the 28-day prognosis following ICU admission using various machine learning algorithms, including logistic regression, random forest, decision tree, support vector machine classifier, LightGBM, XGBoost, AdaBoost, and multi-layer perceptron (MLP). Model performance was assessed using ROC curves, clinical decision curves (DCA), and calibration curves, while SHAP values were utilized to interpret the machine learning models.
Results: A total of 905 patients with sepsis complicated by ARDS were included in our analysis, leading to the selection of 15 key variables for model development. Based on the AUC of the ROC curve, as well as DCA and calibration curve results from the training set, the support vector classifier (SVC) model demonstrated strong performance, achieving an average AUC of 0.792 in the internal validation set and 0.816 in the external validation set.
Conclusion: The application of machine learning methodologies to construct prognostic prediction models for sepsis patients complicated by ARDS, informed by the new global definition, proves to be reliable. This approach can assist clinicians in developing personalized treatment strategies for affected patients.
期刊介绍:
Frontiers in Physiology is a leading journal in its field, publishing rigorously peer-reviewed research on the physiology of living systems, from the subcellular and molecular domains to the intact organism, and its interaction with the environment. Field Chief Editor George E. Billman at the Ohio State University Columbus is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.