{"title":"预测哮喘患者住院时间延长:模型开发和外部验证。","authors":"Xinkai Ma, Peiqi Li, Yupeng Li, Yanqing Xing, Zhen Ma, Chuangchuan Dong, Liting Feng, Rujie Huo, Fei Hu, Yanting Dong, Jie Chen, Jiali Zhang, Xinrui Tian","doi":"10.1080/02770903.2025.2500081","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aims to develop and validate a machine learning (ML) model to predict prolonged hospitalization in asthma patients.</p><p><strong>Patients and methods: </strong>This retrospective cohort study included patients with asthma as the primary diagnosis. We randomly divided 2820 asthma patients from Beth Israel Deaconess Medical Center into a training set and an internal validation set (in an 8:2 ratio), and used 1714 asthma patients from 208 other hospitals in the United States as an external validation cohort. Prolonged hospitalization was the primary outcome. Feature selection was conducted using LASSO regression, univariate logistic regression, and multivariate logistic regression analyses. Nine ML algorithms were employed to develop predictive models.</p><p><strong>Results: </strong>Based on discrimination, calibration, and clinical utility, the Extreme Gradient Boosting (XGBoost) model demonstrated the best overall performance. The nine most important predictors in the model were age, oxygen saturation (SpO2), red blood cell count, hemoglobin count, comorbid pneumonia, chronic obstructive pulmonary disease (COPD), congestive heart failure, anxiety, and use of invasive mechanical ventilation. The XGBoost model achieved an area under the receiver operating characteristic curve (AUC) of 0.829 and a Cohen's Kappa value of 0.439 in the internal validation set, and an AUC of 0.745 and a Cohen's Kappa value of 0.315 in the external validation set. The decision curve analysis indicated good clinical utility of the model.</p><p><strong>Conclusions: </strong>The XGBoost model can effectively predict prolonged hospitalization in asthma patients.</p>","PeriodicalId":15076,"journal":{"name":"Journal of Asthma","volume":" ","pages":"1616-1626"},"PeriodicalIF":1.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting prolonged hospitalization in asthma patients: model development and external validation.\",\"authors\":\"Xinkai Ma, Peiqi Li, Yupeng Li, Yanqing Xing, Zhen Ma, Chuangchuan Dong, Liting Feng, Rujie Huo, Fei Hu, Yanting Dong, Jie Chen, Jiali Zhang, Xinrui Tian\",\"doi\":\"10.1080/02770903.2025.2500081\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>This study aims to develop and validate a machine learning (ML) model to predict prolonged hospitalization in asthma patients.</p><p><strong>Patients and methods: </strong>This retrospective cohort study included patients with asthma as the primary diagnosis. We randomly divided 2820 asthma patients from Beth Israel Deaconess Medical Center into a training set and an internal validation set (in an 8:2 ratio), and used 1714 asthma patients from 208 other hospitals in the United States as an external validation cohort. Prolonged hospitalization was the primary outcome. Feature selection was conducted using LASSO regression, univariate logistic regression, and multivariate logistic regression analyses. Nine ML algorithms were employed to develop predictive models.</p><p><strong>Results: </strong>Based on discrimination, calibration, and clinical utility, the Extreme Gradient Boosting (XGBoost) model demonstrated the best overall performance. The nine most important predictors in the model were age, oxygen saturation (SpO2), red blood cell count, hemoglobin count, comorbid pneumonia, chronic obstructive pulmonary disease (COPD), congestive heart failure, anxiety, and use of invasive mechanical ventilation. The XGBoost model achieved an area under the receiver operating characteristic curve (AUC) of 0.829 and a Cohen's Kappa value of 0.439 in the internal validation set, and an AUC of 0.745 and a Cohen's Kappa value of 0.315 in the external validation set. The decision curve analysis indicated good clinical utility of the model.</p><p><strong>Conclusions: </strong>The XGBoost model can effectively predict prolonged hospitalization in asthma patients.</p>\",\"PeriodicalId\":15076,\"journal\":{\"name\":\"Journal of Asthma\",\"volume\":\" \",\"pages\":\"1616-1626\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Asthma\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/02770903.2025.2500081\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/5/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"ALLERGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Asthma","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/02770903.2025.2500081","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/10 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ALLERGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:本研究旨在开发和验证机器学习(ML)模型来预测哮喘患者的长期住院。患者和方法:本回顾性队列研究纳入以哮喘为主要诊断的患者。我们将来自Beth Israel Deaconess医疗中心的2820例哮喘患者随机分为训练集和内部验证集(按8:2的比例),并使用来自美国其他208家医院的1714例哮喘患者作为外部验证队列。延长住院时间是主要结局。特征选择采用LASSO回归、单变量逻辑回归和多变量逻辑回归分析。采用9种ML算法建立预测模型。结果:基于鉴别、校准和临床应用,极端梯度增强(XGBoost)模型表现出最佳的整体性能。该模型中9个最重要的预测因素是年龄、血氧饱和度(SpO2)、红细胞计数、血红蛋白计数、合并症肺炎、慢性阻塞性肺疾病(COPD)、充血性心力衰竭、焦虑和使用有创机械通气。XGBoost模型在内部验证集中实现了接收者工作特征曲线下面积(AUC)为0.829,Cohen’s Kappa值为0.439;在外部验证集中实现了AUC为0.745,Cohen’s Kappa值为0.315。决策曲线分析表明该模型具有良好的临床应用价值。结论:XGBoost模型可有效预测哮喘患者住院时间延长。
Predicting prolonged hospitalization in asthma patients: model development and external validation.
Purpose: This study aims to develop and validate a machine learning (ML) model to predict prolonged hospitalization in asthma patients.
Patients and methods: This retrospective cohort study included patients with asthma as the primary diagnosis. We randomly divided 2820 asthma patients from Beth Israel Deaconess Medical Center into a training set and an internal validation set (in an 8:2 ratio), and used 1714 asthma patients from 208 other hospitals in the United States as an external validation cohort. Prolonged hospitalization was the primary outcome. Feature selection was conducted using LASSO regression, univariate logistic regression, and multivariate logistic regression analyses. Nine ML algorithms were employed to develop predictive models.
Results: Based on discrimination, calibration, and clinical utility, the Extreme Gradient Boosting (XGBoost) model demonstrated the best overall performance. The nine most important predictors in the model were age, oxygen saturation (SpO2), red blood cell count, hemoglobin count, comorbid pneumonia, chronic obstructive pulmonary disease (COPD), congestive heart failure, anxiety, and use of invasive mechanical ventilation. The XGBoost model achieved an area under the receiver operating characteristic curve (AUC) of 0.829 and a Cohen's Kappa value of 0.439 in the internal validation set, and an AUC of 0.745 and a Cohen's Kappa value of 0.315 in the external validation set. The decision curve analysis indicated good clinical utility of the model.
Conclusions: The XGBoost model can effectively predict prolonged hospitalization in asthma patients.
期刊介绍:
Providing an authoritative open forum on asthma and related conditions, Journal of Asthma publishes clinical research around such topics as asthma management, critical and long-term care, preventative measures, environmental counselling, and patient education.