{"title":"基于常规临床实验室参数的慢性阻塞性肺疾病急性加重的机器学习诊断模型","authors":"Youpeng Chen,Yabang Chen,Junquan Sun,Yifei Xie,Jiancai Lu,Enzhong Li,Qingqing Yang,Yu Guo,Jiana Zhang,Haojie Wu,Zhangkai J Cheng,Baoqing Sun","doi":"10.1111/nyas.70080","DOIUrl":null,"url":null,"abstract":"Acute exacerbation of chronic obstructive pulmonary disease (COPD), or AECOPD, significantly increases disease burden yet lacks objective diagnostic criteria. We aimed to develop a machine learning model for AECOPD diagnosis using routine laboratory parameters. We analyzed records from 25,965 COPD patients at the First Affiliated Hospital of Guangzhou Medical University, with patients randomized 7:3 into training and test cohorts. We evaluated 113 model combinations from 12 machine learning algorithms, assessing performance through receiver operating characteristic analysis, calibration curves, and decision curve analysis. The generalized linear model boosting + random forest (glmBoost + RF) model demonstrated excellent diagnostic performance (training area under the curve [AUC] = 0.993, test AUC = 0.834) utilizing only nine variables: age, lymphocyte percentage, calcium, hemoglobin, eosinophil percentage, potassium, platelet distribution width, monocytes count, and mean corpuscular hemoglobin concentration. This streamlined model showed performance comparable to the more complex Lasso + RF model (48 variables) with superior clinical applicability. Both models exhibited excellent calibration performance (mean absolute error = 0.012-0.013) and maintained consistent performance across gender-stratified populations. A machine learning model utilizing nine routine clinical laboratory parameters effectively distinguishes AECOPD from stable COPD, providing an objective diagnostic tool applicable across diverse healthcare settings, particularly in resource-limited facilities.","PeriodicalId":8250,"journal":{"name":"Annals of the New York Academy of Sciences","volume":"58 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning-Based Diagnostic Model for Acute Exacerbation of Chronic Obstructive Pulmonary Disease Using Routine Clinical Laboratory Parameters.\",\"authors\":\"Youpeng Chen,Yabang Chen,Junquan Sun,Yifei Xie,Jiancai Lu,Enzhong Li,Qingqing Yang,Yu Guo,Jiana Zhang,Haojie Wu,Zhangkai J Cheng,Baoqing Sun\",\"doi\":\"10.1111/nyas.70080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Acute exacerbation of chronic obstructive pulmonary disease (COPD), or AECOPD, significantly increases disease burden yet lacks objective diagnostic criteria. We aimed to develop a machine learning model for AECOPD diagnosis using routine laboratory parameters. We analyzed records from 25,965 COPD patients at the First Affiliated Hospital of Guangzhou Medical University, with patients randomized 7:3 into training and test cohorts. We evaluated 113 model combinations from 12 machine learning algorithms, assessing performance through receiver operating characteristic analysis, calibration curves, and decision curve analysis. The generalized linear model boosting + random forest (glmBoost + RF) model demonstrated excellent diagnostic performance (training area under the curve [AUC] = 0.993, test AUC = 0.834) utilizing only nine variables: age, lymphocyte percentage, calcium, hemoglobin, eosinophil percentage, potassium, platelet distribution width, monocytes count, and mean corpuscular hemoglobin concentration. This streamlined model showed performance comparable to the more complex Lasso + RF model (48 variables) with superior clinical applicability. Both models exhibited excellent calibration performance (mean absolute error = 0.012-0.013) and maintained consistent performance across gender-stratified populations. A machine learning model utilizing nine routine clinical laboratory parameters effectively distinguishes AECOPD from stable COPD, providing an objective diagnostic tool applicable across diverse healthcare settings, particularly in resource-limited facilities.\",\"PeriodicalId\":8250,\"journal\":{\"name\":\"Annals of the New York Academy of Sciences\",\"volume\":\"58 1\",\"pages\":\"\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of the New York Academy of Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1111/nyas.70080\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of the New York Academy of Sciences","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1111/nyas.70080","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Machine Learning-Based Diagnostic Model for Acute Exacerbation of Chronic Obstructive Pulmonary Disease Using Routine Clinical Laboratory Parameters.
Acute exacerbation of chronic obstructive pulmonary disease (COPD), or AECOPD, significantly increases disease burden yet lacks objective diagnostic criteria. We aimed to develop a machine learning model for AECOPD diagnosis using routine laboratory parameters. We analyzed records from 25,965 COPD patients at the First Affiliated Hospital of Guangzhou Medical University, with patients randomized 7:3 into training and test cohorts. We evaluated 113 model combinations from 12 machine learning algorithms, assessing performance through receiver operating characteristic analysis, calibration curves, and decision curve analysis. The generalized linear model boosting + random forest (glmBoost + RF) model demonstrated excellent diagnostic performance (training area under the curve [AUC] = 0.993, test AUC = 0.834) utilizing only nine variables: age, lymphocyte percentage, calcium, hemoglobin, eosinophil percentage, potassium, platelet distribution width, monocytes count, and mean corpuscular hemoglobin concentration. This streamlined model showed performance comparable to the more complex Lasso + RF model (48 variables) with superior clinical applicability. Both models exhibited excellent calibration performance (mean absolute error = 0.012-0.013) and maintained consistent performance across gender-stratified populations. A machine learning model utilizing nine routine clinical laboratory parameters effectively distinguishes AECOPD from stable COPD, providing an objective diagnostic tool applicable across diverse healthcare settings, particularly in resource-limited facilities.
期刊介绍:
Published on behalf of the New York Academy of Sciences, Annals of the New York Academy of Sciences provides multidisciplinary perspectives on research of current scientific interest with far-reaching implications for the wider scientific community and society at large. Each special issue assembles the best thinking of key contributors to a field of investigation at a time when emerging developments offer the promise of new insight. Individually themed, Annals special issues stimulate new ways to think about science by providing a neutral forum for discourse—within and across many institutions and fields.