Yiming Ma, Zijie Zhan, Yahong Chen, Jing Zhang, Wen Li, Zhiyi He, Jungang Xie, Haijin Zhao, Anping Xu, Kun Peng, Gang Wang, Qingping Zeng, Ting Yang, Yan Chen, Chen Wang
{"title":"Machine learning-assisted construction of COPD self-evaluation questionnaire (COPD-EQ): a national multicentre study in China.","authors":"Yiming Ma, Zijie Zhan, Yahong Chen, Jing Zhang, Wen Li, Zhiyi He, Jungang Xie, Haijin Zhao, Anping Xu, Kun Peng, Gang Wang, Qingping Zeng, Ting Yang, Yan Chen, Chen Wang","doi":"10.7189/jogh.15.04052","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Approximately 70% of chronic obstructive pulmonary disease (COPD) is underdiagnosed worldwide. We aimed to develop and validate a COPD self-evaluation questionnaire (COPD-EQ) that is better suited for COPD screening in China.</p><p><strong>Methods: </strong>We developed a primary version of COPD-EQ based on the Delphi method. Then, we conducted a nationwide multicentre prospective to validate our novel COPD-EQ screening ability. To improve the screening ability of COPD-EQ, we used a series of machine learning (ML)-based methods, including logistic regression, XgBoost, LightGBM, and CatBoost. These models were developed and then evaluated on a random 3:1 train/test split.</p><p><strong>Results: </strong>Through the Delphi approach, we developed the primary version of COPD-EQ with nine items. In the following prospective multicentre study, we recruited 1824 outpatients from 12 sites, of whom 404 (22.1%) were diagnosed with COPD. After the score assignment assisted by ML models and the Shapley Additive Explanation method, six of nine items were retained for a briefer version of COPD-EQ. The scoring-based method achieves an AUC score of 0.734 at a threshold of 4.0. Finally, a novel six-item COPD-EQ questionnaire was developed.</p><p><strong>Conclusions: </strong>The COPD-EQ questionnaire was validated to be reliable and accurate in COPD screening for the Chinese population. The ML model can further improve the questionnaire's screening ability.</p>","PeriodicalId":48734,"journal":{"name":"Journal of Global Health","volume":"15 ","pages":"04052"},"PeriodicalIF":4.5000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11699521/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Global Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.7189/jogh.15.04052","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Approximately 70% of chronic obstructive pulmonary disease (COPD) is underdiagnosed worldwide. We aimed to develop and validate a COPD self-evaluation questionnaire (COPD-EQ) that is better suited for COPD screening in China.
Methods: We developed a primary version of COPD-EQ based on the Delphi method. Then, we conducted a nationwide multicentre prospective to validate our novel COPD-EQ screening ability. To improve the screening ability of COPD-EQ, we used a series of machine learning (ML)-based methods, including logistic regression, XgBoost, LightGBM, and CatBoost. These models were developed and then evaluated on a random 3:1 train/test split.
Results: Through the Delphi approach, we developed the primary version of COPD-EQ with nine items. In the following prospective multicentre study, we recruited 1824 outpatients from 12 sites, of whom 404 (22.1%) were diagnosed with COPD. After the score assignment assisted by ML models and the Shapley Additive Explanation method, six of nine items were retained for a briefer version of COPD-EQ. The scoring-based method achieves an AUC score of 0.734 at a threshold of 4.0. Finally, a novel six-item COPD-EQ questionnaire was developed.
Conclusions: The COPD-EQ questionnaire was validated to be reliable and accurate in COPD screening for the Chinese population. The ML model can further improve the questionnaire's screening ability.
期刊介绍:
Journal of Global Health is a peer-reviewed journal published by the Edinburgh University Global Health Society, a not-for-profit organization registered in the UK. We publish editorials, news, viewpoints, original research and review articles in two issues per year.