Machine learning-assisted construction of COPD self-evaluation questionnaire (COPD-EQ): a national multicentre study in China.

IF 4.5 3区医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH

Journal of Global Health Pub Date : 2025-01-03 DOI:10.7189/jogh.15.04052

Yiming Ma, Zijie Zhan, Yahong Chen, Jing Zhang, Wen Li, Zhiyi He, Jungang Xie, Haijin Zhao, Anping Xu, Kun Peng, Gang Wang, Qingping Zeng, Ting Yang, Yan Chen, Chen Wang

{"title":"Machine learning-assisted construction of COPD self-evaluation questionnaire (COPD-EQ): a national multicentre study in China.","authors":"Yiming Ma, Zijie Zhan, Yahong Chen, Jing Zhang, Wen Li, Zhiyi He, Jungang Xie, Haijin Zhao, Anping Xu, Kun Peng, Gang Wang, Qingping Zeng, Ting Yang, Yan Chen, Chen Wang","doi":"10.7189/jogh.15.04052","DOIUrl":null,"url":null,"abstract":"Background: Approximately 70% of chronic obstructive pulmonary disease (COPD) is underdiagnosed worldwide. We aimed to develop and validate a COPD self-evaluation questionnaire (COPD-EQ) that is better suited for COPD screening in China.Methods: We developed a primary version of COPD-EQ based on the Delphi method. Then, we conducted a nationwide multicentre prospective to validate our novel COPD-EQ screening ability. To improve the screening ability of COPD-EQ, we used a series of machine learning (ML)-based methods, including logistic regression, XgBoost, LightGBM, and CatBoost. These models were developed and then evaluated on a random 3:1 train/test split.Results: Through the Delphi approach, we developed the primary version of COPD-EQ with nine items. In the following prospective multicentre study, we recruited 1824 outpatients from 12 sites, of whom 404 (22.1%) were diagnosed with COPD. After the score assignment assisted by ML models and the Shapley Additive Explanation method, six of nine items were retained for a briefer version of COPD-EQ. The scoring-based method achieves an AUC score of 0.734 at a threshold of 4.0. Finally, a novel six-item COPD-EQ questionnaire was developed.Conclusions: The COPD-EQ questionnaire was validated to be reliable and accurate in COPD screening for the Chinese population. The ML model can further improve the questionnaire's screening ability.","PeriodicalId":48734,"journal":{"name":"Journal of Global Health","volume":"15 ","pages":"04052"},"PeriodicalIF":4.5000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11699521/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Global Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.7189/jogh.15.04052","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Approximately 70% of chronic obstructive pulmonary disease (COPD) is underdiagnosed worldwide. We aimed to develop and validate a COPD self-evaluation questionnaire (COPD-EQ) that is better suited for COPD screening in China.

Methods: We developed a primary version of COPD-EQ based on the Delphi method. Then, we conducted a nationwide multicentre prospective to validate our novel COPD-EQ screening ability. To improve the screening ability of COPD-EQ, we used a series of machine learning (ML)-based methods, including logistic regression, XgBoost, LightGBM, and CatBoost. These models were developed and then evaluated on a random 3:1 train/test split.

Results: Through the Delphi approach, we developed the primary version of COPD-EQ with nine items. In the following prospective multicentre study, we recruited 1824 outpatients from 12 sites, of whom 404 (22.1%) were diagnosed with COPD. After the score assignment assisted by ML models and the Shapley Additive Explanation method, six of nine items were retained for a briefer version of COPD-EQ. The scoring-based method achieves an AUC score of 0.734 at a threshold of 4.0. Finally, a novel six-item COPD-EQ questionnaire was developed.

Conclusions: The COPD-EQ questionnaire was validated to be reliable and accurate in COPD screening for the Chinese population. The ML model can further improve the questionnaire's screening ability.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Global Health PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH -

CiteScore

6.10

自引率

2.80%

发文量

240

审稿时长

6 weeks

期刊介绍： Journal of Global Health is a peer-reviewed journal published by the Edinburgh University Global Health Society, a not-for-profit organization registered in the UK. We publish editorials, news, viewpoints, original research and review articles in two issues per year.