Development and validation of a mortality risk prediction model for chronic obstructive pulmonary disease: a cross-sectional study using probabilistic graphical modelling.

IF 9.6 1区 医学 Q1 MEDICINE, GENERAL & INTERNAL
EClinicalMedicine Pub Date : 2024-08-22 eCollection Date: 2024-09-01 DOI:10.1016/j.eclinm.2024.102786
Tyler C Lovelace, Min Hyung Ryu, Minxue Jia, Peter Castaldi, Frank C Sciurba, Craig P Hersh, Panayiotis V Benos
{"title":"Development and validation of a mortality risk prediction model for chronic obstructive pulmonary disease: a cross-sectional study using probabilistic graphical modelling.","authors":"Tyler C Lovelace, Min Hyung Ryu, Minxue Jia, Peter Castaldi, Frank C Sciurba, Craig P Hersh, Panayiotis V Benos","doi":"10.1016/j.eclinm.2024.102786","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Chronic Obstructive Pulmonary Disease (COPD) is a leading cause of mortality. Predicting mortality risk in patients with COPD can be important for disease management strategies. Although all-cause mortality predictors have been developed previously, limited research exists on factors directly affecting COPD-specific mortality.</p><p><strong>Methods: </strong>In a retrospective study, we used probabilistic graphs to analyse clinical cross-sectional data (COPDGene cohort), including demographics, spirometry, quantitative chest imaging, and symptom features, as well as gene expression data. COPDGene recruited current and former smokers, aged 45-80 years with >10 pack-years smoking history, from across the USA (Phase 1, 11/2007-4/2011) and invited them for a follow-up visit (Phase 2, 7/2013-7/2017). ECLIPSE cohort recruited current and former smokers (COPD patients and controls from USA and Europe), aged 45-80 with smoking history >10 pack-years (12/2005-11/2007). We applied graphical models on multi-modal data COPDGene Phase 1 participants to identify factors directly affecting all-cause and COPD-specific mortality (primary outcomes); and on Phase 2 follow-up cohort to identify additional molecular and social factors affecting mortality. We used penalized Cox regression with features selected by the causal graph to build VAPORED, a mortality risk prediction model. VAPORED was compared to existing scores (BODE: BMI, airflow obstruction, dyspnoea, exercise capacity; ADO: age, dyspnoea, airflow obstruction) on the ability to rank individuals by mortality risk, using four evaluation metrics (concordance, concordance probability estimate (CPE), cumulative/dynamic (C/D) area under the receiver operating characteristic curve (AUC), and integrated C/D AUC). The results were validated in ECLIPSE.</p><p><strong>Findings: </strong>Graphical models, applied on the COPDGene Phase 1 samples (n = 8610), identified 11 and 7 variables directly linked to all-cause and COPD-specific mortality, respectively. Although many appear in both models, non-lung comorbidities appear only in the all-cause model, while forced vital capacity (FVC %predicted) appears in COPD-specific mortality model only. Additionally, the graph model of Phase 2 data (n = 3182) identified internet access, CD4 T cells and platelets to be linked to lower mortality risk. Furthermore, using the 7 variables linked to COPD-specific mortality (forced expiratory volume in 1 s/forced vital capacity (FEV<sub>1</sub>/FVC) ration, FVC %predicted, age, history of pneumonia, oxygen saturation, 6-min walk distance, dyspnoea) we developed <i>VAPORED</i> mortality risk score, which we validated on the ECLIPSE cohort (3-yr all-cause mortality data, n = 2312). VAPORED performed significantly better than ADO, BODE, and updated BODE indices in predicting all-cause mortality in ECLIPSE in terms of concordance (VAPORED [0.719] vs ADO [0.693; FDR p-value 0.014], BODE [0.695; FDR p-value 0.020], and updated BODE [0.694; FDR p-value 0.021]); CPE (VAPORED [0.714] vs ADO [0.673; FDR p-value <0.0001], BODE [0.662; FDR p-value <0.0001], and updated BODE [0.646; FDR p-value <0.0001]); 3-year C/D AUC (VAPORED [0.728] vs ADO [0.702; FDR p-value 0.017], BODE [0.704; FDR p-value 0.021], and updated BODE [0.703; FDR p-value 0.024]); integrated C/D AUC (VAPORED [0.723] vs ADO [0.698; FDR p-value 0.047], BODE [0.695; FDR p-value 0.024], and updated BODE [0.690; FDR p-value 0.021]). Finally, we developed a web tool to help clinicians calculate VAPORED mortality risk and compare it to ADO and BODE predictions.</p><p><strong>Interpretation: </strong>Our work is an important step towards improving our identification of high-risk patients and generating hypotheses of potential biological mechanisms and social factors driving mortality in patients with COPD at the population level. The main limitation of our study is the fact that the analysed datasets consist of older people with extensive smoking history and limited racial diversity. Thus, the results are relevant to high-risk individuals or those diagnosed with COPD and the VAPORED score is validated for them.</p><p><strong>Funding: </strong>This research was supported by NIH [NHLBI, NLM]. The COPDGene study is supported by the COPD Foundation, through grants from AstraZeneca, Bayer Pharmaceuticals, Boehringer Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer and Sunovion.</p>","PeriodicalId":11393,"journal":{"name":"EClinicalMedicine","volume":"75 ","pages":"102786"},"PeriodicalIF":9.6000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11388367/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EClinicalMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.eclinm.2024.102786","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Chronic Obstructive Pulmonary Disease (COPD) is a leading cause of mortality. Predicting mortality risk in patients with COPD can be important for disease management strategies. Although all-cause mortality predictors have been developed previously, limited research exists on factors directly affecting COPD-specific mortality.

Methods: In a retrospective study, we used probabilistic graphs to analyse clinical cross-sectional data (COPDGene cohort), including demographics, spirometry, quantitative chest imaging, and symptom features, as well as gene expression data. COPDGene recruited current and former smokers, aged 45-80 years with >10 pack-years smoking history, from across the USA (Phase 1, 11/2007-4/2011) and invited them for a follow-up visit (Phase 2, 7/2013-7/2017). ECLIPSE cohort recruited current and former smokers (COPD patients and controls from USA and Europe), aged 45-80 with smoking history >10 pack-years (12/2005-11/2007). We applied graphical models on multi-modal data COPDGene Phase 1 participants to identify factors directly affecting all-cause and COPD-specific mortality (primary outcomes); and on Phase 2 follow-up cohort to identify additional molecular and social factors affecting mortality. We used penalized Cox regression with features selected by the causal graph to build VAPORED, a mortality risk prediction model. VAPORED was compared to existing scores (BODE: BMI, airflow obstruction, dyspnoea, exercise capacity; ADO: age, dyspnoea, airflow obstruction) on the ability to rank individuals by mortality risk, using four evaluation metrics (concordance, concordance probability estimate (CPE), cumulative/dynamic (C/D) area under the receiver operating characteristic curve (AUC), and integrated C/D AUC). The results were validated in ECLIPSE.

Findings: Graphical models, applied on the COPDGene Phase 1 samples (n = 8610), identified 11 and 7 variables directly linked to all-cause and COPD-specific mortality, respectively. Although many appear in both models, non-lung comorbidities appear only in the all-cause model, while forced vital capacity (FVC %predicted) appears in COPD-specific mortality model only. Additionally, the graph model of Phase 2 data (n = 3182) identified internet access, CD4 T cells and platelets to be linked to lower mortality risk. Furthermore, using the 7 variables linked to COPD-specific mortality (forced expiratory volume in 1 s/forced vital capacity (FEV1/FVC) ration, FVC %predicted, age, history of pneumonia, oxygen saturation, 6-min walk distance, dyspnoea) we developed VAPORED mortality risk score, which we validated on the ECLIPSE cohort (3-yr all-cause mortality data, n = 2312). VAPORED performed significantly better than ADO, BODE, and updated BODE indices in predicting all-cause mortality in ECLIPSE in terms of concordance (VAPORED [0.719] vs ADO [0.693; FDR p-value 0.014], BODE [0.695; FDR p-value 0.020], and updated BODE [0.694; FDR p-value 0.021]); CPE (VAPORED [0.714] vs ADO [0.673; FDR p-value <0.0001], BODE [0.662; FDR p-value <0.0001], and updated BODE [0.646; FDR p-value <0.0001]); 3-year C/D AUC (VAPORED [0.728] vs ADO [0.702; FDR p-value 0.017], BODE [0.704; FDR p-value 0.021], and updated BODE [0.703; FDR p-value 0.024]); integrated C/D AUC (VAPORED [0.723] vs ADO [0.698; FDR p-value 0.047], BODE [0.695; FDR p-value 0.024], and updated BODE [0.690; FDR p-value 0.021]). Finally, we developed a web tool to help clinicians calculate VAPORED mortality risk and compare it to ADO and BODE predictions.

Interpretation: Our work is an important step towards improving our identification of high-risk patients and generating hypotheses of potential biological mechanisms and social factors driving mortality in patients with COPD at the population level. The main limitation of our study is the fact that the analysed datasets consist of older people with extensive smoking history and limited racial diversity. Thus, the results are relevant to high-risk individuals or those diagnosed with COPD and the VAPORED score is validated for them.

Funding: This research was supported by NIH [NHLBI, NLM]. The COPDGene study is supported by the COPD Foundation, through grants from AstraZeneca, Bayer Pharmaceuticals, Boehringer Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer and Sunovion.

慢性阻塞性肺病死亡风险预测模型的开发与验证:使用概率图形建模的横断面研究。
背景:慢性阻塞性肺病(COPD慢性阻塞性肺病(COPD)是导致死亡的主要原因。预测慢性阻塞性肺病患者的死亡风险对疾病管理策略非常重要。虽然此前已经开发出了全因死亡率预测指标,但对直接影响 COPD 特异性死亡率的因素研究有限:在一项回顾性研究中,我们使用概率图分析了临床横断面数据(COPDGene 队列),包括人口统计学、肺活量测定、胸部定量成像、症状特征以及基因表达数据。COPDGene 从美国各地招募了年龄在 45-80 岁、吸烟史大于 10 包年的现吸烟者和曾吸烟者(第一阶段,2007 年 11 月至 2011 年 4 月),并邀请他们进行随访(第二阶段,2013 年 7 月至 2017 年 7 月)。ECLIPSE 队列招募了年龄在 45-80 岁、吸烟史大于 10 包年(12/2005-11/2007)的现吸烟者和曾吸烟者(来自美国和欧洲的慢性阻塞性肺病患者和对照组)。我们对 COPDGene 第一阶段参与者的多模态数据应用了图形模型,以确定直接影响全因死亡率和 COPD 特异性死亡率(主要结果)的因素;并对第二阶段随访队列应用了图形模型,以确定影响死亡率的其他分子和社会因素。我们使用惩罚性 Cox 回归和因果图所选择的特征来建立死亡率风险预测模型 VAPORED。我们将 VAPORED 与现有的评分(BODE:体重指数、气流阻塞、呼吸困难、运动能力;ADO:年龄、呼吸困难、气流阻塞)进行了比较,通过四种评估指标(一致性、一致性概率估计值 (CPE)、接收器工作特征曲线下的累积/动态 (C/D) 面积 (AUC) 和综合 C/D AUC)对个人的死亡风险进行排序。结果在 ECLIPSE.Findings 中进行了验证:应用于 COPDGene 第一阶段样本(n = 8610)的图形模型分别确定了 11 个和 7 个与全因死亡率和 COPD 特异性死亡率直接相关的变量。尽管许多变量同时出现在两个模型中,但非肺部合并症仅出现在全因模型中,而用力肺活量(FVC 预测百分比)仅出现在慢性阻塞性肺病特异性死亡率模型中。此外,第 2 阶段数据图表模型(n = 3182)发现,互联网接入、CD4 T 细胞和血小板与较低的死亡风险有关。此外,利用与慢性阻塞性肺病特异性死亡率相关的 7 个变量(1 秒用力呼气容积/用力生命容量(FEV1/FVC)比率、FVC 预测百分比、年龄、肺炎病史、血氧饱和度、6 分钟步行距离、呼吸困难),我们制定了 VAPORED 死亡率风险评分,并在 ECLIPSE 队列(3 年全因死亡率数据,n = 2312)中进行了验证。在 ECLIPSE 中,VAPORED 在预测全因死亡率方面的一致性明显优于 ADO、BODE 和更新的 BODE 指数(VAPORED [0.719] vs ADO [0.693;FDR p-值 0.014]、BODE [0.695;FDR p-值 0.020]和更新 BODE [0.694;FDR p-值 0.021]);CPE(VAPORED [0.714] vs ADO [0.673;FDR p-值 解释:我们的工作迈出了重要一步,有助于我们更好地识别高危患者,并在人群水平上对导致慢性阻塞性肺病患者死亡的潜在生物机制和社会因素提出假设。我们研究的主要局限性在于所分析的数据集由吸烟史广泛且种族多样性有限的老年人组成。因此,研究结果只适用于高危人群或确诊为慢性阻塞性肺病的人群,VAPORED评分对他们是有效的:本研究得到了美国国立卫生研究院[NHLBI, NLM]的支持。COPDGene 研究由 COPD 基金会通过阿斯利康、拜耳医药、勃林格殷格翰、基因泰克、葛兰素史克、诺华、辉瑞和 Sunovion 提供资助。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
EClinicalMedicine
EClinicalMedicine Medicine-Medicine (all)
CiteScore
18.90
自引率
1.30%
发文量
506
审稿时长
22 days
期刊介绍: eClinicalMedicine is a gold open-access clinical journal designed to support frontline health professionals in addressing the complex and rapid health transitions affecting societies globally. The journal aims to assist practitioners in overcoming healthcare challenges across diverse communities, spanning diagnosis, treatment, prevention, and health promotion. Integrating disciplines from various specialties and life stages, it seeks to enhance health systems as fundamental institutions within societies. With a forward-thinking approach, eClinicalMedicine aims to redefine the future of healthcare.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信