Hypercubes to identify geomarkers of rapid cystic fibrosis lung disease progression.

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Yizi Cheng, Cole Brokamp, Erika Rasnick Manning, Elizabeth L Kramer, Patrick H Ryan, Rhonda D Szczesniak, Emrah Gecili
{"title":"Hypercubes to identify geomarkers of rapid cystic fibrosis lung disease progression.","authors":"Yizi Cheng, Cole Brokamp, Erika Rasnick Manning, Elizabeth L Kramer, Patrick H Ryan, Rhonda D Szczesniak, Emrah Gecili","doi":"10.1186/s12911-025-03097-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Prior research has shown that place-based environmental exposures and community characteristics, known as geomarkers, are associated with accelerated lung function decline and increased mortality in individuals with cystic fibrosis (CF). Although geomarkers have been linked to pulmonary outcomes in other respiratory diseases, it is unknown which have the greatest predictive power for rapid lung function decline in CF.</p><p><strong>Methods: </strong>We adapted an existing statistical procedure, which arranges candidate variables in a k-dimensional hypercube, where the hypercube forms a set of variables for a multi-stage selection process involving complex longitudinal data. We embedded the hypercube within a dynamic prediction model of rapid lung function decline, in order to accommodate complexity in lung function trajectories. This practical approach simultaneously selects a handful of genuinely predictive markers among candidates and accounts for complex correlations in longitudinal marker data. Our method is applied to actual geomarker and lung-function outcomes data from the existing Cystic Fibrosis Patient Registry and Cincinnati Cystic Fibrosis Center datasets.</p><p><strong>Results: </strong>We applied a 4 × 4 × 4 3-D hypercube to the national and local datasets and selected a subset of geomarkers using p-values from testing coefficients of the association between each geomarker and lung function decline in the dynamic prediction model. Based on the national data analyses, some road density-related geomarkers were selected, including some air pollution-related and greenspace-related variables. Simulations showed the proposed method's variable selection efficacy and robust performance in identifying true predictors, particularly under weak correlation (ρ≤0.6), although performance dipped with stronger correlations (ρ=0.9).</p><p><strong>Conclusions: </strong>The proposed method is a useful approach for selecting a small set of truly relevant demographic, clinical, and place-based predictors of rapid lung function decline while accounting for the complex correlations inherent in longitudinal lung-function data. We found that selection results differed according to spatial resolution of the geomarkers. Our findings have potential to improve care decisions for people with CF.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"304"},"PeriodicalIF":3.8000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12344988/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03097-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Prior research has shown that place-based environmental exposures and community characteristics, known as geomarkers, are associated with accelerated lung function decline and increased mortality in individuals with cystic fibrosis (CF). Although geomarkers have been linked to pulmonary outcomes in other respiratory diseases, it is unknown which have the greatest predictive power for rapid lung function decline in CF.

Methods: We adapted an existing statistical procedure, which arranges candidate variables in a k-dimensional hypercube, where the hypercube forms a set of variables for a multi-stage selection process involving complex longitudinal data. We embedded the hypercube within a dynamic prediction model of rapid lung function decline, in order to accommodate complexity in lung function trajectories. This practical approach simultaneously selects a handful of genuinely predictive markers among candidates and accounts for complex correlations in longitudinal marker data. Our method is applied to actual geomarker and lung-function outcomes data from the existing Cystic Fibrosis Patient Registry and Cincinnati Cystic Fibrosis Center datasets.

Results: We applied a 4 × 4 × 4 3-D hypercube to the national and local datasets and selected a subset of geomarkers using p-values from testing coefficients of the association between each geomarker and lung function decline in the dynamic prediction model. Based on the national data analyses, some road density-related geomarkers were selected, including some air pollution-related and greenspace-related variables. Simulations showed the proposed method's variable selection efficacy and robust performance in identifying true predictors, particularly under weak correlation (ρ≤0.6), although performance dipped with stronger correlations (ρ=0.9).

Conclusions: The proposed method is a useful approach for selecting a small set of truly relevant demographic, clinical, and place-based predictors of rapid lung function decline while accounting for the complex correlations inherent in longitudinal lung-function data. We found that selection results differed according to spatial resolution of the geomarkers. Our findings have potential to improve care decisions for people with CF.

超立方体识别囊性纤维化肺疾病快速进展的地理标志。
背景:先前的研究表明,在囊性纤维化(CF)患者中,基于地点的环境暴露和社区特征(称为地理标志)与肺功能加速衰退和死亡率增加有关。虽然地理标志与其他呼吸系统疾病的肺部预后有关,但尚不清楚哪些对慢性阻塞性肺疾病的肺功能快速下降具有最大的预测能力。方法:我们采用了现有的统计程序,该程序将候选变量安排在k维超立方体中,其中超立方体形成一组变量,用于涉及复杂纵向数据的多阶段选择过程。我们将超立方体嵌入到肺功能快速下降的动态预测模型中,以适应肺功能轨迹的复杂性。这种实用的方法同时在候选对象中选择少数真正具有预测性的标记,并在纵向标记数据中解释复杂的相关性。我们的方法应用于来自现有囊性纤维化患者登记处和辛辛那提囊性纤维化中心数据集的实际地理标记和肺功能结果数据。结果:我们对国家和地方数据集应用了一个4 × 4 × 4的三维超立方体,并使用动态预测模型中每个地质标志与肺功能下降之间关联的检验系数的p值选择了一个地质标志子集。在全国数据分析的基础上,选择了一些与道路密度相关的地理标志,包括一些与空气污染和绿地相关的变量。仿真结果表明,该方法在变量选择效率和识别真实预测因子方面表现出色,特别是在弱相关(ρ≤0.6)下,尽管在强相关(ρ=0.9)下性能有所下降。结论:所提出的方法是一种有用的方法,可以选择一组真正相关的人口统计学、临床和基于位置的肺功能快速衰退预测因子,同时考虑到纵向肺功能数据中固有的复杂相关性。研究发现,地理标志的空间分辨率不同,选择结果也不同。我们的研究结果有可能改善CF患者的护理决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信