Spike and Slab Regression for Nonstationary Gaussian Linear Mixed Effects Modeling of Rapid Disease Progression

IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES
Environmetrics Pub Date : 2024-11-05 DOI:10.1002/env.2884
Emrah Gecili, Cole Brokamp, Özgür Asar, Eleni-Rosalina Andrinopoulou, John J. Brewington, Rhonda D. Szczesniak
{"title":"Spike and Slab Regression for Nonstationary Gaussian Linear Mixed Effects Modeling of Rapid Disease Progression","authors":"Emrah Gecili,&nbsp;Cole Brokamp,&nbsp;Özgür Asar,&nbsp;Eleni-Rosalina Andrinopoulou,&nbsp;John J. Brewington,&nbsp;Rhonda D. Szczesniak","doi":"10.1002/env.2884","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Select measures of social and environmental determinants of health (referred to as “geomarkers”), predict rapid lung function decline in cystic fibrosis (CF), defined as a prolonged decline relative to patient and/or center-level norms. The extent to which hyper-localization, defined as increasing the spatiotemporal precision of geomarkers, aids in prediction of rapid lung decline remains unclear. Linear mixed effects (LME) models with specialized covariance functions have been used for predicting rapid lung function decline, but there are few options to properly incorporate spatial correlation into the covariance functions while inducing simultaneous variable selection. Our innovative Bayesian model uses a spike and slab prior for simultaneous variable selection and offers additional advantages when coupled with nonstationary Gaussian LME modeling. This model also incorporates spatial correlation through an additional random effect term that accounts for spatial correlation based on ZIP code distances. We validated the model with simulations and applied it to real CF data from a Midwestern CF Center. We demonstrate how a combination of demographic, clinical, and geomarker variables can be selected as optimal predictors using Bayesian false discovery rate controlling rule. Our results indicate that incorporating spatiotemporal effects and geomarkers into this novel Bayesian stochastic LME model enhances the dynamic prediction of rapid CF disease progression.</p>\n </div>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmetrics","FirstCategoryId":"93","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/env.2884","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Select measures of social and environmental determinants of health (referred to as “geomarkers”), predict rapid lung function decline in cystic fibrosis (CF), defined as a prolonged decline relative to patient and/or center-level norms. The extent to which hyper-localization, defined as increasing the spatiotemporal precision of geomarkers, aids in prediction of rapid lung decline remains unclear. Linear mixed effects (LME) models with specialized covariance functions have been used for predicting rapid lung function decline, but there are few options to properly incorporate spatial correlation into the covariance functions while inducing simultaneous variable selection. Our innovative Bayesian model uses a spike and slab prior for simultaneous variable selection and offers additional advantages when coupled with nonstationary Gaussian LME modeling. This model also incorporates spatial correlation through an additional random effect term that accounts for spatial correlation based on ZIP code distances. We validated the model with simulations and applied it to real CF data from a Midwestern CF Center. We demonstrate how a combination of demographic, clinical, and geomarker variables can be selected as optimal predictors using Bayesian false discovery rate controlling rule. Our results indicate that incorporating spatiotemporal effects and geomarkers into this novel Bayesian stochastic LME model enhances the dynamic prediction of rapid CF disease progression.

快速疾病进展的非平稳高斯线性混合效应模型的尖峰和平板回归
选择健康的社会和环境决定因素(称为“地理标志”)的措施,预测囊性纤维化(CF)的肺功能快速下降,定义为相对于患者和/或中心水平标准的长期下降。高度定位被定义为提高地理标记物的时空精度,在多大程度上有助于预测肺功能的快速衰退,目前尚不清楚。具有专门协方差函数的线性混合效应(LME)模型已被用于预测肺功能的快速衰退,但在诱导同步变量选择的同时,很少有办法将空间相关性适当地纳入协方差函数。我们创新的贝叶斯模型使用峰值和slab先验来同时选择变量,并且在与非平稳高斯LME建模相结合时提供额外的优势。该模型还通过一个额外的随机效应项来考虑基于邮政编码距离的空间相关性,从而结合了空间相关性。我们通过模拟验证了该模型,并将其应用于中西部CF中心的真实CF数据。我们演示了如何使用贝叶斯错误发现率控制规则选择人口统计、临床和地理标记变量的组合作为最佳预测因子。我们的研究结果表明,将时空效应和地理标记纳入这种新的贝叶斯随机LME模型可以增强对CF快速疾病进展的动态预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmetrics
Environmetrics 环境科学-环境科学
CiteScore
2.90
自引率
17.60%
发文量
67
审稿时长
18-36 weeks
期刊介绍: Environmetrics, the official journal of The International Environmetrics Society (TIES), an Association of the International Statistical Institute, is devoted to the dissemination of high-quality quantitative research in the environmental sciences. The journal welcomes pertinent and innovative submissions from quantitative disciplines developing new statistical and mathematical techniques, methods, and theories that solve modern environmental problems. Articles must proffer substantive, new statistical or mathematical advances to answer important scientific questions in the environmental sciences, or must develop novel or enhanced statistical methodology with clear applications to environmental science. New methods should be illustrated with recent environmental data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信