Frequentist Grouped Weighted Quantile Sum Regression for Correlated Chemical Mixtures.

IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Daniel Rud, Md Mostafijur Rahman, Anny H Xiang, Rob McConnell, Fred Lurmann, Michael J Kleeman, Joel Schwartz, Zhanghua Chen, Sandy Eckel, Juan Pablo Lewinger
{"title":"Frequentist Grouped Weighted Quantile Sum Regression for Correlated Chemical Mixtures.","authors":"Daniel Rud, Md Mostafijur Rahman, Anny H Xiang, Rob McConnell, Fred Lurmann, Michael J Kleeman, Joel Schwartz, Zhanghua Chen, Sandy Eckel, Juan Pablo Lewinger","doi":"10.1002/sim.70078","DOIUrl":null,"url":null,"abstract":"<p><p>As individuals are exposed to a myriad of potentially harmful pollutants every day, it is important to determine which actors have the greatest influence on health outcomes. However, jointly modeling the associations of multiple pollutant exposures is often hindered by the presence of highly correlated chemicals originating from a common source. A popular approach to analyzing associations between a disease outcome and several highly correlated exposures is Weighted Quantile Sum Regression (WQSR) modeling. WQSR provides increased stability in estimating model parameters but requires data splitting to estimate individual and group effects of chemicals, which reduces the power of the approach. A recent Bayesian implementation of WQSR regression provides a model fitting procedure that avoids data splitting at the cost of high computational expense on large data. In this paper, we introduce a Frequentist Grouped Weighted Quantile Sum Regression (FGWQSR) model that can be fitted efficiently to large datasets without requiring data splitting. FGWQSR produces estimates of the joint effect of mixture groups and of individual chemicals, and likelihood-ratio-based tests that account for FGWQSR's non-standard asymptotics. We demonstrate that FGWQSR is well calibrated for type-I errors while outperforming both Bayesian Grouped Weighted Quantile Sum Regression and Quantile Logistic Regression in terms of statistical power to detect the effects of mixture groups and individual chemicals. In addition, we show that FGWQSR is robust to model misspecification and can be fitted on large datasets in a fraction of the time required for BGWQSR. We apply FGWQSR to a dataset of 317 767 mother-child pairs with exposure profiles generated by chemical transport models to study the associations between several components found in particulate matter with an aerodynamic diameter smaller than 2.5 <math> <semantics><mrow><mi>μ</mi> <mi>m</mi></mrow> <annotation>$$ \\mu \\mathrm{m} $$</annotation></semantics> </math> (PM <math> <semantics> <mrow><msub><mo> </mo> <mrow><mn>2</mn> <mo>.</mo> <mn>5</mn></mrow> </msub> </mrow> <annotation>$$ {}_{2.5} $$</annotation></semantics> </math> ) and child Autism Spectrum Disorder (ASD) diagnosis before age 5. PM <math> <semantics> <mrow><msub><mo> </mo> <mrow><mn>2</mn> <mo>.</mo> <mn>5</mn></mrow> </msub> </mrow> <annotation>$$ {}_{2.5} $$</annotation></semantics> </math> copper and PM <math> <semantics> <mrow><msub><mo> </mo> <mrow><mn>2</mn> <mo>.</mo> <mn>5</mn></mrow> </msub> </mrow> <annotation>$$ {}_{2.5} $$</annotation></semantics> </math> crustal material are found to be statistically significantly associated with ASD diagnosis by five years of age.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"44 7","pages":"e70078"},"PeriodicalIF":1.8000,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11987061/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/sim.70078","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

As individuals are exposed to a myriad of potentially harmful pollutants every day, it is important to determine which actors have the greatest influence on health outcomes. However, jointly modeling the associations of multiple pollutant exposures is often hindered by the presence of highly correlated chemicals originating from a common source. A popular approach to analyzing associations between a disease outcome and several highly correlated exposures is Weighted Quantile Sum Regression (WQSR) modeling. WQSR provides increased stability in estimating model parameters but requires data splitting to estimate individual and group effects of chemicals, which reduces the power of the approach. A recent Bayesian implementation of WQSR regression provides a model fitting procedure that avoids data splitting at the cost of high computational expense on large data. In this paper, we introduce a Frequentist Grouped Weighted Quantile Sum Regression (FGWQSR) model that can be fitted efficiently to large datasets without requiring data splitting. FGWQSR produces estimates of the joint effect of mixture groups and of individual chemicals, and likelihood-ratio-based tests that account for FGWQSR's non-standard asymptotics. We demonstrate that FGWQSR is well calibrated for type-I errors while outperforming both Bayesian Grouped Weighted Quantile Sum Regression and Quantile Logistic Regression in terms of statistical power to detect the effects of mixture groups and individual chemicals. In addition, we show that FGWQSR is robust to model misspecification and can be fitted on large datasets in a fraction of the time required for BGWQSR. We apply FGWQSR to a dataset of 317 767 mother-child pairs with exposure profiles generated by chemical transport models to study the associations between several components found in particulate matter with an aerodynamic diameter smaller than 2.5 μ m $$ \mu \mathrm{m} $$ (PM 2 . 5 $$ {}_{2.5} $$ ) and child Autism Spectrum Disorder (ASD) diagnosis before age 5. PM 2 . 5 $$ {}_{2.5} $$ copper and PM 2 . 5 $$ {}_{2.5} $$ crustal material are found to be statistically significantly associated with ASD diagnosis by five years of age.

相关化学混合物的频率组加权分位数和回归。
由于个人每天都暴露在无数可能有害的污染物中,确定哪些行为者对健康结果的影响最大是很重要的。然而,联合模拟多种污染物暴露之间的关联往往受到来自共同来源的高度相关化学品的阻碍。加权分位数和回归(WQSR)模型是分析疾病结果与几种高度相关暴露之间关系的常用方法。WQSR在估计模型参数方面提供了更高的稳定性,但需要进行数据分割来估计化学物质的个体和群体效应,这降低了该方法的有效性。最近的一种WQSR回归的贝叶斯实现提供了一种模型拟合过程,该过程避免了以大数据的高计算开销为代价的数据分割。在本文中,我们引入了一种频率组加权分位数和回归(FGWQSR)模型,该模型可以有效地拟合大型数据集,而无需进行数据分割。FGWQSR产生混合组和单个化学物质的联合效应的估计,以及基于似然比的测试,这些测试说明了FGWQSR的非标准渐近性。我们证明,FGWQSR可以很好地校准i型误差,同时在检测混合组和单个化学品影响的统计能力方面优于贝叶斯分组加权分位数和回归和分位数逻辑回归。此外,我们表明FGWQSR对模型错误规范具有鲁棒性,并且可以在BGWQSR所需时间的一小部分内拟合大型数据集。为了研究空气动力学直径小于2.5 μ m $$ \mu \mathrm{m} $$ (PM 2)的颗粒物中几种成分之间的关系,我们将FGWQSR应用于317767对母子的数据集,这些数据集具有化学传输模型生成的暴露剖面。5 $$ {}_{2.5} $$)和5岁前的儿童自闭症谱系障碍(ASD)诊断。下午2点。5 $$ {}_{2.5} $$铜和PM 2。5 $$ {}_{2.5} $$地壳物质与5岁前的ASD诊断有统计学显著相关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Statistics in Medicine
Statistics in Medicine 医学-公共卫生、环境卫生与职业卫生
CiteScore
3.40
自引率
10.00%
发文量
334
审稿时长
2-4 weeks
期刊介绍: The journal aims to influence practice in medicine and its associated sciences through the publication of papers on statistical and other quantitative methods. Papers will explain new methods and demonstrate their application, preferably through a substantive, real, motivating example or a comprehensive evaluation based on an illustrative example. Alternatively, papers will report on case-studies where creative use or technical generalizations of established methodology is directed towards a substantive application. Reviews of, and tutorials on, general topics relevant to the application of statistics to medicine will also be published. The main criteria for publication are appropriateness of the statistical methods to a particular medical problem and clarity of exposition. Papers with primarily mathematical content will be excluded. The journal aims to enhance communication between statisticians, clinicians and medical researchers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信