大规模单一中介假设检验的方法:可能的选择和比较

IF 1.7 4区 医学 Q3 GENETICS & HEREDITY
Jiacong Du, Xiang Zhou, Dylan Clark-Boucher, Wei Hao, Yongmei Liu, Jennifer A. Smith, Bhramar Mukherjee
{"title":"大规模单一中介假设检验的方法:可能的选择和比较","authors":"Jiacong Du,&nbsp;Xiang Zhou,&nbsp;Dylan Clark-Boucher,&nbsp;Wei Hao,&nbsp;Yongmei Liu,&nbsp;Jennifer A. Smith,&nbsp;Bhramar Mukherjee","doi":"10.1002/gepi.22510","DOIUrl":null,"url":null,"abstract":"<p>Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, <math>\n <semantics>\n <mrow>\n <msub>\n <mi>H</mi>\n \n <mn>0</mn>\n </msub>\n \n <mo>:</mo>\n \n <mi>α</mi>\n \n <mi>β</mi>\n \n <mo>=</mo>\n \n <mn>0</mn>\n </mrow>\n <annotation> ${H}_{0}:\\alpha \\beta =0$</annotation>\n </semantics></math> (<math>\n <semantics>\n <mrow>\n <mi>α</mi>\n </mrow>\n <annotation> $\\alpha $</annotation>\n </semantics></math>: effect of the exposure on the mediator after adjusting for confounders; <math>\n <semantics>\n <mrow>\n <mi>β</mi>\n </mrow>\n <annotation> $\\beta $</annotation>\n </semantics></math>: effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large-scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure-mediator interaction so that the product <math>\n <semantics>\n <mrow>\n <mi>α</mi>\n \n <mi>β</mi>\n </mrow>\n <annotation> $\\alpha \\beta $</annotation>\n </semantics></math> has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, (1) <math>\n <semantics>\n <mrow>\n <mi>α</mi>\n \n <mo>=</mo>\n \n <mn>0</mn>\n \n <mo>,</mo>\n \n <mi>β</mi>\n \n <mo>≠</mo>\n \n <mn>0</mn>\n </mrow>\n <annotation> $\\alpha =0,\\beta \\ne 0$</annotation>\n </semantics></math>; (2) <math>\n <semantics>\n <mrow>\n <mi>α</mi>\n \n <mo>≠</mo>\n \n <mn>0</mn>\n \n <mo>,</mo>\n \n <mi>β</mi>\n \n <mo>=</mo>\n \n <mn>0</mn>\n </mrow>\n <annotation> $\\alpha \\ne 0,\\beta =0$</annotation>\n </semantics></math>; and (3) <math>\n <semantics>\n <mrow>\n <mi>α</mi>\n \n <mo>=</mo>\n \n <mi>β</mi>\n \n <mo>=</mo>\n \n <mn>0</mn>\n </mrow>\n <annotation> $\\alpha =\\beta =0$</annotation>\n </semantics></math>. The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the three <i>p</i> values obtained under each case of the null so that the reference distribution of the composite statistic is approximately <math>\n <semantics>\n <mrow>\n <mi>U</mi>\n \n <mrow>\n <mo>(</mo>\n \n <mrow>\n <mn>0</mn>\n \n <mo>,</mo>\n \n <mn>1</mn>\n </mrow>\n \n <mo>)</mo>\n </mrow>\n </mrow>\n <annotation> $U(0,1)$</annotation>\n </semantics></math>. In addition to these existing methods, we developed the Sobel-comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates (FPRs) under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the FPRs at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi-Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R package <i>medScan</i> available on the CRAN for implementing all the six methods.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 2","pages":"167-184"},"PeriodicalIF":1.7000,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22510","citationCount":"2","resultStr":"{\"title\":\"Methods for large-scale single mediator hypothesis testing: Possible choices and comparisons\",\"authors\":\"Jiacong Du,&nbsp;Xiang Zhou,&nbsp;Dylan Clark-Boucher,&nbsp;Wei Hao,&nbsp;Yongmei Liu,&nbsp;Jennifer A. Smith,&nbsp;Bhramar Mukherjee\",\"doi\":\"10.1002/gepi.22510\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, <math>\\n <semantics>\\n <mrow>\\n <msub>\\n <mi>H</mi>\\n \\n <mn>0</mn>\\n </msub>\\n \\n <mo>:</mo>\\n \\n <mi>α</mi>\\n \\n <mi>β</mi>\\n \\n <mo>=</mo>\\n \\n <mn>0</mn>\\n </mrow>\\n <annotation> ${H}_{0}:\\\\alpha \\\\beta =0$</annotation>\\n </semantics></math> (<math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n </mrow>\\n <annotation> $\\\\alpha $</annotation>\\n </semantics></math>: effect of the exposure on the mediator after adjusting for confounders; <math>\\n <semantics>\\n <mrow>\\n <mi>β</mi>\\n </mrow>\\n <annotation> $\\\\beta $</annotation>\\n </semantics></math>: effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large-scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure-mediator interaction so that the product <math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n \\n <mi>β</mi>\\n </mrow>\\n <annotation> $\\\\alpha \\\\beta $</annotation>\\n </semantics></math> has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, (1) <math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n \\n <mo>=</mo>\\n \\n <mn>0</mn>\\n \\n <mo>,</mo>\\n \\n <mi>β</mi>\\n \\n <mo>≠</mo>\\n \\n <mn>0</mn>\\n </mrow>\\n <annotation> $\\\\alpha =0,\\\\beta \\\\ne 0$</annotation>\\n </semantics></math>; (2) <math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n \\n <mo>≠</mo>\\n \\n <mn>0</mn>\\n \\n <mo>,</mo>\\n \\n <mi>β</mi>\\n \\n <mo>=</mo>\\n \\n <mn>0</mn>\\n </mrow>\\n <annotation> $\\\\alpha \\\\ne 0,\\\\beta =0$</annotation>\\n </semantics></math>; and (3) <math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n \\n <mo>=</mo>\\n \\n <mi>β</mi>\\n \\n <mo>=</mo>\\n \\n <mn>0</mn>\\n </mrow>\\n <annotation> $\\\\alpha =\\\\beta =0$</annotation>\\n </semantics></math>. The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the three <i>p</i> values obtained under each case of the null so that the reference distribution of the composite statistic is approximately <math>\\n <semantics>\\n <mrow>\\n <mi>U</mi>\\n \\n <mrow>\\n <mo>(</mo>\\n \\n <mrow>\\n <mn>0</mn>\\n \\n <mo>,</mo>\\n \\n <mn>1</mn>\\n </mrow>\\n \\n <mo>)</mo>\\n </mrow>\\n </mrow>\\n <annotation> $U(0,1)$</annotation>\\n </semantics></math>. In addition to these existing methods, we developed the Sobel-comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates (FPRs) under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the FPRs at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi-Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R package <i>medScan</i> available on the CRAN for implementing all the six methods.</p>\",\"PeriodicalId\":12710,\"journal\":{\"name\":\"Genetic Epidemiology\",\"volume\":\"47 2\",\"pages\":\"167-184\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2022-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22510\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genetic Epidemiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22510\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22510","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 2

摘要

由于零假设h0的复合结构,对大量中介的中介假设检验具有挑战性:α β = 0 ${H}_{0}:\alpha \beta =0$ (α $\alpha $:调整混杂因素后暴露对介质的影响;β $\beta $:调整暴露和混杂因素后,介质对结果的影响)。在本文中,我们回顾了大规模一次中介假设检验的三种方法。这些方法通常用于连续结果和连续介质,假设没有暴露-介质相互作用,因此产品α β $\alpha \beta $作为间接效应具有因果解释。第一类方法忽略了复合零假设下不同结构的影响,即(1)α = 0;β≠0 $\alpha =0,\beta \ne 0$;(2) α≠0,β = 0 $\alpha \ne 0,\beta =0$;(3) α = β = 0 $\alpha =\beta =0$。第二类方法对每一种情况下的引用分布进行加权,形成混合引用分布。第三类利用在每一种null情况下得到的三个p值构造一个复合检验统计量,使复合统计量的参考分布近似为U (0);1) $U(0,1)$。在这些现有方法的基础上,我们开发了Sobel-comp方法,属于第二类,它使用一个修正的Sobel检验统计量的混合参考分布。我们进行了广泛的模拟研究,以比较属于这三种类别的所有六种方法在零假设下的假阳性率(fpr)和替代假设下的真阳性率。我们发现,在零假设下,使用混合参考分布的第二类方法可以最好地将fpr保持在名义水平上,并且在备择假设下具有最大的真阳性率。我们使用来自多种族动脉粥样硬化研究(MESA)的数据,应用所有方法研究DNA甲基化位点在成人社会经济地位到糖化血红蛋白水平通路中的中介机制。我们提供了在实践中选择最佳中介假设检验方法的指南,并在CRAN上开发了一个R包medScan,用于实现所有六种方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Methods for large-scale single mediator hypothesis testing: Possible choices and comparisons

Methods for large-scale single mediator hypothesis testing: Possible choices and comparisons

Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, H 0 : α β = 0 ${H}_{0}:\alpha \beta =0$ ( α $\alpha $ : effect of the exposure on the mediator after adjusting for confounders; β $\beta $ : effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large-scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure-mediator interaction so that the product α β $\alpha \beta $ has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, (1) α = 0 , β 0 $\alpha =0,\beta \ne 0$ ; (2) α 0 , β = 0 $\alpha \ne 0,\beta =0$ ; and (3) α = β = 0 $\alpha =\beta =0$ . The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the three p values obtained under each case of the null so that the reference distribution of the composite statistic is approximately U ( 0 , 1 ) $U(0,1)$ . In addition to these existing methods, we developed the Sobel-comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates (FPRs) under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the FPRs at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi-Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R package medScan available on the CRAN for implementing all the six methods.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Genetic Epidemiology
Genetic Epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
4.40
自引率
9.50%
发文量
49
审稿时长
6-12 weeks
期刊介绍: Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信