Sensitivity analyses gain relevance by fixing parameters observable during the empirical analyses

IF 1.7 4区 医学 Q3 GENETICS & HEREDITY
Gibran Hemani, Apostolos Gkatzionis, Kate Tilling, George Davey Smith
{"title":"Sensitivity analyses gain relevance by fixing parameters observable during the empirical analyses","authors":"Gibran Hemani,&nbsp;Apostolos Gkatzionis,&nbsp;Kate Tilling,&nbsp;George Davey Smith","doi":"10.1002/gepi.22530","DOIUrl":null,"url":null,"abstract":"<p>In 2017 we presented the MR Steiger method, a sensitivity analysis in Mendelian randomization (MR) for inferring causal directions between variables (Hemani et al., <span>2017</span>). We discussed many of its potential limitations including that unmeasured confounding under certain extreme circumstances could lead to the wrong inferred causal direction. Lutz et al. (<span>2022</span>) propose an R package (UCRMS) for performing sensitivity analysis of the MR Steiger method, and use it in an illustration to suggest that the MR Steiger method has a ~90% chance of giving the wrong answer due to unmeasured confounding. In this note we will show that an error in their approach to sensitivity analysis leads to the wrong conclusion about the validity of the MR Steiger test. We provide a valid alternative which uses the observed data to investigate sensitivity to unmeasured confounding.</p><p>A sensitivity analysis aims to understand the degree to which a result can change due to uncertainties in the inputs (Saltelli, <span>2002</span>). In this case for the MR Steiger test, we need to ask how sensitive is the inference of the causal direction between X and Y to possible values of unmeasured confounders influencing X and Y. Importantly, there is relative certainty in many of the parameters of this system because they are easily observed, for example, the variances of X, Y and the instrumental variables (IVs), the estimated effect of the IVs on X and Y, and therefore the IV estimate of the effect of X on Y. Often the ordinary least squares (OLS) association between X and Y is also available either due to the analysis being performed using individual level data, or by sourcing the estimate from other published results. Therefore, an appropriate sensitivity analysis must explore the extent to which the inferred causal direction between X and Y can change due to unmeasured confounding, without causing these observed parameters to change.</p><p>Lutz et al.'s proposed method does not attempt to fix all observable parameters. In the simple example provided by Lutz et al. the variance of Y varies between 28 and 39, and the OLS estimate varies between 1 and −1 across the parameter values used for the sensitivity analysis. This arises because the residual variance—which is unobserved—is fixed in their approach. Instead the phenotypic variance—which is observed—should be fixed. If they were presenting a simulation of the general performance of MR Steiger under unmeasured confounding then it would not matter that the simulated parameters are not tied to those observed in a particular empirical analysis. However in a sensitivity analysis, allowing observed parameters to vary provides no value to the analyst. To say that unmeasured confounding could reverse the causal direction, provided that the variance of Y also changes drastically, is of little use to the researcher who has a data set with an observed variance of Y. If some quantities are observed (i.e. the regression coefficient for Y on X, the variance explained in X by the instrument, the variances of X and Y, and the IV effect estimate are all observed), allowing only <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> to vary and compensating through changing the residual variance, the SNP-outcome <math>\n <semantics>\n <mrow>\n <msup>\n <mi>R</mi>\n <mn>2</mn>\n </msup>\n </mrow>\n <annotation> ${R}^{2}$</annotation>\n </semantics></math> will not change under any set of <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> parameters (Supporting Information Note).</p><p>Briefly expanding on Lutz et al.'s analysis, they specified that for a causal effect of <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>xy</mi>\n </msub>\n <mo>=</mo>\n <mn>1</mn>\n </mrow>\n <annotation> ${\\beta }_{{xy}}=1$</annotation>\n </semantics></math>, there were specific unmeasured confounding parameters of <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n <mo>=</mo>\n <mo>−</mo>\n <mn>5</mn>\n </mrow>\n <annotation> ${\\beta }_{{ux}}=-5$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> ranged only between 0 and 11. Using these parameters they suggest that the MR Steiger method has a~90% chance of returning the incorrect causal direction. But if <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> were permitted the same ranges of values (e.g., −11 to 11) then the Steiger method would only return the incorrect result in 36% of confounding scenarios. If the range was restricted to −1 to 1 for <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> each, then the wrong result would only be found in 0.02% of scenarios. In our 2017 paper (Supporting Information: note 3) we analyzed a much broader range of scenarios to comprehensively assess the degree to which unmeasured confounding could in general introduce a problem, and concluded that in most practical cases, where <math>\n <semantics>\n <mrow>\n <msubsup>\n <mpadded>\n <mi>R</mi>\n </mpadded>\n <mi>xy</mi>\n <mn>2</mn>\n </msubsup>\n <mo>&lt;</mo>\n <mn>0.2</mn>\n </mrow>\n <annotation> ${&lt;mpadded xmlns=\"http://www.w3.org/1998/Math/MathML\"&gt;R&lt;/mpadded&gt;}_{{xy}}^{2}\\lt 0.2$</annotation>\n </semantics></math>, the chance of unmeasured confounding leading to the incorrect causal direction was very small.</p><p>If analysts were motivated to check the sensitivity of MR Steiger to unmeasured confounding, a different approach is needed where one asks what values of unmeasured confounding support the inferred causal direction for a given set of empirically observed quantities (variances of X, Y and instrument, effects of instrument on X and Y, and OLS estimate of X on Y). Analysts can then determine whether the confounding values required to cast doubt on their conclusion are plausible. Alternatively one can determine what fraction of the possible confounding parameter space supports the inferred causal direction. In the Supplementary Note we provide an analytical solution to this problem. We illustrate that at the analysis-specific level the probability that unmeasured confounding will reverse the causal direction inferred by MR Steiger only exceeds a low probability when confounders explain large fractions of the variance in X and Y. The method is included in the TwoSampleMR package and is fast because it uses a closed form calculation rather than the stochastic simulation approach implemented by UCRMS.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 6","pages":"461-462"},"PeriodicalIF":1.7000,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22530","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22530","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

In 2017 we presented the MR Steiger method, a sensitivity analysis in Mendelian randomization (MR) for inferring causal directions between variables (Hemani et al., 2017). We discussed many of its potential limitations including that unmeasured confounding under certain extreme circumstances could lead to the wrong inferred causal direction. Lutz et al. (2022) propose an R package (UCRMS) for performing sensitivity analysis of the MR Steiger method, and use it in an illustration to suggest that the MR Steiger method has a ~90% chance of giving the wrong answer due to unmeasured confounding. In this note we will show that an error in their approach to sensitivity analysis leads to the wrong conclusion about the validity of the MR Steiger test. We provide a valid alternative which uses the observed data to investigate sensitivity to unmeasured confounding.

A sensitivity analysis aims to understand the degree to which a result can change due to uncertainties in the inputs (Saltelli, 2002). In this case for the MR Steiger test, we need to ask how sensitive is the inference of the causal direction between X and Y to possible values of unmeasured confounders influencing X and Y. Importantly, there is relative certainty in many of the parameters of this system because they are easily observed, for example, the variances of X, Y and the instrumental variables (IVs), the estimated effect of the IVs on X and Y, and therefore the IV estimate of the effect of X on Y. Often the ordinary least squares (OLS) association between X and Y is also available either due to the analysis being performed using individual level data, or by sourcing the estimate from other published results. Therefore, an appropriate sensitivity analysis must explore the extent to which the inferred causal direction between X and Y can change due to unmeasured confounding, without causing these observed parameters to change.

Lutz et al.'s proposed method does not attempt to fix all observable parameters. In the simple example provided by Lutz et al. the variance of Y varies between 28 and 39, and the OLS estimate varies between 1 and −1 across the parameter values used for the sensitivity analysis. This arises because the residual variance—which is unobserved—is fixed in their approach. Instead the phenotypic variance—which is observed—should be fixed. If they were presenting a simulation of the general performance of MR Steiger under unmeasured confounding then it would not matter that the simulated parameters are not tied to those observed in a particular empirical analysis. However in a sensitivity analysis, allowing observed parameters to vary provides no value to the analyst. To say that unmeasured confounding could reverse the causal direction, provided that the variance of Y also changes drastically, is of little use to the researcher who has a data set with an observed variance of Y. If some quantities are observed (i.e. the regression coefficient for Y on X, the variance explained in X by the instrument, the variances of X and Y, and the IV effect estimate are all observed), allowing only β uy ${\beta }_{{uy}}$ and β ux ${\beta }_{{ux}}$ to vary and compensating through changing the residual variance, the SNP-outcome R 2 ${R}^{2}$ will not change under any set of β uy ${\beta }_{{uy}}$ and β ux ${\beta }_{{ux}}$ parameters (Supporting Information Note).

Briefly expanding on Lutz et al.'s analysis, they specified that for a causal effect of β xy = 1 ${\beta }_{{xy}}=1$ , there were specific unmeasured confounding parameters of β ux = 5 ${\beta }_{{ux}}=-5$ and β uy ${\beta }_{{uy}}$ ranged only between 0 and 11. Using these parameters they suggest that the MR Steiger method has a~90% chance of returning the incorrect causal direction. But if β ux ${\beta }_{{ux}}$ and β uy ${\beta }_{{uy}}$ were permitted the same ranges of values (e.g., −11 to 11) then the Steiger method would only return the incorrect result in 36% of confounding scenarios. If the range was restricted to −1 to 1 for β ux ${\beta }_{{ux}}$ and β uy ${\beta }_{{uy}}$ each, then the wrong result would only be found in 0.02% of scenarios. In our 2017 paper (Supporting Information: note 3) we analyzed a much broader range of scenarios to comprehensively assess the degree to which unmeasured confounding could in general introduce a problem, and concluded that in most practical cases, where R xy 2 < 0.2 ${<mpadded xmlns="http://www.w3.org/1998/Math/MathML">R</mpadded>}_{{xy}}^{2}\lt 0.2$ , the chance of unmeasured confounding leading to the incorrect causal direction was very small.

If analysts were motivated to check the sensitivity of MR Steiger to unmeasured confounding, a different approach is needed where one asks what values of unmeasured confounding support the inferred causal direction for a given set of empirically observed quantities (variances of X, Y and instrument, effects of instrument on X and Y, and OLS estimate of X on Y). Analysts can then determine whether the confounding values required to cast doubt on their conclusion are plausible. Alternatively one can determine what fraction of the possible confounding parameter space supports the inferred causal direction. In the Supplementary Note we provide an analytical solution to this problem. We illustrate that at the analysis-specific level the probability that unmeasured confounding will reverse the causal direction inferred by MR Steiger only exceeds a low probability when confounders explain large fractions of the variance in X and Y. The method is included in the TwoSampleMR package and is fast because it uses a closed form calculation rather than the stochastic simulation approach implemented by UCRMS.

敏感性分析通过确定经验分析中可观察到的参数来获得相关性
2017年,我们提出了MR Steiger方法,这是一种孟德尔随机化(MR)的敏感性分析,用于推断变量之间的因果方向(Hemani et al., 2017)。我们讨论了它的许多潜在局限性,包括在某些极端情况下无法测量的混淆可能导致错误的推断因果方向。Lutz等人(2022)提出了一个R包(UCRMS),用于对MR Steiger方法进行敏感性分析,并在一个插图中使用它来表明MR Steiger方法有90%的机会由于未测量的混杂而给出错误的答案。在本笔记中,我们将表明,在他们的方法敏感性分析的错误导致错误的结论关于MR Steiger测试的有效性。我们提供了一种有效的替代方法,它使用观察到的数据来研究对未测量混杂的敏感性。敏感性分析旨在了解由于输入中的不确定性而导致结果变化的程度(Saltelli, 2002)。在MR Steiger检验的这种情况下,我们需要问X和Y之间因果方向的推断对影响X和Y的未测量混杂因素的可能值有多敏感。重要的是,该系统的许多参数具有相对确定性,因为它们很容易观察到,例如,X、Y和工具变量(IVs)的方差,IVs对X和Y的估计影响,因此,X对Y的影响的IV估计。通常,X和Y之间的普通最小二乘(OLS)关联也可以通过使用个人水平数据进行分析或通过从其他已发表的结果中获取估计而获得。因此,适当的敏感性分析必须探讨在不引起这些观测参数变化的情况下,推断出的X和Y之间的因果方向在多大程度上可能由于未测量的混杂而发生变化。Lutz等人提出的方法并不试图固定所有可观察的参数。在Lutz等人提供的简单示例中,Y的方差在28到39之间变化,用于敏感性分析的参数值的OLS估计值在1到−1之间变化。这是因为残差——未被观察到的——在他们的方法中是固定的。相反,观察到的表型差异应该是固定的。如果他们在未测量的混杂下对MR Steiger的一般性能进行模拟,那么模拟参数与在特定经验分析中观察到的参数无关紧要。然而,在敏感性分析中,允许观察到的参数变化对分析人员没有任何价值。假设Y的方差也急剧变化,那么说未测量的混杂可以逆转因果方向,对于拥有观测方差为Y的数据集的研究人员来说是没有多大用处的。如果观察到一些数量(即Y对X的回归系数、仪器在X中解释的方差、X和Y的方差以及IV效应估计都观察到),只允许β uy ${\beta}_{{uy}}$和β ux ${\beta}_{{ux}}$变化并通过改变残差方差进行补偿,snp结果r2 ${R}^{2}$在任何β y ${\beta}_{{y}}$和下都不会改变β ux ${\beta}_{{ux}}$ parameters(支持信息说明)。简要介绍Lutz等人。 在分析中,他们指出,对于β xy =1$ {\beta}_{{xy}}=1$的因果效应,β ux =−5$ {\beta}_{{ux}}=-5$和β y ${\beta有特定的未测量的混杂参数}_{{y}}$的取值范围在0到11之间。使用这些参数,他们建议MR Steiger方法有~90%的机会返回错误的因果方向。但是如果β ux ${\beta}_{{ux}}$和β uy ${\beta}_{{uy}}$被允许使用相同的值范围(例如:−11至11),那么Steiger方法只会在36%的混淆情况下返回不正确的结果。如果β ux ${\beta}_{{ux}}$和β uy ${\beta}_{{uy}}$的范围被限制为- 1到1,那么错误的结果只会出现在0.02%的场景中。在我们2017年的论文中(支持信息:注3)我们分析了更广泛的场景范围,以全面评估不可测量的混杂因素通常可能引入问题的程度,并得出结论,在大多数实际情况下,rxy 2 &lt;0.2$ {&lt; mpaddxmlns ="http://www.w3.org/1998/Math/MathML"&gt;R&lt;/mpadded&gt;}_{{xy}}^{2}\lt 0.2$,未测量的混淆导致错误因果方向的机会非常小。如果分析人员有动机检查MR Steiger对未测量混杂的敏感性,则需要采用不同的方法,询问未测量混杂的哪些值支持对给定经验观察数量(X, Y和工具的方差,工具对X和Y的影响,以及X对Y的OLS估计)的推断因果方向。然后分析人员可以确定对其结论提出怀疑所需的混杂值是否合理。或者,可以确定可能的混杂参数空间的多少部分支持推断的因果方向。在补充说明中,我们提供了这个问题的分析解决方案。我们说明,在分析特定水平上,当混杂因素解释X和y中的大部分方差时,未测量的混杂因素逆转MR Steiger推断的因果方向的概率仅超过低概率。该方法包含在TwoSampleMR包中,并且速度很快,因为它使用封闭形式计算而不是UCRMS实现的随机模拟方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genetic Epidemiology
Genetic Epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
4.40
自引率
9.50%
发文量
49
审稿时长
6-12 weeks
期刊介绍: Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信