Gibran Hemani, Apostolos Gkatzionis, Kate Tilling, George Davey Smith
{"title":"Sensitivity analyses gain relevance by fixing parameters observable during the empirical analyses","authors":"Gibran Hemani, Apostolos Gkatzionis, Kate Tilling, George Davey Smith","doi":"10.1002/gepi.22530","DOIUrl":null,"url":null,"abstract":"<p>In 2017 we presented the MR Steiger method, a sensitivity analysis in Mendelian randomization (MR) for inferring causal directions between variables (Hemani et al., <span>2017</span>). We discussed many of its potential limitations including that unmeasured confounding under certain extreme circumstances could lead to the wrong inferred causal direction. Lutz et al. (<span>2022</span>) propose an R package (UCRMS) for performing sensitivity analysis of the MR Steiger method, and use it in an illustration to suggest that the MR Steiger method has a ~90% chance of giving the wrong answer due to unmeasured confounding. In this note we will show that an error in their approach to sensitivity analysis leads to the wrong conclusion about the validity of the MR Steiger test. We provide a valid alternative which uses the observed data to investigate sensitivity to unmeasured confounding.</p><p>A sensitivity analysis aims to understand the degree to which a result can change due to uncertainties in the inputs (Saltelli, <span>2002</span>). In this case for the MR Steiger test, we need to ask how sensitive is the inference of the causal direction between X and Y to possible values of unmeasured confounders influencing X and Y. Importantly, there is relative certainty in many of the parameters of this system because they are easily observed, for example, the variances of X, Y and the instrumental variables (IVs), the estimated effect of the IVs on X and Y, and therefore the IV estimate of the effect of X on Y. Often the ordinary least squares (OLS) association between X and Y is also available either due to the analysis being performed using individual level data, or by sourcing the estimate from other published results. Therefore, an appropriate sensitivity analysis must explore the extent to which the inferred causal direction between X and Y can change due to unmeasured confounding, without causing these observed parameters to change.</p><p>Lutz et al.'s proposed method does not attempt to fix all observable parameters. In the simple example provided by Lutz et al. the variance of Y varies between 28 and 39, and the OLS estimate varies between 1 and −1 across the parameter values used for the sensitivity analysis. This arises because the residual variance—which is unobserved—is fixed in their approach. Instead the phenotypic variance—which is observed—should be fixed. If they were presenting a simulation of the general performance of MR Steiger under unmeasured confounding then it would not matter that the simulated parameters are not tied to those observed in a particular empirical analysis. However in a sensitivity analysis, allowing observed parameters to vary provides no value to the analyst. To say that unmeasured confounding could reverse the causal direction, provided that the variance of Y also changes drastically, is of little use to the researcher who has a data set with an observed variance of Y. If some quantities are observed (i.e. the regression coefficient for Y on X, the variance explained in X by the instrument, the variances of X and Y, and the IV effect estimate are all observed), allowing only <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> to vary and compensating through changing the residual variance, the SNP-outcome <math>\n <semantics>\n <mrow>\n <msup>\n <mi>R</mi>\n <mn>2</mn>\n </msup>\n </mrow>\n <annotation> ${R}^{2}$</annotation>\n </semantics></math> will not change under any set of <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> parameters (Supporting Information Note).</p><p>Briefly expanding on Lutz et al.'s analysis, they specified that for a causal effect of <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>xy</mi>\n </msub>\n <mo>=</mo>\n <mn>1</mn>\n </mrow>\n <annotation> ${\\beta }_{{xy}}=1$</annotation>\n </semantics></math>, there were specific unmeasured confounding parameters of <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n <mo>=</mo>\n <mo>−</mo>\n <mn>5</mn>\n </mrow>\n <annotation> ${\\beta }_{{ux}}=-5$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> ranged only between 0 and 11. Using these parameters they suggest that the MR Steiger method has a~90% chance of returning the incorrect causal direction. But if <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> were permitted the same ranges of values (e.g., −11 to 11) then the Steiger method would only return the incorrect result in 36% of confounding scenarios. If the range was restricted to −1 to 1 for <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>ux</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{ux}}$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <msub>\n <mi>β</mi>\n <mi>uy</mi>\n </msub>\n </mrow>\n <annotation> ${\\beta }_{{uy}}$</annotation>\n </semantics></math> each, then the wrong result would only be found in 0.02% of scenarios. In our 2017 paper (Supporting Information: note 3) we analyzed a much broader range of scenarios to comprehensively assess the degree to which unmeasured confounding could in general introduce a problem, and concluded that in most practical cases, where <math>\n <semantics>\n <mrow>\n <msubsup>\n <mpadded>\n <mi>R</mi>\n </mpadded>\n <mi>xy</mi>\n <mn>2</mn>\n </msubsup>\n <mo><</mo>\n <mn>0.2</mn>\n </mrow>\n <annotation> ${<mpadded xmlns=\"http://www.w3.org/1998/Math/MathML\">R</mpadded>}_{{xy}}^{2}\\lt 0.2$</annotation>\n </semantics></math>, the chance of unmeasured confounding leading to the incorrect causal direction was very small.</p><p>If analysts were motivated to check the sensitivity of MR Steiger to unmeasured confounding, a different approach is needed where one asks what values of unmeasured confounding support the inferred causal direction for a given set of empirically observed quantities (variances of X, Y and instrument, effects of instrument on X and Y, and OLS estimate of X on Y). Analysts can then determine whether the confounding values required to cast doubt on their conclusion are plausible. Alternatively one can determine what fraction of the possible confounding parameter space supports the inferred causal direction. In the Supplementary Note we provide an analytical solution to this problem. We illustrate that at the analysis-specific level the probability that unmeasured confounding will reverse the causal direction inferred by MR Steiger only exceeds a low probability when confounders explain large fractions of the variance in X and Y. The method is included in the TwoSampleMR package and is fast because it uses a closed form calculation rather than the stochastic simulation approach implemented by UCRMS.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 6","pages":"461-462"},"PeriodicalIF":1.7000,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22530","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22530","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
In 2017 we presented the MR Steiger method, a sensitivity analysis in Mendelian randomization (MR) for inferring causal directions between variables (Hemani et al., 2017). We discussed many of its potential limitations including that unmeasured confounding under certain extreme circumstances could lead to the wrong inferred causal direction. Lutz et al. (2022) propose an R package (UCRMS) for performing sensitivity analysis of the MR Steiger method, and use it in an illustration to suggest that the MR Steiger method has a ~90% chance of giving the wrong answer due to unmeasured confounding. In this note we will show that an error in their approach to sensitivity analysis leads to the wrong conclusion about the validity of the MR Steiger test. We provide a valid alternative which uses the observed data to investigate sensitivity to unmeasured confounding.
A sensitivity analysis aims to understand the degree to which a result can change due to uncertainties in the inputs (Saltelli, 2002). In this case for the MR Steiger test, we need to ask how sensitive is the inference of the causal direction between X and Y to possible values of unmeasured confounders influencing X and Y. Importantly, there is relative certainty in many of the parameters of this system because they are easily observed, for example, the variances of X, Y and the instrumental variables (IVs), the estimated effect of the IVs on X and Y, and therefore the IV estimate of the effect of X on Y. Often the ordinary least squares (OLS) association between X and Y is also available either due to the analysis being performed using individual level data, or by sourcing the estimate from other published results. Therefore, an appropriate sensitivity analysis must explore the extent to which the inferred causal direction between X and Y can change due to unmeasured confounding, without causing these observed parameters to change.
Lutz et al.'s proposed method does not attempt to fix all observable parameters. In the simple example provided by Lutz et al. the variance of Y varies between 28 and 39, and the OLS estimate varies between 1 and −1 across the parameter values used for the sensitivity analysis. This arises because the residual variance—which is unobserved—is fixed in their approach. Instead the phenotypic variance—which is observed—should be fixed. If they were presenting a simulation of the general performance of MR Steiger under unmeasured confounding then it would not matter that the simulated parameters are not tied to those observed in a particular empirical analysis. However in a sensitivity analysis, allowing observed parameters to vary provides no value to the analyst. To say that unmeasured confounding could reverse the causal direction, provided that the variance of Y also changes drastically, is of little use to the researcher who has a data set with an observed variance of Y. If some quantities are observed (i.e. the regression coefficient for Y on X, the variance explained in X by the instrument, the variances of X and Y, and the IV effect estimate are all observed), allowing only and to vary and compensating through changing the residual variance, the SNP-outcome will not change under any set of and parameters (Supporting Information Note).
Briefly expanding on Lutz et al.'s analysis, they specified that for a causal effect of , there were specific unmeasured confounding parameters of and ranged only between 0 and 11. Using these parameters they suggest that the MR Steiger method has a~90% chance of returning the incorrect causal direction. But if and were permitted the same ranges of values (e.g., −11 to 11) then the Steiger method would only return the incorrect result in 36% of confounding scenarios. If the range was restricted to −1 to 1 for and each, then the wrong result would only be found in 0.02% of scenarios. In our 2017 paper (Supporting Information: note 3) we analyzed a much broader range of scenarios to comprehensively assess the degree to which unmeasured confounding could in general introduce a problem, and concluded that in most practical cases, where , the chance of unmeasured confounding leading to the incorrect causal direction was very small.
If analysts were motivated to check the sensitivity of MR Steiger to unmeasured confounding, a different approach is needed where one asks what values of unmeasured confounding support the inferred causal direction for a given set of empirically observed quantities (variances of X, Y and instrument, effects of instrument on X and Y, and OLS estimate of X on Y). Analysts can then determine whether the confounding values required to cast doubt on their conclusion are plausible. Alternatively one can determine what fraction of the possible confounding parameter space supports the inferred causal direction. In the Supplementary Note we provide an analytical solution to this problem. We illustrate that at the analysis-specific level the probability that unmeasured confounding will reverse the causal direction inferred by MR Steiger only exceeds a low probability when confounders explain large fractions of the variance in X and Y. The method is included in the TwoSampleMR package and is fast because it uses a closed form calculation rather than the stochastic simulation approach implemented by UCRMS.
期刊介绍:
Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations.
Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.