A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization.

medRxiv : the preprint server for health sciences Pub Date : 2025-01-19 DOI:10.1101/2023.04.25.23289099

Ming Ding, Fei Zou

{"title":"A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization.","authors":"Ming Ding, Fei Zou","doi":"10.1101/2023.04.25.23289099","DOIUrl":null,"url":null,"abstract":"Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < ρ 2 ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 -5 from GWAS and allowing for low LD ( ρ 2 ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.","PeriodicalId":18659,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168515/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.04.25.23289099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < ρ ² ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 ^-5 from GWAS and allowing for low LD ( ρ ² ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.

查看原文本刊更多论文

具有测量误差校正的线性混合模型：一种基于汇总数据的多变量孟德尔随机化方法。

基于汇总数据的多变量孟德尔随机化（MVMR）方法，如MVMR- egger、MVMR- ivw、MVMR-中位数法和MVMR- presso，可评估多种危险因素对疾病的因果影响。然而，对与风险因素有关的汇总统计的差异进行解释仍然是一项挑战。我们提出了一个测量误差校正的线性混合模型（LMM-MEC），该模型考虑了疾病结局和危险因素的汇总统计差异。在步骤1中，应用线性混合模型来解释疾病汇总统计中的方差。具体来说，如果疾病汇总统计中存在异质性，我们将其视为随机效应，并采用迭代重新加权的最小二乘算法来估计因果效应。在第二步中，我们将危险因素汇总统计中的方差视为多重测量误差，并采用回归校正方法对多重测量误差进行同步校正。在一项模拟研究中，当使用独立遗传变异作为工具变量（IV）时，我们的方法在无多效性或与结果平衡多效性的情况下表现出与现有MVMR方法相当的性能，并且在定向多效性下表现出更高的覆盖率和功率。当使用低至中度连锁不平衡（LD）（0 < ρ 2≤0.3）的遗传变异作为IVs时，也观察到类似的结果，尽管与使用独立遗传变异作为IVs相比，所有方法的覆盖率都有所降低。在应用研究中，我们检查了相关胆固醇生物标志物与寿命之间的因果关系。通过纳入基于GWAS的P值-5选择的739个遗传变异，并允许低LD (ρ 2≤0.1)，我们的方法确定了高LDL-c与较低的长寿可能性存在因果关系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

medRxiv : the preprint server for health sciences

自引率

0.00%

发文量