A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization.

Ming Ding, Fei Zou
{"title":"A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization.","authors":"Ming Ding, Fei Zou","doi":"10.1101/2023.04.25.23289099","DOIUrl":null,"url":null,"abstract":"<p><p>Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < <i>ρ</i> <sup>2</sup> ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 <sup>-5</sup> from GWAS and allowing for low LD ( <i>ρ</i> <sup>2</sup> ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.</p>","PeriodicalId":18659,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168515/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.04.25.23289099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < ρ 2 ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 -5 from GWAS and allowing for low LD ( ρ 2 ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.

具有测量误差校正的线性混合模型:一种基于汇总数据的多变量孟德尔随机化方法。
基于汇总数据的多变量孟德尔随机化(MVMR)方法,如MVMR- egger、MVMR- ivw、MVMR-中位数法和MVMR- presso,可评估多种危险因素对疾病的因果影响。然而,对与风险因素有关的汇总统计的差异进行解释仍然是一项挑战。我们提出了一个测量误差校正的线性混合模型(LMM-MEC),该模型考虑了疾病结局和危险因素的汇总统计差异。在步骤1中,应用线性混合模型来解释疾病汇总统计中的方差。具体来说,如果疾病汇总统计中存在异质性,我们将其视为随机效应,并采用迭代重新加权的最小二乘算法来估计因果效应。在第二步中,我们将危险因素汇总统计中的方差视为多重测量误差,并采用回归校正方法对多重测量误差进行同步校正。在一项模拟研究中,当使用独立遗传变异作为工具变量(IV)时,我们的方法在无多效性或与结果平衡多效性的情况下表现出与现有MVMR方法相当的性能,并且在定向多效性下表现出更高的覆盖率和功率。当使用低至中度连锁不平衡(LD) (0 < ρ 2≤0.3)的遗传变异作为IVs时,也观察到类似的结果,尽管与使用独立遗传变异作为IVs相比,所有方法的覆盖率都有所降低。在应用研究中,我们检查了相关胆固醇生物标志物与寿命之间的因果关系。通过纳入基于GWAS的P值-5选择的739个遗传变异,并允许低LD (ρ 2≤0.1),我们的方法确定了高LDL-c与较低的长寿可能性存在因果关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信