Bayesian mixed models for longitudinal genetic data: theory, concepts, and simulation studies

Wonil Chung, Youngkwan Cho
{"title":"Bayesian mixed models for longitudinal genetic data: theory, concepts, and simulation studies","authors":"Wonil Chung, Youngkwan Cho","doi":"10.5808/gi.21080","DOIUrl":null,"url":null,"abstract":"Despite the success of recent genome-wide association studies investigating longitudinal traits, a large fraction of overall heritability remains unexplained. This suggests that some of the missing heritability may be accounted for by gene-gene and gene-time/environment interactions. In this paper, we develop a Bayesian variable selection method for longitudinal genetic data based on mixed models. The method jointly models the main effects and interactions of all candidate genetic variants and non-genetic factors and has higher statistical power than previous approaches. To account for the within-subject dependence structure, we propose a grid-based approach that models only one fixed-dimensional covariance matrix, which is thus applicable to data where subjects have different numbers of time points. We provide the theoretical basis of our Bayesian method and then illustrate its performance using data from the 1000 Genome Project with various simulation settings. Several simulation studies show that our multivariate method increases the statistical power compared to the corresponding univariate method and can detect gene-time/environment interactions well. We further evaluate our method with different numbers of individuals, variants, and causal variants, as well as different trait-heritability, and conclude that our method performs reasonably well with various simulation settings.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics & informatics","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.5808/gi.21080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Despite the success of recent genome-wide association studies investigating longitudinal traits, a large fraction of overall heritability remains unexplained. This suggests that some of the missing heritability may be accounted for by gene-gene and gene-time/environment interactions. In this paper, we develop a Bayesian variable selection method for longitudinal genetic data based on mixed models. The method jointly models the main effects and interactions of all candidate genetic variants and non-genetic factors and has higher statistical power than previous approaches. To account for the within-subject dependence structure, we propose a grid-based approach that models only one fixed-dimensional covariance matrix, which is thus applicable to data where subjects have different numbers of time points. We provide the theoretical basis of our Bayesian method and then illustrate its performance using data from the 1000 Genome Project with various simulation settings. Several simulation studies show that our multivariate method increases the statistical power compared to the corresponding univariate method and can detect gene-time/environment interactions well. We further evaluate our method with different numbers of individuals, variants, and causal variants, as well as different trait-heritability, and conclude that our method performs reasonably well with various simulation settings.
纵向遗传数据的贝叶斯混合模型:理论、概念和模拟研究
尽管最近研究纵向性状的全基因组关联研究取得了成功,但总体遗传性的很大一部分仍未得到解释。这表明一些缺失的遗传性可能是由基因-基因和基因-时间/环境相互作用造成的。本文提出了一种基于混合模型的纵向遗传数据贝叶斯变量选择方法。该方法联合建模了所有候选遗传变异和非遗传因素的主要影响和相互作用,具有比以往方法更高的统计能力。为了考虑受试者内部的依赖结构,我们提出了一种基于网格的方法,该方法仅对一个固定维协方差矩阵进行建模,因此适用于受试者具有不同数量时间点的数据。我们提供了我们的贝叶斯方法的理论基础,然后用不同模拟设置的1000基因组计划的数据来说明它的性能。多项模拟研究表明,与单变量方法相比,我们的多元方法提高了统计能力,可以很好地检测基因-时间/环境相互作用。我们进一步用不同数量的个体、变量和因果变量以及不同的性状遗传性来评估我们的方法,并得出结论,我们的方法在各种模拟设置下都表现得相当好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信