Efficient identification of trait-associated loss-of-function variants in the UK Biobank cohort by exome-sequencing based genotype imputation

IF 1.7 4区 医学 Q3 GENETICS & HEREDITY
Wen-Yuan Yu, Shan-Shan Yan, Shu-Han Zhang, Jing-Jing Ni,  Bin-Li, Yu-Fang Pei, Lei Zhang
{"title":"Efficient identification of trait-associated loss-of-function variants in the UK Biobank cohort by exome-sequencing based genotype imputation","authors":"Wen-Yuan Yu,&nbsp;Shan-Shan Yan,&nbsp;Shu-Han Zhang,&nbsp;Jing-Jing Ni,&nbsp; Bin-Li,&nbsp;Yu-Fang Pei,&nbsp;Lei Zhang","doi":"10.1002/gepi.22511","DOIUrl":null,"url":null,"abstract":"<p>The large-scale open access whole-exome sequencing (WES) data of the UK Biobank ~200,000 participants is accelerating a new wave of genetic association studies aiming to identify rare and functional loss-of-function (LoF) variants associated with complex traits and diseases. We proposed to merge the WES genotypes and the genome-wide genotyping (GWAS) genotypes of 167,000 UKB homogeneous European participants into a combined reference panel, and then to impute 241,911 UKB homogeneous European participants who had the GWAS genotypes only. We then used the imputed data to replicate association identified in the discovery WES sample. The average imputation accuracy measure <i>r</i><sup>2</sup> is modest to high for LoF variants at all minor allele frequency intervals: 0.942 at MAF interval (0.01, 0.5), 0.807 at (1.0 × 10<sup>−3</sup>, 0.01), 0.805 at (1.0 × 10<sup>−4</sup>, 1.0 × 10<sup>−3</sup>), 0.664 at (1.0 × 10<sup>−5</sup>, 1.0 × 10<sup>−4</sup>) and 0.410 at (0, 1.0 × 10<sup>−5</sup>). As applications, we studied associations of LoF variants with estimated heel BMD and four lipid traits. In addition to replicating dozens of previously reported genes, we also identified three novel associations, two genes <i>PLIN1</i> and <i>ANGPTL3</i> for high-density-lipoprotein cholesterol and one gene <i>PDE3B</i> for triglycerides. Our results highlighted the strength of WES based genotype imputation as well as provided useful imputed data within the UKB cohort.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 2","pages":"121-134"},"PeriodicalIF":1.7000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22511","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 2

Abstract

The large-scale open access whole-exome sequencing (WES) data of the UK Biobank ~200,000 participants is accelerating a new wave of genetic association studies aiming to identify rare and functional loss-of-function (LoF) variants associated with complex traits and diseases. We proposed to merge the WES genotypes and the genome-wide genotyping (GWAS) genotypes of 167,000 UKB homogeneous European participants into a combined reference panel, and then to impute 241,911 UKB homogeneous European participants who had the GWAS genotypes only. We then used the imputed data to replicate association identified in the discovery WES sample. The average imputation accuracy measure r2 is modest to high for LoF variants at all minor allele frequency intervals: 0.942 at MAF interval (0.01, 0.5), 0.807 at (1.0 × 10−3, 0.01), 0.805 at (1.0 × 10−4, 1.0 × 10−3), 0.664 at (1.0 × 10−5, 1.0 × 10−4) and 0.410 at (0, 1.0 × 10−5). As applications, we studied associations of LoF variants with estimated heel BMD and four lipid traits. In addition to replicating dozens of previously reported genes, we also identified three novel associations, two genes PLIN1 and ANGPTL3 for high-density-lipoprotein cholesterol and one gene PDE3B for triglycerides. Our results highlighted the strength of WES based genotype imputation as well as provided useful imputed data within the UKB cohort.

通过基于外显子组测序的基因型插补,有效识别英国生物银行队列中与性状相关的功能丧失变异
英国生物银行(UK Biobank)约20万参与者的大规模开放获取全外显子组测序(WES)数据正在加速新一轮的遗传关联研究,旨在识别与复杂性状和疾病相关的罕见和功能性功能丧失(LoF)变异。我们建议将167,000名UKB同质欧洲参与者的WES基因型和GWAS基因型合并为一个联合参考面板,然后推算出241,911名只有GWAS基因型的UKB同质欧洲参与者。然后,我们使用输入的数据来复制发现WES样本中确定的关联。在所有次要等位基因频率区间,LoF变异的平均输入精度测量r2从中等到高:在MAF区间(0.01,0.5)0.942,在(1.0 × 10−3,0.01)0.807,在(1.0 × 10−4,1.0 × 10−3)0.805,在(1.0 × 10−5,1.0 × 10−4)0.664,在(0,1.0 × 10−5)0.410。作为应用,我们研究了LoF变异与估计的足跟骨密度和四种脂质性状的关系。除了复制数十个先前报道的基因外,我们还发现了三个新的关联,两个基因PLIN1和ANGPTL3与高密度脂蛋白胆固醇有关,一个基因PDE3B与甘油三酯有关。我们的结果强调了基于WES的基因型估算的强度,并在UKB队列中提供了有用的估算数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genetic Epidemiology
Genetic Epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
4.40
自引率
9.50%
发文量
49
审稿时长
6-12 weeks
期刊介绍: Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信