Genealogy-based trait association with LOCATER boosts power at loci with allelic heterogeneity.

IF 5.5 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Xinxin Wang, Ryan Christ, Erica Young, Chul Joo Kang, Indraniel Das, Edward A Belter, Markku Laakso, Louis J M Aslett, David Steinsaltz, Nathan O Stitziel, Ira M Hall
{"title":"Genealogy-based trait association with LOCATER boosts power at loci with allelic heterogeneity.","authors":"Xinxin Wang, Ryan Christ, Erica Young, Chul Joo Kang, Indraniel Das, Edward A Belter, Markku Laakso, Louis J M Aslett, David Steinsaltz, Nathan O Stitziel, Ira M Hall","doi":"10.1101/gr.280372.124","DOIUrl":null,"url":null,"abstract":"<p><p>A key methodological challenge for genome-wide association studies is how to leverage haplotype diversity and allelic heterogeneity to improve trait association power, especially in noncoding regions where it is difficult to predict variant impacts and define functional units for variant aggregation. Genealogy-based association methods have the potential to bridge this gap by testing combinations of common and rare haplotypes based purely on their ancestral relationships. In parallel work, we have developed an efficient local ancestry inference engine and a novel statistical method (LOCATER) for combining signals present on different branches of a locus-specific haplotype tree. Here, we develop a genome-wide LOCATER analysis pipeline and apply it to a genome sequencing study of 6795 Finnish individuals with 101 cardiometabolic traits and 18.9 million autosomal variants. We identify 351 significant trait associations at 47 distinct genomic loci and find that LOCATER boosts the single marker test (SMT) association signal at five loci by combining independent signals from distinct alleles. LOCATER successfully recovers known quantitative trait loci not found by SMT, including <i>LIPG</i>, recovers known allelic heterogeneity at the <i>APOE/C1/C4/C2</i> gene cluster, and suggests one novel association. We find that confounders have a more pronounced effect on genealogy-based methods than SMT, and we propose a new randomization approach and a general method for genomic control to eliminate their effects. This study demonstrates that genealogy-based methods such as LOCATER excel when multiple causal variants are present and suggests that their application to larger and more diverse cohorts will be fruitful.</p>","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"976-991"},"PeriodicalIF":5.5000,"publicationDate":"2026-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1101/gr.280372.124","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

A key methodological challenge for genome-wide association studies is how to leverage haplotype diversity and allelic heterogeneity to improve trait association power, especially in noncoding regions where it is difficult to predict variant impacts and define functional units for variant aggregation. Genealogy-based association methods have the potential to bridge this gap by testing combinations of common and rare haplotypes based purely on their ancestral relationships. In parallel work, we have developed an efficient local ancestry inference engine and a novel statistical method (LOCATER) for combining signals present on different branches of a locus-specific haplotype tree. Here, we develop a genome-wide LOCATER analysis pipeline and apply it to a genome sequencing study of 6795 Finnish individuals with 101 cardiometabolic traits and 18.9 million autosomal variants. We identify 351 significant trait associations at 47 distinct genomic loci and find that LOCATER boosts the single marker test (SMT) association signal at five loci by combining independent signals from distinct alleles. LOCATER successfully recovers known quantitative trait loci not found by SMT, including LIPG, recovers known allelic heterogeneity at the APOE/C1/C4/C2 gene cluster, and suggests one novel association. We find that confounders have a more pronounced effect on genealogy-based methods than SMT, and we propose a new randomization approach and a general method for genomic control to eliminate their effects. This study demonstrates that genealogy-based methods such as LOCATER excel when multiple causal variants are present and suggests that their application to larger and more diverse cohorts will be fruitful.

基于家谱的LOCATER性状关联提高了等位基因异质性位点的能力。
全基因组关联研究的一个关键方法挑战是如何利用单倍型多样性和等位基因异质性来提高性状关联能力,特别是在难以预测变异影响和定义变异聚集功能单元的非编码区域。基于家谱的关联方法有可能通过纯粹基于祖先关系测试常见和罕见单倍型的组合来弥合这一差距。在并行工作中,我们开发了一种高效的局部祖先推断引擎和一种新的统计方法(LOCATER),用于组合存在于位点特异性单倍型树的不同分支上的信号。在这里,我们开发了一个全基因组LOCATER分析管道,并将其应用于6795名芬兰个体的基因组测序研究,这些个体具有101个心脏代谢特征和1890万个常染色体变异。我们在47个不同的基因组位点上发现了351个显著的性状关联,并发现LOCATER通过组合来自不同等位基因的独立信号,增强了5个位点上的单标记测试(SMT)关联信号。LOCATER成功恢复了SMT未发现的已知数量性状位点,包括LIPG,恢复了APOE/C1/C4/C2基因簇上已知的等位基因异质性,并提出了一种新的关联。我们发现混杂因素对基于家谱的方法的影响比SMT更明显,我们提出了一种新的随机化方法和一种通用的基因组控制方法来消除它们的影响。这项研究表明,当存在多个因果变量时,基于家谱的方法(如LOCATER)表现出色,并表明将其应用于更大、更多样化的队列将是富有成效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genome research
Genome research 生物-生化与分子生物学
CiteScore
12.40
自引率
1.40%
发文量
140
审稿时长
6 months
期刊介绍: Launched in 1995, Genome Research is an international, continuously published, peer-reviewed journal that focuses on research that provides novel insights into the genome biology of all organisms, including advances in genomic medicine. Among the topics considered by the journal are genome structure and function, comparative genomics, molecular evolution, genome-scale quantitative and population genetics, proteomics, epigenomics, and systems biology. The journal also features exciting gene discoveries and reports of cutting-edge computational biology and high-throughput methodologies. New data in these areas are published as research papers, or methods and resource reports that provide novel information on technologies or tools that will be of interest to a broad readership. Complete data sets are presented electronically on the journal''s web site where appropriate. The journal also provides Reviews, Perspectives, and Insight/Outlook articles, which present commentary on the latest advances published both here and elsewhere, placing such progress in its broader biological context.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书