Trait selection strategy in multi-trait GWAS: Boosting SNP discoverability.

IF 3.3 Q2 GENETICS & HEREDITY

HGG Advances Pub Date : 2024-07-18 Epub Date: 2024-06-13 DOI:10.1016/j.xhgg.2024.100319

Yuka Suzuki, Hervé Ménager, Bryan Brancotte, Raphaël Vernet, Cyril Nerin, Christophe Boetto, Antoine Auvergne, Christophe Linhard, Rachel Torchet, Pierre Lechat, Lucie Troubat, Michael H Cho, Emmanuelle Bouzigon, Hugues Aschard, Hanna Julienne

{"title":"Trait selection strategy in multi-trait GWAS: Boosting SNP discoverability.","authors":"Yuka Suzuki, Hervé Ménager, Bryan Brancotte, Raphaël Vernet, Cyril Nerin, Christophe Boetto, Antoine Auvergne, Christophe Linhard, Rachel Torchet, Pierre Lechat, Lucie Troubat, Michael H Cho, Emmanuelle Bouzigon, Hugues Aschard, Hanna Julienne","doi":"10.1016/j.xhgg.2024.100319","DOIUrl":null,"url":null,"abstract":"Since the first genome-wide association studies (GWASs), thousands of variant-trait associations have been discovered. However, comprehensively mapping the genetic determinant of complex traits through univariate testing can require prohibitive sample sizes. Multi-trait GWAS can circumvent this issue and improve statistical power by leveraging the joint genetic architecture of human phenotypes. Although many methodological hurdles of multi-trait testing have been solved, the strategy to select traits has been overlooked. In this study, we conducted multi-trait GWAS on approximately 20,000 combinations of 72 traits using an omnibus test as implemented in the Joint Analysis of Summary Statistics. We assessed which genetic features of the sets of traits analyzed were associated with an increased detection of variants compared with univariate screening. Several features of the set of traits, including the heritability, the number of traits, and the genetic correlation, drive the multi-trait test gain. Using these features jointly in predictive models captures a large fraction of the power gain of the multi-trait test (Pearson's r between the observed and predicted gain equals 0.43, p < 1.6 × 10-60). Applying an alternative multi-trait approach (Multi-Trait Analysis of GWAS), we identified similar features of interest, but with an overall 70% lower number of new associations. Finally, selecting sets based on our data-driven models systematically outperformed the common strategy of selecting clinically similar traits. This work provides a unique picture of the determinant of multi-trait GWAS statistical power and outlines practical strategies for multi-trait testing.","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100319"},"PeriodicalIF":3.3000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11260573/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HGG Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xhgg.2024.100319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/13 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

Abstract

Since the first genome-wide association studies (GWASs), thousands of variant-trait associations have been discovered. However, comprehensively mapping the genetic determinant of complex traits through univariate testing can require prohibitive sample sizes. Multi-trait GWAS can circumvent this issue and improve statistical power by leveraging the joint genetic architecture of human phenotypes. Although many methodological hurdles of multi-trait testing have been solved, the strategy to select traits has been overlooked. In this study, we conducted multi-trait GWAS on approximately 20,000 combinations of 72 traits using an omnibus test as implemented in the Joint Analysis of Summary Statistics. We assessed which genetic features of the sets of traits analyzed were associated with an increased detection of variants compared with univariate screening. Several features of the set of traits, including the heritability, the number of traits, and the genetic correlation, drive the multi-trait test gain. Using these features jointly in predictive models captures a large fraction of the power gain of the multi-trait test (Pearson's r between the observed and predicted gain equals 0.43, p < 1.6 × 10^-60). Applying an alternative multi-trait approach (Multi-Trait Analysis of GWAS), we identified similar features of interest, but with an overall 70% lower number of new associations. Finally, selecting sets based on our data-driven models systematically outperformed the common strategy of selecting clinically similar traits. This work provides a unique picture of the determinant of multi-trait GWAS statistical power and outlines practical strategies for multi-trait testing.

查看原文本刊更多论文

多性状 GWAS 中的性状选择策略：提高 SNPs 的可发现性。

自首次开展全基因组关联研究（GWAS）以来，已经发现了数以千计的变异与性状的关联。然而，通过单变量测试来全面绘制复杂性状的遗传决定因素可能需要过大的样本量。多性状 GWAS 可以规避这一问题，并通过利用人类表型的联合遗传结构来提高统计能力。虽然多性状测试的许多方法障碍已经解决，但性状选择策略却一直被忽视。在本研究中，我们使用 JASS（汇总统计联合分析）中实施的综合测试，对 72 个性状的约 20,000 个组合进行了多性状 GWAS 分析。与单变量筛选相比，我们评估了所分析性状集的哪些遗传特征与变异检测的增加有关。性状集的几个特征，包括遗传率、性状数量和遗传相关性，都会影响多性状检验的收益。在预测模型中联合使用这些特征可以捕捉到多性状检验增益的很大一部分（观察增益和预测增益之间的皮尔逊γ值等于 0.43，P < 1.6 x 10-60）。采用另一种多性状方法（MTAG），我们发现了类似的兴趣特征，但新关联的总体数量减少了 70%。最后，根据我们的数据驱动模型选择数据集的效果明显优于选择临床相似性状的常见策略。这项工作为多性状 GWAS 统计能力的决定因素提供了一个独特的图景，并概述了多性状测试的实用策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊