Marquitta J. White, Brian L. Yaspan, Olivia J. Veatch, Pagé Goddard, Oona S. Risse-Adams, Maria G. Contreras
{"title":"Strategies for Pathway Analysis Using GWAS and WGS Data","authors":"Marquitta J. White, Brian L. Yaspan, Olivia J. Veatch, Pagé Goddard, Oona S. Risse-Adams, Maria G. Contreras","doi":"10.1002/cphg.79","DOIUrl":null,"url":null,"abstract":"<p>Single-allele study designs, commonly used in genome-wide association studies (GWAS) as well as the more recently developed whole genome sequencing (WGS) studies, are a standard approach for investigating the relationship of common variation within the human genome to a given phenotype of interest. However, single-allele association results published for many GWAS studies represent only the tip of the iceberg for the information that can be extracted from these datasets. The primary analysis strategy for GWAS entails association analysis in which only the single nucleotide polymorphisms (SNPs) with the strongest <i>p-values</i> are declared statistically significant due to issues arising from multiple testing and type I errors. Factors such as locus heterogeneity, epistasis, and multiple genes conferring small effects contribute to the complexity of the genetic models underlying phenotype expression. Thus, many biologically meaningful associations having lower effect sizes at individual genes are overlooked, making it difficult to separate true associations from a sea of false-positive associations. Organizing these individual SNPs into biologically meaningful groups to look at the overall effects of minor perturbations to genes and pathways is desirable. This pathway-based approach provides researchers with insight into the functional foundations of the phenotype being studied and allows testing of various genetic scenarios. © 2018 by John Wiley & Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"100 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.79","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Protocols in Human Genetics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cphg.79","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24
Abstract
Single-allele study designs, commonly used in genome-wide association studies (GWAS) as well as the more recently developed whole genome sequencing (WGS) studies, are a standard approach for investigating the relationship of common variation within the human genome to a given phenotype of interest. However, single-allele association results published for many GWAS studies represent only the tip of the iceberg for the information that can be extracted from these datasets. The primary analysis strategy for GWAS entails association analysis in which only the single nucleotide polymorphisms (SNPs) with the strongest p-values are declared statistically significant due to issues arising from multiple testing and type I errors. Factors such as locus heterogeneity, epistasis, and multiple genes conferring small effects contribute to the complexity of the genetic models underlying phenotype expression. Thus, many biologically meaningful associations having lower effect sizes at individual genes are overlooked, making it difficult to separate true associations from a sea of false-positive associations. Organizing these individual SNPs into biologically meaningful groups to look at the overall effects of minor perturbations to genes and pathways is desirable. This pathway-based approach provides researchers with insight into the functional foundations of the phenotype being studied and allows testing of various genetic scenarios. © 2018 by John Wiley & Sons, Inc.
利用GWAS和WGS数据进行通路分析的策略
单等位基因研究设计通常用于全基因组关联研究(GWAS)以及最近开发的全基因组测序(WGS)研究,是研究人类基因组内常见变异与特定表型之间关系的标准方法。然而,许多GWAS研究发表的单等位基因关联结果只代表了从这些数据集中可以提取的信息的冰山一角。GWAS的主要分析策略需要关联分析,其中只有具有最强p值的单核苷酸多态性(snp)被宣布为统计显著,这是由于多次测试和I型错误引起的问题。基因座异质性、上位性和多基因赋予的小影响等因素导致了表型表达遗传模型的复杂性。因此,许多在单个基因上具有较低效应大小的具有生物学意义的关联被忽视了,这使得很难将真正的关联从假阳性关联的海洋中分离出来。将这些单个snp组织成具有生物学意义的群体,以观察轻微扰动对基因和途径的总体影响是可取的。这种基于途径的方法为研究人员提供了对正在研究的表型的功能基础的见解,并允许对各种遗传情景进行测试。©2018 by John Wiley &儿子,Inc。
本文章由计算机程序翻译,如有差异,请以英文原文为准。