Whole genome-wide association study using affymetrix SNP chip: a two-stage sequential selection method to identify genes that increase the risk of developing complex diseases.

Howard H Yang, Nan Hu, Philip R Taylor, Maxwell P Lee
{"title":"Whole genome-wide association study using affymetrix SNP chip: a two-stage sequential selection method to identify genes that increase the risk of developing complex diseases.","authors":"Howard H Yang, Nan Hu, Philip R Taylor, Maxwell P Lee","doi":"10.1007/978-1-60327-148-6_2","DOIUrl":null,"url":null,"abstract":"<p><p>Whole-genome association studies of complex diseases hold great promise to identify systematically genetic loci that influence one's risk of developing these diseases. However, the polygenic nature of the complex diseases and genetic interactions among the genes pose significant challenge in both experimental design and data analysis. High-density genotype data make it possible to identify most of the genetic loci that may be involved in the etiology. On the other hand, utilizing large number of statistic tests could lead to false positives if the tests are not adequately adjusted. In this paper, we discuss a two-stage method that sequentially applies a generalized linear model (GLM) and principal components analysis (PCA) to identify genetic loci that jointly determine the likelihood of developing disease. The method was applied to a pilot case-control study of esophageal squamous cell carcinoma (ESCC) that included 50 ESCC patients and 50 neighborhood-matched controls. Genotype data were determined by using the Affymetrix 10K SNP chip. We will discuss some of the special considerations that are important to the proper interpretation of whole genome-wide association studies, which include multiple comparisons, epistatic interaction among multiple genetic loci, and generalization of predictive models.</p>","PeriodicalId":18460,"journal":{"name":"Methods in molecular medicine","volume":"141 ","pages":"23-35"},"PeriodicalIF":0.0000,"publicationDate":"2008-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/978-1-60327-148-6_2","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods in molecular medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-1-60327-148-6_2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

Abstract

Whole-genome association studies of complex diseases hold great promise to identify systematically genetic loci that influence one's risk of developing these diseases. However, the polygenic nature of the complex diseases and genetic interactions among the genes pose significant challenge in both experimental design and data analysis. High-density genotype data make it possible to identify most of the genetic loci that may be involved in the etiology. On the other hand, utilizing large number of statistic tests could lead to false positives if the tests are not adequately adjusted. In this paper, we discuss a two-stage method that sequentially applies a generalized linear model (GLM) and principal components analysis (PCA) to identify genetic loci that jointly determine the likelihood of developing disease. The method was applied to a pilot case-control study of esophageal squamous cell carcinoma (ESCC) that included 50 ESCC patients and 50 neighborhood-matched controls. Genotype data were determined by using the Affymetrix 10K SNP chip. We will discuss some of the special considerations that are important to the proper interpretation of whole genome-wide association studies, which include multiple comparisons, epistatic interaction among multiple genetic loci, and generalization of predictive models.

利用 affymetrix SNP 芯片进行全基因组关联研究:采用两阶段顺序选择法识别增加患复杂疾病风险的基因。
复杂疾病的全基因组关联研究有望系统地确定影响个人患病风险的基因位点。然而,复杂疾病的多基因性和基因间的遗传相互作用给实验设计和数据分析带来了巨大挑战。高密度的基因型数据使确定可能与病因有关的大部分基因位点成为可能。另一方面,如果不对测试进行适当调整,利用大量统计测试可能会导致假阳性。在本文中,我们讨论了一种两阶段方法,该方法依次应用广义线性模型(GLM)和主成分分析(PCA)来确定共同决定发病可能性的遗传位点。该方法被应用于一项食管鳞状细胞癌(ESCC)病例对照试验研究,其中包括 50 名 ESCC 患者和 50 名邻近匹配的对照组。基因型数据是通过 Affymetrix 10K SNP 芯片确定的。我们将讨论对正确解读全基因组关联研究非常重要的一些特殊考虑因素,其中包括多重比较、多个基因位点之间的表观相互作用以及预测模型的推广。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信