Efficient SNP-based tests of association for quantitative phenotypes using pooled DNA

Joel S. Bader, Aruna Bansal, Pak Sham
{"title":"Efficient SNP-based tests of association for quantitative phenotypes using pooled DNA","authors":"Joel S. Bader,&nbsp;Aruna Bansal,&nbsp;Pak Sham","doi":"10.1046/j.1466-920x.2001.00036.x","DOIUrl":null,"url":null,"abstract":"<p><b>Introduction </b> Genetic factors underlying complex diseases are difficult to identify: many polymorphisms may contribute, each having a small effect and low penetrance. These factors may be identified by association studies of large populations, an alternative to family-based linkage studies. Allele frequency measurements of pooled DNA selected from population-level DNA repositories can reduce the costs of these studies. We provide guidance for selecting unrelated individuals for pooling and for comparing the power of studies based on pooled measurements to the power of individual genotyping, particularly for studies using single-nucleotide polymorphism (SNP) markers.</p><p><b>Materials and methods </b> We used exact numerical calculations to set pooling criteria that maximized the power to detect association as a function of marker frequency, inheritance mode, and additive variance. Analytical approximations are also provided.</p><p><b>Results and discussion </b> Power estimates are provided for two pooled DNA designs: the classification of individuals as affected or unaffected, analogous to a case-control design, and the optimized selection of individuals with extreme phenotypic values. Optimized selection is approximately fourfold more efficient than affected/unaffected classification. The optimal design for most markers is to pool the top and bottom 27% of individuals. Neglecting experimental measurement error, this design requires a population only 1.24-fold larger than that required for individual genotyping. When measurement error is included, the pooled DNA association test serves better as a pre-screen to identify candidate markers which then proceed to individual genotyping. This strategy can still provide a 100-fold savings over individual genotyping.</p>","PeriodicalId":100575,"journal":{"name":"GeneScreen","volume":"1 3","pages":"143-150"},"PeriodicalIF":0.0000,"publicationDate":"2002-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1046/j.1466-920x.2001.00036.x","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GeneScreen","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1046/j.1466-920x.2001.00036.x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 29

Abstract

Introduction Genetic factors underlying complex diseases are difficult to identify: many polymorphisms may contribute, each having a small effect and low penetrance. These factors may be identified by association studies of large populations, an alternative to family-based linkage studies. Allele frequency measurements of pooled DNA selected from population-level DNA repositories can reduce the costs of these studies. We provide guidance for selecting unrelated individuals for pooling and for comparing the power of studies based on pooled measurements to the power of individual genotyping, particularly for studies using single-nucleotide polymorphism (SNP) markers.

Materials and methods We used exact numerical calculations to set pooling criteria that maximized the power to detect association as a function of marker frequency, inheritance mode, and additive variance. Analytical approximations are also provided.

Results and discussion Power estimates are provided for two pooled DNA designs: the classification of individuals as affected or unaffected, analogous to a case-control design, and the optimized selection of individuals with extreme phenotypic values. Optimized selection is approximately fourfold more efficient than affected/unaffected classification. The optimal design for most markers is to pool the top and bottom 27% of individuals. Neglecting experimental measurement error, this design requires a population only 1.24-fold larger than that required for individual genotyping. When measurement error is included, the pooled DNA association test serves better as a pre-screen to identify candidate markers which then proceed to individual genotyping. This strategy can still provide a 100-fold savings over individual genotyping.

使用混合DNA进行定量表型关联的高效snp检测
复杂疾病的遗传因素很难确定:许多多态性可能起作用,每个都有小的影响和低外显率。这些因素可以通过大规模人群的关联研究来确定,这是基于家庭的关联研究的另一种选择。从人群水平的DNA库中选择混合DNA的等位基因频率测量可以降低这些研究的成本。我们为选择不相关的个体进行合并提供指导,并比较基于合并测量的研究与个体基因分型的能力,特别是使用单核苷酸多态性(SNP)标记的研究。材料和方法我们使用精确的数值计算来设置池化标准,以最大限度地检测标记频率、遗传模式和加性方差的关联。还提供了解析近似。结果和讨论提供了两种混合DNA设计的功率估计:受影响或未受影响的个体分类,类似于病例对照设计,以及具有极端表型值的个体的优化选择。优化选择的效率大约是受影响/未受影响分类的四倍。对于大多数标记来说,最理想的设计是将最高和最低的27%的个人集中起来。忽略实验测量误差,该设计只需要比个体基因分型大1.24倍的群体。当测量误差包括在内,合并DNA关联测试更好地作为一个预先筛选,以确定候选标记,然后进行个体基因分型。这种策略仍然可以提供比个体基因分型节省100倍的成本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信