Methods to identify population outliers using genetic markers

Sheila A. Fisher, Cathryn M. Lewis
{"title":"Methods to identify population outliers using genetic markers","authors":"Sheila A. Fisher,&nbsp;Cathryn M. Lewis","doi":"10.1046/j.1466-9218.2001.00012.x","DOIUrl":null,"url":null,"abstract":"<div>\n \n <section>\n \n \n <p><b>Introduction </b> Genetic studies to identify linkage or association usually assume participants are sampled from a genetically homogeneous population, so that a single set of marker allele frequencies is appropriate for all individuals in the study.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We have developed a method to identify individuals who are population outliers, because the marker allele frequency distributions from which their genotypes arise differ from the distributions of the remaining individuals in the study. Using allele frequencies estimated from an independent sample, the genotype log likelihood (GLL) test statistic calculates the likelihood of each individual’s genotypes across all markers. Extreme values of the statistic indicate that the individual arises from a different population. The distribution of the test statistic is derived and its convergence under the central limit theorem discussed.</p>\n </section>\n \n <section>\n \n <h3> Results and Discussion</h3>\n \n <p>This method was applied to genome search data from rheumatoid arthritis which identified a single population outlier family. We used allele frequencies from different populations to show that 100 markers provides high power to identify outliers across a range of populations. The GLL test statistic can be used as a screening tool to identify outlier families in any genetic study with genotyping at independent markers.</p>\n </section>\n </div>","PeriodicalId":100575,"journal":{"name":"GeneScreen","volume":"1 3","pages":"125-129"},"PeriodicalIF":0.0000,"publicationDate":"2002-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1046/j.1466-9218.2001.00012.x","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GeneScreen","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1046/j.1466-9218.2001.00012.x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Introduction Genetic studies to identify linkage or association usually assume participants are sampled from a genetically homogeneous population, so that a single set of marker allele frequencies is appropriate for all individuals in the study.

Methods

We have developed a method to identify individuals who are population outliers, because the marker allele frequency distributions from which their genotypes arise differ from the distributions of the remaining individuals in the study. Using allele frequencies estimated from an independent sample, the genotype log likelihood (GLL) test statistic calculates the likelihood of each individual’s genotypes across all markers. Extreme values of the statistic indicate that the individual arises from a different population. The distribution of the test statistic is derived and its convergence under the central limit theorem discussed.

Results and Discussion

This method was applied to genome search data from rheumatoid arthritis which identified a single population outlier family. We used allele frequencies from different populations to show that 100 markers provides high power to identify outliers across a range of populations. The GLL test statistic can be used as a screening tool to identify outlier families in any genetic study with genotyping at independent markers.

方法利用遗传标记鉴定群体异常值
用于确定联系或关联的遗传研究通常假设参与者来自遗传同质的群体,因此一组标记等位基因频率适用于研究中的所有个体。方法我们已经开发了一种方法来识别群体异常个体,因为其基因型产生的标记等位基因频率分布与研究中其余个体的分布不同。使用从独立样本中估计的等位基因频率,基因型对数似然(GLL)检验统计量计算每个个体的基因型在所有标记上的可能性。该统计值的极值表明该个体来自不同的种群。推导了检验统计量的分布,并讨论了检验统计量在中心极限定理下的收敛性。结果与讨论该方法应用于类风湿关节炎的基因组搜索数据,发现了一个单一的群体异常家族。我们使用来自不同种群的等位基因频率来表明,100个标记提供了在一系列种群中识别异常值的高功率。GLL检验统计量可作为一种筛选工具,用于在任何具有独立标记的基因分型的遗传研究中识别异常家族。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信