Accounting for gene flow from unsampled ghost populations while estimating evolutionary history.

IF 2.2 3区 生物学 Q3 GENETICS & HEREDITY
Arun Sethuraman, Melissa Lynch, Margaret Wanjiku, Michael Kuzminskiy
{"title":"Accounting for gene flow from unsampled ghost populations while estimating evolutionary history.","authors":"Arun Sethuraman, Melissa Lynch, Margaret Wanjiku, Michael Kuzminskiy","doi":"10.1093/g3journal/jkaf180","DOIUrl":null,"url":null,"abstract":"<p><p>Gene flow from unsampled or extinct ghost populations leave signatures on the genomes of individuals from extant, sampled populations, often introducing biases, data misinterpretation, and ambiguous results when estimating evolutionary history from population genomic data. Here we establish theoretical expectations for these biases, and then utilize extensive simulations under a variety of ghost topologies to systematically assess biases while accounting, or not accounting for gene flow from ghost populations in (i) population genetics summary statistics such as π, FST, and Tajima's D and (ii) demographic history (mutation-scaled effective population sizes, divergence times, and migration rates) under the Isolation with Migration (IM) model. Estimates of evolutionary history across all scenarios of deep divergence of an outgroup ghost indicate consistent (i) under-estimation of divergence times between sampled populations, (ii) over-estimation of effective population sizes of sampled populations, and (iii) under-estimation of migration rates between sampled populations, with increased gene flow from the unsampled ghost population. Without accounting for an unsampled ghost, summary statistics like FST are under-estimated, and π is over-estimated with increased gene flow from the ghost. These biases in summary statistics and population structure are however not captured under models of recent IM that approximate scales of the evolution of anatomically modern humans and Neanderthals and solely recapitulated using model-based estimation of evolutionary history. We also utilize a 355 locus dataset from African Hunter-Gatherer populations and discuss similar biases in estimating evolutionary history while not accounting for unsampled ghost.</p>","PeriodicalId":12468,"journal":{"name":"G3: Genes|Genomes|Genetics","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506660/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"G3: Genes|Genomes|Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/g3journal/jkaf180","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Gene flow from unsampled or extinct ghost populations leave signatures on the genomes of individuals from extant, sampled populations, often introducing biases, data misinterpretation, and ambiguous results when estimating evolutionary history from population genomic data. Here we establish theoretical expectations for these biases, and then utilize extensive simulations under a variety of ghost topologies to systematically assess biases while accounting, or not accounting for gene flow from ghost populations in (i) population genetics summary statistics such as π, FST, and Tajima's D and (ii) demographic history (mutation-scaled effective population sizes, divergence times, and migration rates) under the Isolation with Migration (IM) model. Estimates of evolutionary history across all scenarios of deep divergence of an outgroup ghost indicate consistent (i) under-estimation of divergence times between sampled populations, (ii) over-estimation of effective population sizes of sampled populations, and (iii) under-estimation of migration rates between sampled populations, with increased gene flow from the unsampled ghost population. Without accounting for an unsampled ghost, summary statistics like FST are under-estimated, and π is over-estimated with increased gene flow from the ghost. These biases in summary statistics and population structure are however not captured under models of recent IM that approximate scales of the evolution of anatomically modern humans and Neanderthals and solely recapitulated using model-based estimation of evolutionary history. We also utilize a 355 locus dataset from African Hunter-Gatherer populations and discuss similar biases in estimating evolutionary history while not accounting for unsampled ghost.

在估计进化史的同时,从未采样的幽灵种群中计算基因流。
来自未采样或灭绝的幽灵种群的基因流在现存的采样种群的个体基因组上留下了印记,这通常会在根据种群基因组数据估计进化历史时引入偏见、数据误解和模糊的结果。在此,我们建立了对这些偏差的理论预期,然后利用各种鬼种群拓扑下的广泛模拟,在考虑或不考虑鬼种群基因流动的情况下,系统地评估偏差(1)群体遗传学总结统计,如π、FST和Tajima的D,以及(2)人口历史(突变尺度的有效种群规模、分化时间和迁移率)下的隔离与迁移(IM)模型。对外群幽灵的所有深度分化情景的进化史估计表明:a)低估了采样种群之间的分化时间,(b)高估了采样种群的有效种群规模,以及(c)低估了采样种群之间的迁移率,未采样幽灵种群的基因流增加。如果不考虑未采样的幽灵,像FST这样的汇总统计数据就会被低估,而π则会随着幽灵的基因流的增加而被高估。然而,这些在汇总统计和人口结构方面的偏差并没有被最近的IM模型所捕获,这些模型近似于解剖学上的现代人和尼安德特人的进化尺度,并且仅使用基于模型的进化历史估计来概括。我们还利用了来自非洲狩猎采集者群体的355个基因座数据集,并讨论了在估计进化史时的类似偏差,同时不考虑未采样的幽灵。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
G3: Genes|Genomes|Genetics
G3: Genes|Genomes|Genetics GENETICS & HEREDITY-
CiteScore
5.10
自引率
3.80%
发文量
305
审稿时长
3-8 weeks
期刊介绍: G3: Genes, Genomes, Genetics provides a forum for the publication of high‐quality foundational research, particularly research that generates useful genetic and genomic information such as genome maps, single gene studies, genome‐wide association and QTL studies, as well as genome reports, mutant screens, and advances in methods and technology. The Editorial Board of G3 believes that rapid dissemination of these data is the necessary foundation for analysis that leads to mechanistic insights. G3, published by the Genetics Society of America, meets the critical and growing need of the genetics community for rapid review and publication of important results in all areas of genetics. G3 offers the opportunity to publish the puzzling finding or to present unpublished results that may not have been submitted for review and publication due to a perceived lack of a potential high-impact finding. G3 has earned the DOAJ Seal, which is a mark of certification for open access journals, awarded by DOAJ to journals that achieve a high level of openness, adhere to Best Practice and high publishing standards.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信