Jointly representing long-range genetic similarity and spatially heterogeneous isolation-by-distance.

IF 3.7 2区 生物学 Q1 GENETICS & HEREDITY
PLoS Genetics Pub Date : 2025-09-16 eCollection Date: 2025-09-01 DOI:10.1371/journal.pgen.1011612
Vivaswat Shastry, Marco Musiani, John Novembre
{"title":"Jointly representing long-range genetic similarity and spatially heterogeneous isolation-by-distance.","authors":"Vivaswat Shastry, Marco Musiani, John Novembre","doi":"10.1371/journal.pgen.1011612","DOIUrl":null,"url":null,"abstract":"<p><p>Isolation-by-distance patterns in genetic variation are a widespread feature of the geographic structure of genetic variation in many species, and many methods have been developed to illuminate such patterns in genetic data. However, long-range genetic similarities also exist, often as a result of rare or episodic long-range gene flow. Jointly characterizing patterns of isolation-by-distance and long-range genetic similarity in genetic data is an open data analysis challenge that, if resolved, could help produce more complete representations of the geographic structure of genetic data in any given species. Here, we present a computationally tractable method that identifies long-range genetic similarities in a background of spatially heterogeneous isolation-by-distance variation. The method uses a coalescent-based framework, and models long-range genetic similarity in terms of directional events with source fractions describing the fraction of ancestry at a location tracing back to a remote source. The method produces geographic maps annotated with inferred long-range edges, as well as maps of uncertainty in the geographic location of each source of long-range gene flow. We have implemented the method in a package called FEEMSmix (an extension to FEEMS), and validated its implementation using simulations representative of typical data applications. We also apply this method to two empirical data sets. In a data set of over 4,000 humans (Homo sapiens) across Afro-Eurasia, we recover many known signals of long-distance dispersal from recent centuries. Similarly, in a data set of over 100 gray wolves (Canis lupus) across North America, we identify several previously unknown long-range connections, some of which were attributable to recording errors in sampling locations. Therefore, beyond identifying genuine long-range dispersals, our approach also serves as a useful tool for quality control in spatial genetic studies.</p>","PeriodicalId":49007,"journal":{"name":"PLoS Genetics","volume":"21 9","pages":"e1011612"},"PeriodicalIF":3.7000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453258/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1371/journal.pgen.1011612","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Isolation-by-distance patterns in genetic variation are a widespread feature of the geographic structure of genetic variation in many species, and many methods have been developed to illuminate such patterns in genetic data. However, long-range genetic similarities also exist, often as a result of rare or episodic long-range gene flow. Jointly characterizing patterns of isolation-by-distance and long-range genetic similarity in genetic data is an open data analysis challenge that, if resolved, could help produce more complete representations of the geographic structure of genetic data in any given species. Here, we present a computationally tractable method that identifies long-range genetic similarities in a background of spatially heterogeneous isolation-by-distance variation. The method uses a coalescent-based framework, and models long-range genetic similarity in terms of directional events with source fractions describing the fraction of ancestry at a location tracing back to a remote source. The method produces geographic maps annotated with inferred long-range edges, as well as maps of uncertainty in the geographic location of each source of long-range gene flow. We have implemented the method in a package called FEEMSmix (an extension to FEEMS), and validated its implementation using simulations representative of typical data applications. We also apply this method to two empirical data sets. In a data set of over 4,000 humans (Homo sapiens) across Afro-Eurasia, we recover many known signals of long-distance dispersal from recent centuries. Similarly, in a data set of over 100 gray wolves (Canis lupus) across North America, we identify several previously unknown long-range connections, some of which were attributable to recording errors in sampling locations. Therefore, beyond identifying genuine long-range dispersals, our approach also serves as a useful tool for quality control in spatial genetic studies.

共同代表远程遗传相似性和空间异构距离隔离。
遗传变异中的距离隔离模式是许多物种遗传变异地理结构的普遍特征,并且已经开发了许多方法来阐明遗传数据中的这种模式。然而,远程遗传相似性也存在,通常是由于罕见或偶发的远程基因流动。共同描述遗传数据中按距离隔离和远距离遗传相似性的模式是一项开放数据分析挑战,如果得到解决,将有助于对任何给定物种的遗传数据的地理结构产生更完整的表示。在这里,我们提出了一种计算易于处理的方法,可以在空间异构隔离距离变化的背景下识别远程遗传相似性。该方法使用基于聚结的框架,并根据方向性事件对远程遗传相似性进行建模,源分数描述了在一个位置追溯到远程源的祖先的分数。该方法生成带有推断远程边缘的地理地图,以及每个远程基因流源地理位置的不确定性地图。我们已经在一个名为FEEMSmix (FEEMS的扩展)的包中实现了该方法,并使用代表典型数据应用程序的模拟验证了其实现。我们还将这种方法应用于两个经验数据集。在横跨非洲-欧亚大陆的4000多名人类(智人)的数据集中,我们恢复了近几个世纪以来许多已知的长距离分散信号。同样,在北美100多只灰狼(Canis lupus)的数据集中,我们发现了一些以前未知的远程联系,其中一些可归因于采样位置的记录错误。因此,除了识别真正的远距离扩散外,我们的方法还可以作为空间遗传研究质量控制的有用工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
PLoS Genetics
PLoS Genetics GENETICS & HEREDITY-
自引率
2.20%
发文量
438
期刊介绍: PLOS Genetics is run by an international Editorial Board, headed by the Editors-in-Chief, Greg Barsh (HudsonAlpha Institute of Biotechnology, and Stanford University School of Medicine) and Greg Copenhaver (The University of North Carolina at Chapel Hill). Articles published in PLOS Genetics are archived in PubMed Central and cited in PubMed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信