Ben J. Wiens, Lucas H. DeCicco, Jocelyn P. Colella
{"title":"三角形:一个R包,用于识别AIMs和建立三角图使用SNP数据从杂交区","authors":"Ben J. Wiens, Lucas H. DeCicco, Jocelyn P. Colella","doi":"10.1038/s41437-025-00760-2","DOIUrl":null,"url":null,"abstract":"Hybridization provides a window into the speciation process and reshuffles parental alleles to produce novel recombinant genotypes. Presence or absence of specific hybrid classes across a hybrid zone can provide support for various modes of reproductive isolation. Early generation hybrid classes can be distinguished by their combination of hybrid index and interclass heterozygosity, which can be estimated with molecular data. Hybrid index and interclass heterozygosity are routinely calculated for studies of hybrid zones, but available resources for next-generation sequencing datasets are computationally demanding and tools for visualizing triangle plots are lacking. Here, we provide a resource for identifying ancestry-informative markers (AIMs) from single nucleotide polymorphism (SNP) datasets, calculating hybrid index and interclass heterozygosity, and visualizing the relationship as a triangle plot. Our methods are implemented in the R package triangulaR. We validate our methods on an empirical dataset and simulations of genetic data from a hybrid zone between two parental groups at low, medium, and high levels of divergence. triangulaR provides accurate and precise estimates of hybrid index and interclass heterozygosity with sample sizes as low as five individuals per parental group, and similar levels of error as another program for hybrid index and interclass heterozygosity estimation, bgchm. We explore various allele frequency difference thresholds for AIM identification, and how this threshold influences the accuracy and precision of hybrid index and interclass heterozygosity estimates. We contextualize interpretation of triangle plots by describing theoretical expectations under Hardy-Weinberg Equilibrium and provide recommendations for best practices for identifying AIMs and building triangle plots.","PeriodicalId":12991,"journal":{"name":"Heredity","volume":"134 5","pages":"251-262"},"PeriodicalIF":3.1000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41437-025-00760-2.pdf","citationCount":"0","resultStr":"{\"title\":\"triangulaR: an R package for identifying AIMs and building triangle plots using SNP data from hybrid zones\",\"authors\":\"Ben J. Wiens, Lucas H. DeCicco, Jocelyn P. Colella\",\"doi\":\"10.1038/s41437-025-00760-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hybridization provides a window into the speciation process and reshuffles parental alleles to produce novel recombinant genotypes. Presence or absence of specific hybrid classes across a hybrid zone can provide support for various modes of reproductive isolation. Early generation hybrid classes can be distinguished by their combination of hybrid index and interclass heterozygosity, which can be estimated with molecular data. Hybrid index and interclass heterozygosity are routinely calculated for studies of hybrid zones, but available resources for next-generation sequencing datasets are computationally demanding and tools for visualizing triangle plots are lacking. Here, we provide a resource for identifying ancestry-informative markers (AIMs) from single nucleotide polymorphism (SNP) datasets, calculating hybrid index and interclass heterozygosity, and visualizing the relationship as a triangle plot. Our methods are implemented in the R package triangulaR. We validate our methods on an empirical dataset and simulations of genetic data from a hybrid zone between two parental groups at low, medium, and high levels of divergence. triangulaR provides accurate and precise estimates of hybrid index and interclass heterozygosity with sample sizes as low as five individuals per parental group, and similar levels of error as another program for hybrid index and interclass heterozygosity estimation, bgchm. We explore various allele frequency difference thresholds for AIM identification, and how this threshold influences the accuracy and precision of hybrid index and interclass heterozygosity estimates. We contextualize interpretation of triangle plots by describing theoretical expectations under Hardy-Weinberg Equilibrium and provide recommendations for best practices for identifying AIMs and building triangle plots.\",\"PeriodicalId\":12991,\"journal\":{\"name\":\"Heredity\",\"volume\":\"134 5\",\"pages\":\"251-262\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.nature.com/articles/s41437-025-00760-2.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Heredity\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.nature.com/articles/s41437-025-00760-2\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heredity","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41437-025-00760-2","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECOLOGY","Score":null,"Total":0}
triangulaR: an R package for identifying AIMs and building triangle plots using SNP data from hybrid zones
Hybridization provides a window into the speciation process and reshuffles parental alleles to produce novel recombinant genotypes. Presence or absence of specific hybrid classes across a hybrid zone can provide support for various modes of reproductive isolation. Early generation hybrid classes can be distinguished by their combination of hybrid index and interclass heterozygosity, which can be estimated with molecular data. Hybrid index and interclass heterozygosity are routinely calculated for studies of hybrid zones, but available resources for next-generation sequencing datasets are computationally demanding and tools for visualizing triangle plots are lacking. Here, we provide a resource for identifying ancestry-informative markers (AIMs) from single nucleotide polymorphism (SNP) datasets, calculating hybrid index and interclass heterozygosity, and visualizing the relationship as a triangle plot. Our methods are implemented in the R package triangulaR. We validate our methods on an empirical dataset and simulations of genetic data from a hybrid zone between two parental groups at low, medium, and high levels of divergence. triangulaR provides accurate and precise estimates of hybrid index and interclass heterozygosity with sample sizes as low as five individuals per parental group, and similar levels of error as another program for hybrid index and interclass heterozygosity estimation, bgchm. We explore various allele frequency difference thresholds for AIM identification, and how this threshold influences the accuracy and precision of hybrid index and interclass heterozygosity estimates. We contextualize interpretation of triangle plots by describing theoretical expectations under Hardy-Weinberg Equilibrium and provide recommendations for best practices for identifying AIMs and building triangle plots.
期刊介绍:
Heredity is the official journal of the Genetics Society. It covers a broad range of topics within the field of genetics and therefore papers must address conceptual or applied issues of interest to the journal''s wide readership