{"title":"Estimating current effective sizes of large populations from a single sample of genomic marker data: A comparison of estimators by simulations","authors":"Jinliang Wang","doi":"10.1002/1438-390x.12167","DOIUrl":null,"url":null,"abstract":"Abstract Genome‐wide single nucleotide polymorphisms (SNPs) data are increasingly used in estimating the current effective population sizes ( N e ) for informing the conservation of endangered species and guiding the management of exploited species. Previous assessments of sibship frequency (SF) and linkage disequilibrium (LD) estimators of N e focused on small populations where genetic drift is strong and thus N e is easy to estimate. Genomic single nucleotide polymorphism (SNP) data provide ample information and hold the potential for application of these estimators to large populations where genetic drift is rather weak and thus N e is difficult to estimate. In this study, I simulated very large populations and sampled a widely variable number of individuals (genotyped at 10,000 SNPs) for estimating N e by both SF and LD methods. I also considered the more realistic situation where a population experiences a bottleneck, and where marker data suffer from genotyping errors. The simulations show that both SF and LD methods can yield accurate N e estimates of very large populations when sampled individuals are sufficiently numerous. When n is much smaller than N e , however, N e estimates are in a bimodal distribution with a substantial proportion of the estimates being infinitely large. For a population with a bottleneck, LD estimator overestimates and underestimates the N e of the parental population from samples taken at and after the bottleneck, respectively. LD estimator also overestimates N e substantially when applied to data suffering from allelic dropouts and false alleles. In contrast, SF estimator is unbiased and accurate when populations are changing in size or markers suffer from genotyping errors.","PeriodicalId":54597,"journal":{"name":"Population Ecology","volume":"24 1","pages":"0"},"PeriodicalIF":1.1000,"publicationDate":"2023-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Population Ecology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/1438-390x.12167","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Genome‐wide single nucleotide polymorphisms (SNPs) data are increasingly used in estimating the current effective population sizes ( N e ) for informing the conservation of endangered species and guiding the management of exploited species. Previous assessments of sibship frequency (SF) and linkage disequilibrium (LD) estimators of N e focused on small populations where genetic drift is strong and thus N e is easy to estimate. Genomic single nucleotide polymorphism (SNP) data provide ample information and hold the potential for application of these estimators to large populations where genetic drift is rather weak and thus N e is difficult to estimate. In this study, I simulated very large populations and sampled a widely variable number of individuals (genotyped at 10,000 SNPs) for estimating N e by both SF and LD methods. I also considered the more realistic situation where a population experiences a bottleneck, and where marker data suffer from genotyping errors. The simulations show that both SF and LD methods can yield accurate N e estimates of very large populations when sampled individuals are sufficiently numerous. When n is much smaller than N e , however, N e estimates are in a bimodal distribution with a substantial proportion of the estimates being infinitely large. For a population with a bottleneck, LD estimator overestimates and underestimates the N e of the parental population from samples taken at and after the bottleneck, respectively. LD estimator also overestimates N e substantially when applied to data suffering from allelic dropouts and false alleles. In contrast, SF estimator is unbiased and accurate when populations are changing in size or markers suffer from genotyping errors.
期刊介绍:
Population Ecology, formerly known as Researches on Population Ecology launched in Dec 1952, is the official journal of the Society of Population Ecology. Population Ecology publishes original research articles and reviews (including invited reviews) on various aspects of population ecology, from the individual to the community level. Among the specific fields included are population dynamics and distribution, evolutionary ecology, ecological genetics, theoretical models, conservation biology, agroecosystem studies, and bioresource management. Manuscripts should contain new results of empirical and/or theoretical investigations concerning facts, patterns, processes, mechanisms or concepts of population ecology; those purely descriptive in nature are not suitable for this journal. All manuscripts are reviewed anonymously by two or more referees, and the final editorial decision is made by the Chief Editor or an Associate Editor based on the referees'' evaluations.