Tram Vi, Katarina C Stuart, Hui Zhen Tan, Audald Lloret-Villas, Anna W Santure
Assessing Genotype Imputation Methods for Low-Coverage Sequencing Data in Populations With Differing Relatedness and Inbreeding Levels
Molecular Ecology Resources, e70049. Published 2025-09-29. DOI: 10.1111/1755-0998.70049 (https://doi.org/10.1111/1755-0998.70049)
Abstract
Low-coverage sequencing (LCS) followed by genotype imputation has become a cost-efficient approach for obtaining whole-genome SNPs. Several imputation methods for LCS data have been developed over the last decade. However, their accuracy in inferring missing genotypes and their effectiveness for downstream analyses such as population genetics have not been comprehensively compared. In the present study, we assessed the imputation performance of five different tools: GLIMPSE2, GeneImp, QUILT2, STITCH and Beagle5.4, using populations simulated by SLiM4 that represent different levels of genetic relatedness and inbreeding. Imputation accuracy was calculated at the level of variant, haplotype and sample. The effectiveness of using imputed genotypes in recovering genetic structure, relatedness, inbreeding coefficients and demographic history was subsequently evaluated. The imputation accuracy of the different methods was further tested in a real population of 283 hihi (stitchbird) samples. Our results suggest a high accuracy of all the tested methods on populations with high levels of genetic relatedness. However, in populations with low relatedness, imputation accuracy differed among tools and affected the results of some downstream analyses. The simulation and imputation pipeline presented here can help determine the most suitable imputation method for different population scenarios.
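To make "accuracy at the level of variant and sample" concrete, the sketch below computes simple genotype concordance (the fraction of imputed genotypes that match the known truth) per variant and per sample. This is only an illustration: the abstract does not state which accuracy metric the authors used, and the function names and toy genotype matrices here are hypothetical. Genotypes are assumed to be coded as alternate-allele counts (0, 1 or 2), with rows as variants and columns as samples.

```python
from typing import List, Sequence

# Hypothetical toy data: rows are variants, columns are samples.
# Values are alternate-allele counts (0, 1 or 2).
truth: List[List[int]] = [
    [0, 1, 2],
    [1, 1, 0],
    [2, 0, 0],
    [0, 0, 1],
]
imputed: List[List[int]] = [
    [0, 1, 2],
    [1, 0, 0],
    [2, 0, 1],
    [0, 0, 1],
]


def concordance(true_gt: Sequence[int], imputed_gt: Sequence[int]) -> float:
    """Fraction of positions where the imputed genotype equals the true one."""
    if len(true_gt) != len(imputed_gt) or len(true_gt) == 0:
        raise ValueError("genotype vectors must be non-empty and equal in length")
    matches = sum(t == i for t, i in zip(true_gt, imputed_gt))
    return matches / len(true_gt)


def per_variant_concordance(truth_m: List[List[int]],
                            imputed_m: List[List[int]]) -> List[float]:
    """One concordance value per variant (matrix row), taken across samples."""
    return [concordance(t, i) for t, i in zip(truth_m, imputed_m)]


def per_sample_concordance(truth_m: List[List[int]],
                           imputed_m: List[List[int]]) -> List[float]:
    """One concordance value per sample (matrix column), taken across variants."""
    # zip(*matrix) transposes rows into columns.
    return [concordance(t, i) for t, i in zip(zip(*truth_m), zip(*imputed_m))]


variant_acc = per_variant_concordance(truth, imputed)
sample_acc = per_sample_concordance(truth, imputed)
print("per-variant:", variant_acc)
print("per-sample:", sample_acc)
```

Concordance is the simplest choice and tends to look optimistic for rare variants, where most genotypes are homozygous reference; a squared correlation between true and imputed dosages is a common alternative when that matters.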
About the journal:
Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines.
In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.