{"title":"The wealth of shared resources: Improving molecular taxonomy using eDNA and public databases","authors":"James F. Fleming","doi":"10.1111/zsc.12591","DOIUrl":null,"url":null,"abstract":"Public databases such as the NCBI's GenBank have been used as repositories for genomic studies for more than 30 years. In this time, our understanding of the natural world, and especially the genomic world, has expanded vastly, and the size of these databases represent this genomic revolution. Databases like GenBank now help populate many molecular studies, supplementing a researcher's newly gathered data with publicly available sequences. Despite this, older sequence records, particularly those from understudied taxa, are frequently not updated in line with this burgeoning understanding, and this means that analyses that leverage this public data – from BLAST through to phylogenetic analyses – cannot do so with the full force of its collective understanding. This is particularly true for environmental DNA (eDNA) records, where older sequence records may identify sequences only to the phylum level, limiting their use in many studies. Here, with a case study of tardigrade 18S sequences, the family identities of 630 sequences, previously only identified to the phylum level, were established using 501 family, genus and species level 18S sequences, effectively doubling the depth and taxonomic resolution of tardigrade 18S sequences in GenBank.","PeriodicalId":49334,"journal":{"name":"Zoologica Scripta","volume":"52 1","pages":"226 - 234"},"PeriodicalIF":2.3000,"publicationDate":"2023-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zoologica Scripta","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/zsc.12591","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Public databases such as the NCBI's GenBank have been used as repositories for genomic studies for more than 30 years. In this time, our understanding of the natural world, and especially the genomic world, has expanded vastly, and the size of these databases represent this genomic revolution. Databases like GenBank now help populate many molecular studies, supplementing a researcher's newly gathered data with publicly available sequences. Despite this, older sequence records, particularly those from understudied taxa, are frequently not updated in line with this burgeoning understanding, and this means that analyses that leverage this public data – from BLAST through to phylogenetic analyses – cannot do so with the full force of its collective understanding. This is particularly true for environmental DNA (eDNA) records, where older sequence records may identify sequences only to the phylum level, limiting their use in many studies. Here, with a case study of tardigrade 18S sequences, the family identities of 630 sequences, previously only identified to the phylum level, were established using 501 family, genus and species level 18S sequences, effectively doubling the depth and taxonomic resolution of tardigrade 18S sequences in GenBank.
期刊介绍:
Zoologica Scripta publishes papers in animal systematics and phylogeny, i.e. studies of evolutionary relationships among taxa, and the origin and evolution of biological diversity. Papers can also deal with ecological interactions and geographic distributions (phylogeography) if the results are placed in a wider phylogenetic/systematic/evolutionary context. Zoologica Scripta encourages papers on the development of methods for all aspects of phylogenetic inference and biological nomenclature/classification.
Articles published in Zoologica Scripta must be original and present either theoretical or empirical studies of interest to a broad audience in systematics and phylogeny. Purely taxonomic papers, like species descriptions without being placed in a wider systematic/phylogenetic context, will not be considered.