The wealth of shared resources: Improving molecular taxonomy using eDNA and public databases
James F. Fleming
DOI: 10.1111/zsc.12591 (https://doi.org/10.1111/zsc.12591). Published 2023-02-21.
Public databases such as NCBI's GenBank have served as repositories for genomic studies for more than 30 years. In that time, our understanding of the natural world, and especially the genomic world, has expanded vastly, and the size of these databases reflects this genomic revolution. Databases like GenBank now help populate many molecular studies, supplementing a researcher's newly gathered data with publicly available sequences. Despite this, older sequence records, particularly those from understudied taxa, are frequently not updated in line with this growing understanding, so analyses that leverage these public data, from BLAST searches through to phylogenetic analyses, cannot draw on the full force of that collective understanding. This is particularly true for environmental DNA (eDNA) records, where older records may identify sequences only to the phylum level, limiting their use in many studies. Here, in a case study of tardigrade 18S sequences, the family identities of 630 sequences previously identified only to the phylum level were established using 501 family-, genus- and species-level 18S sequences, effectively doubling the depth and taxonomic resolution of tardigrade 18S sequences in GenBank.