{"title":"mkLTG:使用可变同一性阈值对元条码序列进行分类分配的命令行工具。","authors":"Emese Meglécz","doi":"10.1007/s42977-024-00201-x","DOIUrl":null,"url":null,"abstract":"<p><p>Metabarcoding is now a widely used method for biodiversity studies. Taxonomic assignment of environmental sequences is one of the key steps of metabarcoding. Assignments based on lowest common ancestor (LCA) method generally rely on fixed arbitrary thresholds, and this is generally not well adapted for assignment of taxonomically diverse groups with variable coverage in reference databases. The mkLTG is a LCA-based method that uses a series of percentage of identity thresholds starting from stringent parameters and decreasing it if necessary. All parameters can be set separately for each percentage of identity threshold, which makes this tool adaptable for different databases, genetic markers and diverse taxonomic groups. The optimization step was included using the COI marker and a comprehensive, non-redundant database. The mkLTG tool is a command-line application with few dependencies that runs in all operating systems, therefore, it is easy to include into complex pipelines. All scripts are freely available including the benchmarking at https://github.com/meglecz/mkLTG .</p>","PeriodicalId":8853,"journal":{"name":"Biologia futura","volume":" ","pages":"369-375"},"PeriodicalIF":1.8000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"mkLTG: a command-line tool for taxonomic assignment of metabarcoding sequences using variable identity thresholds.\",\"authors\":\"Emese Meglécz\",\"doi\":\"10.1007/s42977-024-00201-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Metabarcoding is now a widely used method for biodiversity studies. Taxonomic assignment of environmental sequences is one of the key steps of metabarcoding. Assignments based on lowest common ancestor (LCA) method generally rely on fixed arbitrary thresholds, and this is generally not well adapted for assignment of taxonomically diverse groups with variable coverage in reference databases. The mkLTG is a LCA-based method that uses a series of percentage of identity thresholds starting from stringent parameters and decreasing it if necessary. All parameters can be set separately for each percentage of identity threshold, which makes this tool adaptable for different databases, genetic markers and diverse taxonomic groups. The optimization step was included using the COI marker and a comprehensive, non-redundant database. The mkLTG tool is a command-line application with few dependencies that runs in all operating systems, therefore, it is easy to include into complex pipelines. All scripts are freely available including the benchmarking at https://github.com/meglecz/mkLTG .</p>\",\"PeriodicalId\":8853,\"journal\":{\"name\":\"Biologia futura\",\"volume\":\" \",\"pages\":\"369-375\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2023-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biologia futura\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s42977-024-00201-x\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/1 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biologia futura","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s42977-024-00201-x","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/1 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
元条码是目前生物多样性研究中广泛使用的一种方法。环境序列的分类分配是元标码的关键步骤之一。基于最低共同祖先(LCA)方法的赋值一般依赖于固定的任意阈值,而这种方法一般不太适合赋值参考数据库中覆盖率不一的分类学多样性群体。mkLTG 是一种基于 LCA 的方法,它使用一系列同一性百分比阈值,从严格的参数开始,必要时降低阈值。每个同一性百分比阈值的所有参数都可单独设置,这使得该工具可适用于不同的数据库、遗传标记和不同的分类群体。优化步骤包括使用 COI 标记和一个全面的非冗余数据库。mkLTG 工具是一个命令行应用程序,依赖性极少,可在所有操作系统中运行,因此很容易纳入复杂的管道中。包括基准测试在内的所有脚本均可在 https://github.com/meglecz/mkLTG 免费获取。
mkLTG: a command-line tool for taxonomic assignment of metabarcoding sequences using variable identity thresholds.
Metabarcoding is now a widely used method for biodiversity studies. Taxonomic assignment of environmental sequences is one of the key steps of metabarcoding. Assignments based on lowest common ancestor (LCA) method generally rely on fixed arbitrary thresholds, and this is generally not well adapted for assignment of taxonomically diverse groups with variable coverage in reference databases. The mkLTG is a LCA-based method that uses a series of percentage of identity thresholds starting from stringent parameters and decreasing it if necessary. All parameters can be set separately for each percentage of identity threshold, which makes this tool adaptable for different databases, genetic markers and diverse taxonomic groups. The optimization step was included using the COI marker and a comprehensive, non-redundant database. The mkLTG tool is a command-line application with few dependencies that runs in all operating systems, therefore, it is easy to include into complex pipelines. All scripts are freely available including the benchmarking at https://github.com/meglecz/mkLTG .
Biologia futuraAgricultural and Biological Sciences-Agricultural and Biological Sciences (all)
CiteScore
3.50
自引率
0.00%
发文量
27
期刊介绍:
How can the scientific knowledge we possess now influence that future? That is, the FUTURE of Earth and life − of humankind. Can we make choices in the present to change our future? How can 21st century biological research ask proper scientific questions and find solid answers? Addressing these questions is the main goal of Biologia Futura (formerly Acta Biologica Hungarica).
In keeping with the name, the new mission is to focus on areas of biology where major advances are to be expected, areas of biology with strong inter-disciplinary connection and to provide new avenues for future research in biology. Biologia Futura aims to publish articles from all fields of biology.