通过整合 MAGFlow 和 BIgMAG 实现元基因组质量度量和分类注释可视化。

Q2 Pharmacology, Toxicology and Pharmaceutics

F1000Research Pub Date : 2024-09-23 eCollection Date: 2024-01-01 DOI:10.12688/f1000research.152290.2

Jeferyd Yepes-García, Laurent Falquet

{"title":"通过整合 MAGFlow 和 BIgMAG 实现元基因组质量度量和分类注释可视化。","authors":"Jeferyd Yepes-García, Laurent Falquet","doi":"10.12688/f1000research.152290.2","DOIUrl":null,"url":null,"abstract":"Background: Building Metagenome-Assembled Genomes (MAGs) from highly complex metagenomics datasets encompasses a series of steps covering from cleaning the sequences, assembling them to finally group them into bins. Along the process, multiple tools aimed to assess the quality and integrity of each MAG are implemented. Nonetheless, even when incorporated within end-to-end pipelines, the outputs of these pieces of software must be visualized and analyzed manually lacking integration in a complete framework.Methods: We developed a Nextflow pipeline (MAGFlow) for estimating the quality of MAGs through a wide variety of approaches (BUSCO, CheckM2, GUNC and QUAST), as well as for annotating taxonomically the metagenomes using GTDB-Tk2. MAGFlow is coupled to a Python-Dash application (BIgMAG) that displays the concatenated outcomes from the tools included by MAGFlow, highlighting the most important metrics in a single interactive environment along with a comparison/clustering of the input data.Results: By using MAGFlow/BIgMAG, the user will be able to benchmark the MAGs obtained through different workflows or establish the quality of the MAGs belonging to different samples following the divide and rule methodology.Conclusions: MAGFlow/BIgMAG represents a unique tool that integrates state-of-the-art tools to study different quality metrics and extract visually as much information as possible from a wide range of genome features.","PeriodicalId":12260,"journal":{"name":"F1000Research","volume":"13 ","pages":"640"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11445639/pdf/","citationCount":"0","resultStr":"{\"title\":\"Metagenome quality metrics and taxonomical annotation visualization through the integration of MAGFlow and BIgMAG.\",\"authors\":\"Jeferyd Yepes-García, Laurent Falquet\",\"doi\":\"10.12688/f1000research.152290.2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Building Metagenome-Assembled Genomes (MAGs) from highly complex metagenomics datasets encompasses a series of steps covering from cleaning the sequences, assembling them to finally group them into bins. Along the process, multiple tools aimed to assess the quality and integrity of each MAG are implemented. Nonetheless, even when incorporated within end-to-end pipelines, the outputs of these pieces of software must be visualized and analyzed manually lacking integration in a complete framework.Methods: We developed a Nextflow pipeline (MAGFlow) for estimating the quality of MAGs through a wide variety of approaches (BUSCO, CheckM2, GUNC and QUAST), as well as for annotating taxonomically the metagenomes using GTDB-Tk2. MAGFlow is coupled to a Python-Dash application (BIgMAG) that displays the concatenated outcomes from the tools included by MAGFlow, highlighting the most important metrics in a single interactive environment along with a comparison/clustering of the input data.Results: By using MAGFlow/BIgMAG, the user will be able to benchmark the MAGs obtained through different workflows or establish the quality of the MAGs belonging to different samples following the divide and rule methodology.Conclusions: MAGFlow/BIgMAG represents a unique tool that integrates state-of-the-art tools to study different quality metrics and extract visually as much information as possible from a wide range of genome features.\",\"PeriodicalId\":12260,\"journal\":{\"name\":\"F1000Research\",\"volume\":\"13 \",\"pages\":\"640\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11445639/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"F1000Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/f1000research.152290.2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"Pharmacology, Toxicology and Pharmaceutics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"F1000Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/f1000research.152290.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}

引用次数: 0

摘要

背景：从高度复杂的元基因组学数据集中构建元基因组组装基因组（MAGs）包含一系列步骤，从清理序列、组装序列到最终将序列分组。在这个过程中，需要使用多种工具来评估每个 MAG 的质量和完整性。然而，即使将这些工具纳入端到端管道，也必须手动对这些软件的输出进行可视化和分析，而无法集成到一个完整的框架中：我们开发了一个 Nextflow 管道（MAGFlow），用于通过多种方法（BUSCO、CheckM2、GUNC 和 QUAST）评估 MAGs 的质量，并使用 GTDB-Tk2 对元基因组进行分类注释。MAGFlow 与 Python-Dash 应用程序 (BIgMAG) 相结合，可显示 MAGFlow 所含工具的合并结果，在一个交互式环境中突出显示最重要的指标，并对输入数据进行比较/聚类：通过使用 MAGFlow/BIgMAG，用户将能够对通过不同工作流程获得的 MAGs 进行基准测试，或按照分而治之的方法确定属于不同样本的 MAGs 的质量：MAGFlow/BIgMAG是一种独特的工具，它整合了最先进的工具来研究不同的质量指标，并从广泛的基因组特征中直观地提取尽可能多的信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Metagenome quality metrics and taxonomical annotation visualization through the integration of MAGFlow and BIgMAG.

Background: Building Metagenome-Assembled Genomes (MAGs) from highly complex metagenomics datasets encompasses a series of steps covering from cleaning the sequences, assembling them to finally group them into bins. Along the process, multiple tools aimed to assess the quality and integrity of each MAG are implemented. Nonetheless, even when incorporated within end-to-end pipelines, the outputs of these pieces of software must be visualized and analyzed manually lacking integration in a complete framework.

Methods: We developed a Nextflow pipeline (MAGFlow) for estimating the quality of MAGs through a wide variety of approaches (BUSCO, CheckM2, GUNC and QUAST), as well as for annotating taxonomically the metagenomes using GTDB-Tk2. MAGFlow is coupled to a Python-Dash application (BIgMAG) that displays the concatenated outcomes from the tools included by MAGFlow, highlighting the most important metrics in a single interactive environment along with a comparison/clustering of the input data.

Results: By using MAGFlow/BIgMAG, the user will be able to benchmark the MAGs obtained through different workflows or establish the quality of the MAGs belonging to different samples following the divide and rule methodology.

Conclusions: MAGFlow/BIgMAG represents a unique tool that integrates state-of-the-art tools to study different quality metrics and extract visually as much information as possible from a wide range of genome features.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

F1000Research Pharmacology, Toxicology and Pharmaceutics-Pharmacology, Toxicology and Pharmaceutics (all)

CiteScore

5.00

自引率

0.00%

发文量

1646

审稿时长

1 weeks

期刊介绍： F1000Research publishes articles and other research outputs reporting basic scientific, scholarly, translational and clinical research across the physical and life sciences, engineering, medicine, social sciences and humanities. F1000Research is a scholarly publication platform set up for the scientific, scholarly and medical research community; each article has at least one author who is a qualified researcher, scholar or clinician actively working in their speciality and who has made a key contribution to the article. Articles must be original (not duplications). All research is suitable irrespective of the perceived level of interest or novelty; we welcome confirmatory and negative results, as well as null studies. F1000Research publishes different type of research, including clinical trials, systematic reviews, software tools, method articles, and many others. Reviews and Opinion articles providing a balanced and comprehensive overview of the latest discoveries in a particular field, or presenting a personal perspective on recent developments, are also welcome. See the full list of article types we accept for more information.