Ekaterina Avershina, Arfa Irej Qureshi, Hanne C Winther-Larsen, Trine B Rounge
{"title":"Challenges in capturing the mycobiome from shotgun metagenome data: lack of software and databases.","authors":"Ekaterina Avershina, Arfa Irej Qureshi, Hanne C Winther-Larsen, Trine B Rounge","doi":"10.1186/s40168-025-02048-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The mycobiome, representing the fungal component of microbial communities, is increasingly acknowledged as an integral part of the gut microbiome. However, research in this area remains relatively limited. The characterization of mycobiome taxa from metagenomic data is heavily reliant on the quality of the software and databases. In this study, we evaluated the feasibility of mycobiome profiling using existing bioinformatics tools on simulated fungal metagenomic data.</p><p><strong>Results: </strong>We identified seven tools claiming to perform taxonomic assignment of fungal shotgun metagenomic sequences. One of these was outdated and required substantial modifications of the code to be functional and was thus excluded. To evaluate the accuracy of identification and relative abundance of the remaining tools (Kraken2, MetaPhlAn4, EukDetect, FunOMIC, MiCoP, and HumanMycobiomeScan), we constructed 18 mock communities of varying species richness and abundance levels. The mock communities comprised up to 165 fungal species belonging to the phyla Ascomycota and Basidiomycota, commonly found in gut microbiomes. Of the tools, FunOMIC and HumanMycobiomeScan needed source code modifications to run. Notably, only one species, Candida orthopsilosis, was consistently identified by all tools across all communities where it was included. Increasing community richness improved precision of Kraken2 and the relative abundance accuracy of all tools on species, genus, and family levels. MetaPhlAn4 accurately identified all genera present in the communities and FunOMIC identified most species. The top three tools for overall accuracy in both identification and relative abundance estimation were EukDetect, MiCoP, and FunOMIC, respectively. Adding 90% and 99% bacterial background did not significantly impact these tools' performance. Among the whole genome reference tools (Kraken2, HMS, and MiCoP), MiCoP exhibited the highest accuracy when the same reference database was used.</p><p><strong>Conclusion: </strong>Our survey of mycobiome-specific software revealed a very limited selection of such tools and their poor robustness due to error-prone software, along with a significant lack of comprehensive databases enabling characterization of the mycobiome. None of the implemented tools fully agreed on the mock community profiles. FunOMIC recognized most of the species, but EukDetect and MiCoP provided predictions that were closest to the correct compositions. The bacterial background did not impact these tools' performance. Video Abstract.</p>","PeriodicalId":18447,"journal":{"name":"Microbiome","volume":"13 1","pages":"66"},"PeriodicalIF":13.8000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11887097/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s40168-025-02048-3","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The mycobiome, representing the fungal component of microbial communities, is increasingly acknowledged as an integral part of the gut microbiome. However, research in this area remains relatively limited. The characterization of mycobiome taxa from metagenomic data is heavily reliant on the quality of the software and databases. In this study, we evaluated the feasibility of mycobiome profiling using existing bioinformatics tools on simulated fungal metagenomic data.
Results: We identified seven tools claiming to perform taxonomic assignment of fungal shotgun metagenomic sequences. One of these was outdated and required substantial modifications of the code to be functional and was thus excluded. To evaluate the accuracy of identification and relative abundance of the remaining tools (Kraken2, MetaPhlAn4, EukDetect, FunOMIC, MiCoP, and HumanMycobiomeScan), we constructed 18 mock communities of varying species richness and abundance levels. The mock communities comprised up to 165 fungal species belonging to the phyla Ascomycota and Basidiomycota, commonly found in gut microbiomes. Of the tools, FunOMIC and HumanMycobiomeScan needed source code modifications to run. Notably, only one species, Candida orthopsilosis, was consistently identified by all tools across all communities where it was included. Increasing community richness improved precision of Kraken2 and the relative abundance accuracy of all tools on species, genus, and family levels. MetaPhlAn4 accurately identified all genera present in the communities and FunOMIC identified most species. The top three tools for overall accuracy in both identification and relative abundance estimation were EukDetect, MiCoP, and FunOMIC, respectively. Adding 90% and 99% bacterial background did not significantly impact these tools' performance. Among the whole genome reference tools (Kraken2, HMS, and MiCoP), MiCoP exhibited the highest accuracy when the same reference database was used.
Conclusion: Our survey of mycobiome-specific software revealed a very limited selection of such tools and their poor robustness due to error-prone software, along with a significant lack of comprehensive databases enabling characterization of the mycobiome. None of the implemented tools fully agreed on the mock community profiles. FunOMIC recognized most of the species, but EukDetect and MiCoP provided predictions that were closest to the correct compositions. The bacterial background did not impact these tools' performance. Video Abstract.
期刊介绍:
Microbiome is a journal that focuses on studies of microbiomes in humans, animals, plants, and the environment. It covers both natural and manipulated microbiomes, such as those in agriculture. The journal is interested in research that uses meta-omics approaches or novel bioinformatics tools and emphasizes the community/host interaction and structure-function relationship within the microbiome. Studies that go beyond descriptive omics surveys and include experimental or theoretical approaches will be considered for publication. The journal also encourages research that establishes cause and effect relationships and supports proposed microbiome functions. However, studies of individual microbial isolates/species without exploring their impact on the host or the complex microbiome structures and functions will not be considered for publication. Microbiome is indexed in BIOSIS, Current Contents, DOAJ, Embase, MEDLINE, PubMed, PubMed Central, and Science Citations Index Expanded.