Genome-resolved metagenomics from short-read sequencing data in the era of artificial intelligence.

IF 3.1 4区 生物学 Q1 GENETICS & HEREDITY
Hajra Qayyum, Zaara Ishaq, Amjad Ali, Masood Ur Rehman Kayani, Lisu Huang
{"title":"Genome-resolved metagenomics from short-read sequencing data in the era of artificial intelligence.","authors":"Hajra Qayyum, Zaara Ishaq, Amjad Ali, Masood Ur Rehman Kayani, Lisu Huang","doi":"10.1007/s10142-025-01625-x","DOIUrl":null,"url":null,"abstract":"<p><p>Genome-resolved metagenomics is a computational method that enables researchers to reconstruct microbial genomes from a given sample directly. This process involves three major steps, i.e. (i) preprocessing of the reads (ii) metagenome assembly, and (iii) genome binning, with (iv) taxonomic classification, and (v) functional annotation as additional steps. Despite the availability of multiple bioinformatics approaches, metagenomic data analysis encounters various challenges due to high dimensionality, data sparseness, and complexity. Meanwhile, integrating artificial intelligence (AI) at different stages of data analysis has transformed genome-resolved metagenomics. Though the application of machine learning and deep learning in metagenomic annotation started earlier, the emergence of better sequencing technologies, improved throughput, and reduced processing time have rendered the initial models less efficient. Consequently, the number of AI-based metagenomics tools is continuously increasing. The recent AI-based tools demonstrate superior performance in handling complex and multi-dimensional metagenomics data, offering improved accuracy, scalability, and efficiency compared to traditional models. In this paper, we reviewed recent AI-based tools specifically developed for short-read metagenomic data, and their underlying models for genome-resolved metagenomics. It also discusses the performance of these tools and overviews their usability in metagenomics research. We believe this study will provide researchers with insights into the strengths and limitations of current AI-based approaches, serving as a valuable resource for selecting appropriate tools and guiding future advancements in genome-resolved metagenomics.</p>","PeriodicalId":574,"journal":{"name":"Functional & Integrative Genomics","volume":"25 1","pages":"124"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Functional & Integrative Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s10142-025-01625-x","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Genome-resolved metagenomics is a computational method that enables researchers to reconstruct microbial genomes from a given sample directly. This process involves three major steps, i.e. (i) preprocessing of the reads (ii) metagenome assembly, and (iii) genome binning, with (iv) taxonomic classification, and (v) functional annotation as additional steps. Despite the availability of multiple bioinformatics approaches, metagenomic data analysis encounters various challenges due to high dimensionality, data sparseness, and complexity. Meanwhile, integrating artificial intelligence (AI) at different stages of data analysis has transformed genome-resolved metagenomics. Though the application of machine learning and deep learning in metagenomic annotation started earlier, the emergence of better sequencing technologies, improved throughput, and reduced processing time have rendered the initial models less efficient. Consequently, the number of AI-based metagenomics tools is continuously increasing. The recent AI-based tools demonstrate superior performance in handling complex and multi-dimensional metagenomics data, offering improved accuracy, scalability, and efficiency compared to traditional models. In this paper, we reviewed recent AI-based tools specifically developed for short-read metagenomic data, and their underlying models for genome-resolved metagenomics. It also discusses the performance of these tools and overviews their usability in metagenomics research. We believe this study will provide researchers with insights into the strengths and limitations of current AI-based approaches, serving as a valuable resource for selecting appropriate tools and guiding future advancements in genome-resolved metagenomics.

人工智能时代短读测序数据的基因组解析宏基因组学。
基因组解析宏基因组学是一种计算方法,使研究人员能够直接从给定样本中重建微生物基因组。该过程包括三个主要步骤,即:(i) reads预处理;(ii)宏基因组组装;(iii)基因组分组;(iv)分类分类;(v)功能注释作为附加步骤。尽管有多种生物信息学方法,但宏基因组数据分析因其高维性、数据稀疏性和复杂性而面临各种挑战。同时,在数据分析的不同阶段整合人工智能(AI)已经改变了基因组解析的宏基因组学。虽然机器学习和深度学习在宏基因组注释中的应用起步较早,但更好的测序技术的出现、吞吐量的提高和处理时间的缩短使得初始模型的效率降低。因此,基于人工智能的宏基因组学工具的数量不断增加。最近基于人工智能的工具在处理复杂和多维元基因组学数据方面表现出卓越的性能,与传统模型相比,提供了更高的准确性、可扩展性和效率。在本文中,我们回顾了最近专门为短读元基因组数据开发的基于人工智能的工具,以及它们用于基因组解析元基因组的基础模型。它还讨论了这些工具的性能,并概述了它们在宏基因组学研究中的可用性。我们相信这项研究将为研究人员提供对当前基于人工智能方法的优势和局限性的见解,作为选择合适工具和指导基因组解析宏基因组学未来发展的宝贵资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.50
自引率
3.40%
发文量
92
审稿时长
2 months
期刊介绍: Functional & Integrative Genomics is devoted to large-scale studies of genomes and their functions, including systems analyses of biological processes. The journal will provide the research community an integrated platform where researchers can share, review and discuss their findings on important biological questions that will ultimately enable us to answer the fundamental question: How do genomes work?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信