Shradha Sharma, Hari Priya Narahari, Karthik Raman
{"title":"利用机器学习进行宏基因组数据分析:趋势和应用。","authors":"Shradha Sharma, Hari Priya Narahari, Karthik Raman","doi":"10.1128/msystems.01642-24","DOIUrl":null,"url":null,"abstract":"<p><p>Metagenomic sequencing has revolutionized our understanding of microbial ecosystems by enabling high-resolution profiling of microbes across diverse environments. However, the resulting data are high-dimensional, sparse, and noisy, posing challenges for downstream data analysis. Machine learning (ML) has provided an arsenal of tools to extract meaningful insights from such large and complex data sets. This review surveys the existing state of ML applications in metagenomic data analysis, from traditional supervised and unsupervised learning to time-series modeling, transfer learning, and newer directions such as causal ML and generative models. We highlight certain key challenges and delve into important issues like model interpretability, emphasizing the importance of explainable AI (XAI). We also compare ML with mechanistic models, commenting on their relative advantages, disadvantages, and prospects for synergy. Finally, we preview future directions, such as the incorporation of multi-omics data, synthetic data generation, and Agentic AI systems, highlighting the increasingly prominent role that AI and ML will play in the future of microbiome science.</p>","PeriodicalId":18819,"journal":{"name":"mSystems","volume":" ","pages":"e0164224"},"PeriodicalIF":4.6000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Harnessing machine learning for metagenomic data analysis: trends and applications.\",\"authors\":\"Shradha Sharma, Hari Priya Narahari, Karthik Raman\",\"doi\":\"10.1128/msystems.01642-24\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Metagenomic sequencing has revolutionized our understanding of microbial ecosystems by enabling high-resolution profiling of microbes across diverse environments. However, the resulting data are high-dimensional, sparse, and noisy, posing challenges for downstream data analysis. Machine learning (ML) has provided an arsenal of tools to extract meaningful insights from such large and complex data sets. This review surveys the existing state of ML applications in metagenomic data analysis, from traditional supervised and unsupervised learning to time-series modeling, transfer learning, and newer directions such as causal ML and generative models. We highlight certain key challenges and delve into important issues like model interpretability, emphasizing the importance of explainable AI (XAI). We also compare ML with mechanistic models, commenting on their relative advantages, disadvantages, and prospects for synergy. Finally, we preview future directions, such as the incorporation of multi-omics data, synthetic data generation, and Agentic AI systems, highlighting the increasingly prominent role that AI and ML will play in the future of microbiome science.</p>\",\"PeriodicalId\":18819,\"journal\":{\"name\":\"mSystems\",\"volume\":\" \",\"pages\":\"e0164224\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"mSystems\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/msystems.01642-24\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"mSystems","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/msystems.01642-24","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
Harnessing machine learning for metagenomic data analysis: trends and applications.
Metagenomic sequencing has revolutionized our understanding of microbial ecosystems by enabling high-resolution profiling of microbes across diverse environments. However, the resulting data are high-dimensional, sparse, and noisy, posing challenges for downstream data analysis. Machine learning (ML) has provided an arsenal of tools to extract meaningful insights from such large and complex data sets. This review surveys the existing state of ML applications in metagenomic data analysis, from traditional supervised and unsupervised learning to time-series modeling, transfer learning, and newer directions such as causal ML and generative models. We highlight certain key challenges and delve into important issues like model interpretability, emphasizing the importance of explainable AI (XAI). We also compare ML with mechanistic models, commenting on their relative advantages, disadvantages, and prospects for synergy. Finally, we preview future directions, such as the incorporation of multi-omics data, synthetic data generation, and Agentic AI systems, highlighting the increasingly prominent role that AI and ML will play in the future of microbiome science.
mSystemsBiochemistry, Genetics and Molecular Biology-Biochemistry
CiteScore
10.50
自引率
3.10%
发文量
308
审稿时长
13 weeks
期刊介绍:
mSystems™ will publish preeminent work that stems from applying technologies for high-throughput analyses to achieve insights into the metabolic and regulatory systems at the scale of both the single cell and microbial communities. The scope of mSystems™ encompasses all important biological and biochemical findings drawn from analyses of large data sets, as well as new computational approaches for deriving these insights. mSystems™ will welcome submissions from researchers who focus on the microbiome, genomics, metagenomics, transcriptomics, metabolomics, proteomics, glycomics, bioinformatics, and computational microbiology. mSystems™ will provide streamlined decisions, while carrying on ASM''s tradition of rigorous peer review.