Susanna R Grigson, George Bouras, Bas E Dutilh, Robert D Olson, Robert A Edwards
{"title":"细菌和噬菌体蛋白的计算功能预测。","authors":"Susanna R Grigson, George Bouras, Bas E Dutilh, Robert D Olson, Robert A Edwards","doi":"10.1128/mmbr.00022-25","DOIUrl":null,"url":null,"abstract":"<p><p>SUMMARYUnderstanding protein functions is crucial for interpreting microbial life; however, reliable function annotation remains a major challenge in computational biology. Despite significant advances in bioinformatics methods, ~30% of all bacterial and ~65% of all bacteriophage (phage) protein sequences cannot be confidently annotated. In this review, we examine state-of-the-art bioinformatics tools and methodologies for annotating bacterial and phage proteins, particularly those of unknown or poorly characterized function. We describe the process of identifying protein-coding regions and the systems to classify protein functionalities. Additionally, we explore a range of protein annotation methods, from traditional homology-based methods to cutting-edge machine learning models. In doing so, we provide a toolbox for confidently annotating previously unknown bacterial and phage proteins, advancing the discovery of novel functions and our understanding of microbial systems.</p>","PeriodicalId":18520,"journal":{"name":"Microbiology and Molecular Biology Reviews","volume":" ","pages":"e0002225"},"PeriodicalIF":7.8000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462290/pdf/","citationCount":"0","resultStr":"{\"title\":\"Computational function prediction of bacteria and phage proteins.\",\"authors\":\"Susanna R Grigson, George Bouras, Bas E Dutilh, Robert D Olson, Robert A Edwards\",\"doi\":\"10.1128/mmbr.00022-25\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>SUMMARYUnderstanding protein functions is crucial for interpreting microbial life; however, reliable function annotation remains a major challenge in computational biology. Despite significant advances in bioinformatics methods, ~30% of all bacterial and ~65% of all bacteriophage (phage) protein sequences cannot be confidently annotated. In this review, we examine state-of-the-art bioinformatics tools and methodologies for annotating bacterial and phage proteins, particularly those of unknown or poorly characterized function. We describe the process of identifying protein-coding regions and the systems to classify protein functionalities. Additionally, we explore a range of protein annotation methods, from traditional homology-based methods to cutting-edge machine learning models. In doing so, we provide a toolbox for confidently annotating previously unknown bacterial and phage proteins, advancing the discovery of novel functions and our understanding of microbial systems.</p>\",\"PeriodicalId\":18520,\"journal\":{\"name\":\"Microbiology and Molecular Biology Reviews\",\"volume\":\" \",\"pages\":\"e0002225\"},\"PeriodicalIF\":7.8000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462290/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microbiology and Molecular Biology Reviews\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/mmbr.00022-25\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiology and Molecular Biology Reviews","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/mmbr.00022-25","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
Computational function prediction of bacteria and phage proteins.
SUMMARYUnderstanding protein functions is crucial for interpreting microbial life; however, reliable function annotation remains a major challenge in computational biology. Despite significant advances in bioinformatics methods, ~30% of all bacterial and ~65% of all bacteriophage (phage) protein sequences cannot be confidently annotated. In this review, we examine state-of-the-art bioinformatics tools and methodologies for annotating bacterial and phage proteins, particularly those of unknown or poorly characterized function. We describe the process of identifying protein-coding regions and the systems to classify protein functionalities. Additionally, we explore a range of protein annotation methods, from traditional homology-based methods to cutting-edge machine learning models. In doing so, we provide a toolbox for confidently annotating previously unknown bacterial and phage proteins, advancing the discovery of novel functions and our understanding of microbial systems.
期刊介绍:
Microbiology and Molecular Biology Reviews (MMBR), a journal that explores the significance and interrelationships of recent discoveries in various microbiology fields, publishes review articles that help both specialists and nonspecialists understand and apply the latest findings in their own research. MMBR covers a wide range of topics in microbiology, including microbial ecology, evolution, parasitology, biotechnology, and immunology. The journal caters to scientists with diverse interests in all areas of microbial science and encompasses viruses, bacteria, archaea, fungi, unicellular eukaryotes, and microbial parasites. MMBR primarily publishes authoritative and critical reviews that push the boundaries of knowledge, appealing to both specialists and generalists. The journal often includes descriptive figures and tables to enhance understanding. Indexed/Abstracted in various databases such as Agricola, BIOSIS Previews, CAB Abstracts, Cambridge Scientific Abstracts, Chemical Abstracts Service, Current Contents- Life Sciences, EMBASE, Food Science and Technology Abstracts, Illustrata, MEDLINE, Science Citation Index Expanded (Web of Science), Summon, and Scopus, among others.