Philip Sweet, Matthew Burroughs, Sungyeon Jang, Lydia Contreras
{"title":"TolRad是一个利用Pfam注释预测辐射耐受性的模型,它能从参考基因组和MAGs中识别新型辐射敏感细菌物种。","authors":"Philip Sweet, Matthew Burroughs, Sungyeon Jang, Lydia Contreras","doi":"10.1128/spectrum.03838-23","DOIUrl":null,"url":null,"abstract":"<p><p>The trait of ionizing radiation (IR) tolerance is variable between bacterium, with species succumbing to acute doses as low as 60 Gy and extremophiles able to survive doses exceeding 10,000 Gy. While survival screens have identified multiple highly radioresistant bacteria, such systemic searches have not been conducted for IR-sensitive bacteria. The taxonomy-level diversity of IR sensitivity is poorly understood, as are genetic elements that influence IR sensitivity. Using the protein domain (Pfam) frequencies from 61 bacterial species with experimentally determined <i>D</i><sub>10</sub> values (the dose at which only 10% of the population survives), we trained TolRad, a random forest binary classifier, to distinguish between radiosensitive (<i>D</i><sub>10</sub> < 200 Gy) and radiation-tolerant (<i>D</i><sub>10</sub> > 200 Gy) bacteria. On untrained species, TolRad had an accuracy of 0.900. We applied TolRad to 152 UniProt-hosted bacterial proteomes associated with the human microbiome, including 37 strains from the ATCC Human Microbiome Collection, and classified 34 species as radiosensitive. Whereas IR-sensitive species (<i>D</i><sub>10</sub> < 200 Gy) in the training data set had been confined to the phylum <i>Proteobacterium</i>, this initial TolRad screen identified radiosensitive bacteria in two additional phyla. We experimentally validated the predicted radiosensitivity of a <i>Bacteroidota</i> species from the human microbiome. To demonstrate that TolRad can be applied to metagenome-assembled genomes (MAGs), we tested the accuracy of TolRad on Egg-NOG assembled proteomes (0.965) and partial proteomes. Finally, three collections of MAGs were screened using TolRad, identifying further phyla with radiosensitive species and suggesting that environmental conditions influence the abundance of radiosensitive bacteria.</p><p><strong>Importance: </strong>Bacterial species have vast genetic diversity, allowing for life in extreme environments and the conduction of complex chemistry. The ability to harness the full potential of bacterial diversity is hampered by the lack of high-throughput experimental or bioinformatic methods for characterizing bacterial traits. Here, we present a computational model that uses <i>de novo</i>-generated genome annotations to classify a bacterium as tolerant of ionizing radiation (IR) or as radiosensitive. This model allows for rapid screening of bacterial communities for low-tolerance species that are of interest for both mechanistic studies into bacterial sensitivity to IR and biomarkers of IR exposure.</p>","PeriodicalId":18670,"journal":{"name":"Microbiology spectrum","volume":null,"pages":null},"PeriodicalIF":3.7000,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11466087/pdf/","citationCount":"0","resultStr":"{\"title\":\"TolRad, a model for predicting radiation tolerance using Pfam annotations, identifies novel radiosensitive bacterial species from reference genomes and MAGs.\",\"authors\":\"Philip Sweet, Matthew Burroughs, Sungyeon Jang, Lydia Contreras\",\"doi\":\"10.1128/spectrum.03838-23\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The trait of ionizing radiation (IR) tolerance is variable between bacterium, with species succumbing to acute doses as low as 60 Gy and extremophiles able to survive doses exceeding 10,000 Gy. While survival screens have identified multiple highly radioresistant bacteria, such systemic searches have not been conducted for IR-sensitive bacteria. The taxonomy-level diversity of IR sensitivity is poorly understood, as are genetic elements that influence IR sensitivity. Using the protein domain (Pfam) frequencies from 61 bacterial species with experimentally determined <i>D</i><sub>10</sub> values (the dose at which only 10% of the population survives), we trained TolRad, a random forest binary classifier, to distinguish between radiosensitive (<i>D</i><sub>10</sub> < 200 Gy) and radiation-tolerant (<i>D</i><sub>10</sub> > 200 Gy) bacteria. On untrained species, TolRad had an accuracy of 0.900. We applied TolRad to 152 UniProt-hosted bacterial proteomes associated with the human microbiome, including 37 strains from the ATCC Human Microbiome Collection, and classified 34 species as radiosensitive. Whereas IR-sensitive species (<i>D</i><sub>10</sub> < 200 Gy) in the training data set had been confined to the phylum <i>Proteobacterium</i>, this initial TolRad screen identified radiosensitive bacteria in two additional phyla. We experimentally validated the predicted radiosensitivity of a <i>Bacteroidota</i> species from the human microbiome. To demonstrate that TolRad can be applied to metagenome-assembled genomes (MAGs), we tested the accuracy of TolRad on Egg-NOG assembled proteomes (0.965) and partial proteomes. Finally, three collections of MAGs were screened using TolRad, identifying further phyla with radiosensitive species and suggesting that environmental conditions influence the abundance of radiosensitive bacteria.</p><p><strong>Importance: </strong>Bacterial species have vast genetic diversity, allowing for life in extreme environments and the conduction of complex chemistry. The ability to harness the full potential of bacterial diversity is hampered by the lack of high-throughput experimental or bioinformatic methods for characterizing bacterial traits. Here, we present a computational model that uses <i>de novo</i>-generated genome annotations to classify a bacterium as tolerant of ionizing radiation (IR) or as radiosensitive. This model allows for rapid screening of bacterial communities for low-tolerance species that are of interest for both mechanistic studies into bacterial sensitivity to IR and biomarkers of IR exposure.</p>\",\"PeriodicalId\":18670,\"journal\":{\"name\":\"Microbiology spectrum\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11466087/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microbiology spectrum\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/spectrum.03838-23\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/9/5 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiology spectrum","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/spectrum.03838-23","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/5 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
TolRad, a model for predicting radiation tolerance using Pfam annotations, identifies novel radiosensitive bacterial species from reference genomes and MAGs.
The trait of ionizing radiation (IR) tolerance is variable between bacterium, with species succumbing to acute doses as low as 60 Gy and extremophiles able to survive doses exceeding 10,000 Gy. While survival screens have identified multiple highly radioresistant bacteria, such systemic searches have not been conducted for IR-sensitive bacteria. The taxonomy-level diversity of IR sensitivity is poorly understood, as are genetic elements that influence IR sensitivity. Using the protein domain (Pfam) frequencies from 61 bacterial species with experimentally determined D10 values (the dose at which only 10% of the population survives), we trained TolRad, a random forest binary classifier, to distinguish between radiosensitive (D10 < 200 Gy) and radiation-tolerant (D10 > 200 Gy) bacteria. On untrained species, TolRad had an accuracy of 0.900. We applied TolRad to 152 UniProt-hosted bacterial proteomes associated with the human microbiome, including 37 strains from the ATCC Human Microbiome Collection, and classified 34 species as radiosensitive. Whereas IR-sensitive species (D10 < 200 Gy) in the training data set had been confined to the phylum Proteobacterium, this initial TolRad screen identified radiosensitive bacteria in two additional phyla. We experimentally validated the predicted radiosensitivity of a Bacteroidota species from the human microbiome. To demonstrate that TolRad can be applied to metagenome-assembled genomes (MAGs), we tested the accuracy of TolRad on Egg-NOG assembled proteomes (0.965) and partial proteomes. Finally, three collections of MAGs were screened using TolRad, identifying further phyla with radiosensitive species and suggesting that environmental conditions influence the abundance of radiosensitive bacteria.
Importance: Bacterial species have vast genetic diversity, allowing for life in extreme environments and the conduction of complex chemistry. The ability to harness the full potential of bacterial diversity is hampered by the lack of high-throughput experimental or bioinformatic methods for characterizing bacterial traits. Here, we present a computational model that uses de novo-generated genome annotations to classify a bacterium as tolerant of ionizing radiation (IR) or as radiosensitive. This model allows for rapid screening of bacterial communities for low-tolerance species that are of interest for both mechanistic studies into bacterial sensitivity to IR and biomarkers of IR exposure.
期刊介绍:
Microbiology Spectrum publishes commissioned review articles on topics in microbiology representing ten content areas: Archaea; Food Microbiology; Bacterial Genetics, Cell Biology, and Physiology; Clinical Microbiology; Environmental Microbiology and Ecology; Eukaryotic Microbes; Genomics, Computational, and Synthetic Microbiology; Immunology; Pathogenesis; and Virology. Reviews are interrelated, with each review linking to other related content. A large board of Microbiology Spectrum editors aids in the development of topics for potential reviews and in the identification of an editor, or editors, who shepherd each collection.