TolRad, a model for predicting radiation tolerance using Pfam annotations, identifies novel radiosensitive bacterial species from reference genomes and MAGs.

IF 3.7 2区 生物学 Q2 MICROBIOLOGY
Philip Sweet, Matthew Burroughs, Sungyeon Jang, Lydia Contreras
{"title":"TolRad, a model for predicting radiation tolerance using Pfam annotations, identifies novel radiosensitive bacterial species from reference genomes and MAGs.","authors":"Philip Sweet, Matthew Burroughs, Sungyeon Jang, Lydia Contreras","doi":"10.1128/spectrum.03838-23","DOIUrl":null,"url":null,"abstract":"<p><p>The trait of ionizing radiation (IR) tolerance is variable between bacterium, with species succumbing to acute doses as low as 60 Gy and extremophiles able to survive doses exceeding 10,000 Gy. While survival screens have identified multiple highly radioresistant bacteria, such systemic searches have not been conducted for IR-sensitive bacteria. The taxonomy-level diversity of IR sensitivity is poorly understood, as are genetic elements that influence IR sensitivity. Using the protein domain (Pfam) frequencies from 61 bacterial species with experimentally determined <i>D</i><sub>10</sub> values (the dose at which only 10% of the population survives), we trained TolRad, a random forest binary classifier, to distinguish between radiosensitive (<i>D</i><sub>10</sub> < 200 Gy) and radiation-tolerant (<i>D</i><sub>10</sub> > 200 Gy) bacteria. On untrained species, TolRad had an accuracy of 0.900. We applied TolRad to 152 UniProt-hosted bacterial proteomes associated with the human microbiome, including 37 strains from the ATCC Human Microbiome Collection, and classified 34 species as radiosensitive. Whereas IR-sensitive species (<i>D</i><sub>10</sub> < 200 Gy) in the training data set had been confined to the phylum <i>Proteobacterium</i>, this initial TolRad screen identified radiosensitive bacteria in two additional phyla. We experimentally validated the predicted radiosensitivity of a <i>Bacteroidota</i> species from the human microbiome. To demonstrate that TolRad can be applied to metagenome-assembled genomes (MAGs), we tested the accuracy of TolRad on Egg-NOG assembled proteomes (0.965) and partial proteomes. Finally, three collections of MAGs were screened using TolRad, identifying further phyla with radiosensitive species and suggesting that environmental conditions influence the abundance of radiosensitive bacteria.</p><p><strong>Importance: </strong>Bacterial species have vast genetic diversity, allowing for life in extreme environments and the conduction of complex chemistry. The ability to harness the full potential of bacterial diversity is hampered by the lack of high-throughput experimental or bioinformatic methods for characterizing bacterial traits. Here, we present a computational model that uses <i>de novo</i>-generated genome annotations to classify a bacterium as tolerant of ionizing radiation (IR) or as radiosensitive. This model allows for rapid screening of bacterial communities for low-tolerance species that are of interest for both mechanistic studies into bacterial sensitivity to IR and biomarkers of IR exposure.</p>","PeriodicalId":18670,"journal":{"name":"Microbiology spectrum","volume":null,"pages":null},"PeriodicalIF":3.7000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiology spectrum","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/spectrum.03838-23","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The trait of ionizing radiation (IR) tolerance is variable between bacterium, with species succumbing to acute doses as low as 60 Gy and extremophiles able to survive doses exceeding 10,000 Gy. While survival screens have identified multiple highly radioresistant bacteria, such systemic searches have not been conducted for IR-sensitive bacteria. The taxonomy-level diversity of IR sensitivity is poorly understood, as are genetic elements that influence IR sensitivity. Using the protein domain (Pfam) frequencies from 61 bacterial species with experimentally determined D10 values (the dose at which only 10% of the population survives), we trained TolRad, a random forest binary classifier, to distinguish between radiosensitive (D10 < 200 Gy) and radiation-tolerant (D10 > 200 Gy) bacteria. On untrained species, TolRad had an accuracy of 0.900. We applied TolRad to 152 UniProt-hosted bacterial proteomes associated with the human microbiome, including 37 strains from the ATCC Human Microbiome Collection, and classified 34 species as radiosensitive. Whereas IR-sensitive species (D10 < 200 Gy) in the training data set had been confined to the phylum Proteobacterium, this initial TolRad screen identified radiosensitive bacteria in two additional phyla. We experimentally validated the predicted radiosensitivity of a Bacteroidota species from the human microbiome. To demonstrate that TolRad can be applied to metagenome-assembled genomes (MAGs), we tested the accuracy of TolRad on Egg-NOG assembled proteomes (0.965) and partial proteomes. Finally, three collections of MAGs were screened using TolRad, identifying further phyla with radiosensitive species and suggesting that environmental conditions influence the abundance of radiosensitive bacteria.

Importance: Bacterial species have vast genetic diversity, allowing for life in extreme environments and the conduction of complex chemistry. The ability to harness the full potential of bacterial diversity is hampered by the lack of high-throughput experimental or bioinformatic methods for characterizing bacterial traits. Here, we present a computational model that uses de novo-generated genome annotations to classify a bacterium as tolerant of ionizing radiation (IR) or as radiosensitive. This model allows for rapid screening of bacterial communities for low-tolerance species that are of interest for both mechanistic studies into bacterial sensitivity to IR and biomarkers of IR exposure.

TolRad是一个利用Pfam注释预测辐射耐受性的模型,它能从参考基因组和MAGs中识别新型辐射敏感细菌物种。
细菌对电离辐射(IR)的耐受性各不相同,有的细菌会屈服于低至 60 Gy 的急性剂量,而嗜极细菌则能在超过 10,000 Gy 的剂量下存活。虽然生存筛选已经发现了多种高度抗辐射细菌,但尚未对红外敏感细菌进行此类系统搜索。人们对红外敏感性在分类一级的多样性以及影响红外敏感性的遗传因子知之甚少。利用 61 个细菌物种的蛋白质结构域(Pfam)频率和实验确定的 D10 值(只有 10% 的种群能存活的剂量),我们训练了随机森林二元分类器 TolRad,以区分辐射敏感细菌(D10 < 200 Gy)和辐射耐受细菌(D10 > 200 Gy)。对于未经训练的物种,TolRad 的准确率为 0.900。我们将 TolRad 应用于 152 个与人类微生物组相关的 UniProt 寄存细菌蛋白质组,其中包括来自 ATCC 人类微生物组收集的 37 株菌株,并将 34 个物种归类为辐射敏感菌。训练数据集中对红外敏感的物种(D10 < 200 Gy)仅限于变形菌门,而这次 TolRad 的初步筛选在另外两个菌门中发现了对辐射敏感的细菌。我们通过实验验证了人类微生物组中一个类杆菌属物种的辐射敏感性预测结果。为了证明 TolRad 可用于元基因组组装基因组(MAG),我们测试了 TolRad 在 Egg-NOG 组装蛋白质组(0.965)和部分蛋白质组上的准确性。最后,我们使用 TolRad 筛选了三个 MAGs 库,发现了更多具有辐射敏感物种的门类,并表明环境条件会影响辐射敏感细菌的数量:细菌物种具有巨大的遗传多样性,能够在极端环境中生存并进行复杂的化学反应。由于缺乏表征细菌特征的高通量实验或生物信息学方法,人们无法充分利用细菌多样性的潜力。在这里,我们提出了一个计算模型,利用新生成的基因组注释将细菌分为耐电离辐射(IR)细菌和辐射敏感细菌。该模型可快速筛选细菌群落中的低耐受性物种,这些物种对细菌对 IR 敏感性的机理研究和 IR 暴露的生物标志物都很有意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Microbiology spectrum
Microbiology spectrum Biochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.20
自引率
5.40%
发文量
1800
期刊介绍: Microbiology Spectrum publishes commissioned review articles on topics in microbiology representing ten content areas: Archaea; Food Microbiology; Bacterial Genetics, Cell Biology, and Physiology; Clinical Microbiology; Environmental Microbiology and Ecology; Eukaryotic Microbes; Genomics, Computational, and Synthetic Microbiology; Immunology; Pathogenesis; and Virology. Reviews are interrelated, with each review linking to other related content. A large board of Microbiology Spectrum editors aids in the development of topics for potential reviews and in the identification of an editor, or editors, who shepherd each collection.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信