AcrDB update: Predicted 3D structures of anti-CRISPRs in human gut viromes.

IF 5.2 3区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Protein Science Pub Date : 2025-06-01 DOI:10.1002/pro.70177
Minal Khatri, N R Siva Shanmugam, Xinpeng Zhang, Revanth Sai Kumar Reddy Patel, Yanbin Yin
{"title":"AcrDB update: Predicted 3D structures of anti-CRISPRs in human gut viromes.","authors":"Minal Khatri, N R Siva Shanmugam, Xinpeng Zhang, Revanth Sai Kumar Reddy Patel, Yanbin Yin","doi":"10.1002/pro.70177","DOIUrl":null,"url":null,"abstract":"<p><p>Anti-CRISPR (Acr) proteins play a key role in phage-host interactions and hold great promise for advancing genome-editing technologies. However, finding new Acrs has been challenging due to their low sequence similarity. Recent advances in protein structure prediction have opened new pathways for Acr discovery by using 3D structure similarity. This study presents an updated AcrDB, with the following new features not available in other databases: (1) predicted Acrs from human gut virome databases, (2) Acr structures predicted by AlphaFold2, (3) a structural similarity search function to allow users to submit new sequences and structures to search against 3D structures of experimentally known Acrs. The updated AcrDB contains predicted 3D structures of 795 candidate Acrs with structural similarity (TM-score ≥0.7) to known Acrs supported by at least two of the three non-sequence similarity-based tools (TM-Vec, Foldseek, AcrPred). Among these candidate Acrs, 121 are supported by all three tools. AcrDB also includes 3D structures of 122 experimentally characterized Acr proteins. The 121 most confident candidate Acrs were combined with the 122 known Acrs and clustered into 163 sequence similarity-based Acr families. The 163 families were further subject to a structure similarity-based hierarchical clustering, revealing structural similarity between 44 candidate Acr (cAcr) families and 119 known Acr families. The bacterial hosts of these 163 Acr families are mainly from Bacillota, Pseudomonadota, and Bacteroidota, which are all dominant gut bacterial phyla. Many of these 163 Acr families are also co-localized in Acr operons. All the data and visualization are provided on our website: https://pro.unl.edu/AcrDB.</p>","PeriodicalId":20761,"journal":{"name":"Protein Science","volume":"34 6","pages":"e70177"},"PeriodicalIF":5.2000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12095918/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pro.70177","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Anti-CRISPR (Acr) proteins play a key role in phage-host interactions and hold great promise for advancing genome-editing technologies. However, finding new Acrs has been challenging due to their low sequence similarity. Recent advances in protein structure prediction have opened new pathways for Acr discovery by using 3D structure similarity. This study presents an updated AcrDB, with the following new features not available in other databases: (1) predicted Acrs from human gut virome databases, (2) Acr structures predicted by AlphaFold2, (3) a structural similarity search function to allow users to submit new sequences and structures to search against 3D structures of experimentally known Acrs. The updated AcrDB contains predicted 3D structures of 795 candidate Acrs with structural similarity (TM-score ≥0.7) to known Acrs supported by at least two of the three non-sequence similarity-based tools (TM-Vec, Foldseek, AcrPred). Among these candidate Acrs, 121 are supported by all three tools. AcrDB also includes 3D structures of 122 experimentally characterized Acr proteins. The 121 most confident candidate Acrs were combined with the 122 known Acrs and clustered into 163 sequence similarity-based Acr families. The 163 families were further subject to a structure similarity-based hierarchical clustering, revealing structural similarity between 44 candidate Acr (cAcr) families and 119 known Acr families. The bacterial hosts of these 163 Acr families are mainly from Bacillota, Pseudomonadota, and Bacteroidota, which are all dominant gut bacterial phyla. Many of these 163 Acr families are also co-localized in Acr operons. All the data and visualization are provided on our website: https://pro.unl.edu/AcrDB.

AcrDB更新:预测人类肠道病毒组中抗crispr的3D结构。
抗crispr (Acr)蛋白在噬菌体-宿主相互作用中起着关键作用,对推进基因组编辑技术具有很大的希望。然而,由于它们的序列相似性较低,寻找新的Acrs一直具有挑战性。蛋白质结构预测的最新进展为利用三维结构相似性发现Acr开辟了新的途径。本研究提出了一个更新的AcrDB,具有以下在其他数据库中没有的新功能:(1)预测来自人类肠道病毒数据库的Acr, (2) AlphaFold2预测的Acr结构,(3)结构相似性搜索功能,允许用户提交新的序列和结构来搜索实验已知的Acr的3D结构。更新后的AcrDB包含795个候选Acrs的预测3D结构,这些候选Acrs的结构相似性(tm评分≥0.7)与已知的Acrs具有至少两个基于非序列相似性的工具(TM-Vec, Foldseek, AcrPred)的支持。在这些候选acr中,有121个被所有三个工具支持。AcrDB还包括122个实验表征的Acr蛋白的3D结构。将121个最有信心的候选Acr与122个已知的Acr结合,聚类成163个基于序列相似性的Acr家族。163个家族进一步进行了基于结构相似性的分层聚类,发现44个候选Acr (cAcr)家族与119个已知Acr家族结构相似。163个Acr科细菌宿主主要来自杆菌门、假单胞菌门和拟杆菌门,均为优势肠道菌门。这163个Acr家族中的许多也在Acr操作子中共定位。所有的数据和可视化都在我们的网站上提供:https://pro.unl.edu/AcrDB。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Protein Science
Protein Science 生物-生化与分子生物学
CiteScore
12.40
自引率
1.20%
发文量
246
审稿时长
1 months
期刊介绍: Protein Science, the flagship journal of The Protein Society, is a publication that focuses on advancing fundamental knowledge in the field of protein molecules. The journal welcomes original reports and review articles that contribute to our understanding of protein function, structure, folding, design, and evolution. Additionally, Protein Science encourages papers that explore the applications of protein science in various areas such as therapeutics, protein-based biomaterials, bionanotechnology, synthetic biology, and bioelectronics. The journal accepts manuscript submissions in any suitable format for review, with the requirement of converting the manuscript to journal-style format only upon acceptance for publication. Protein Science is indexed and abstracted in numerous databases, including the Agricultural & Environmental Science Database (ProQuest), Biological Science Database (ProQuest), CAS: Chemical Abstracts Service (ACS), Embase (Elsevier), Health & Medical Collection (ProQuest), Health Research Premium Collection (ProQuest), Materials Science & Engineering Database (ProQuest), MEDLINE/PubMed (NLM), Natural Science Collection (ProQuest), and SciTech Premium Collection (ProQuest).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信