Ameya Harmalkar, Lee-Shin Chu, Samuel W Canner, Rituparna Samanta, Rahel Frick, Fatima A Davila-Hernandez, Sudeep Sarma, Fatima Hitawala, Jeffrey J Gray
{"title":"CAPRI第47-55轮与Rosetta和深度学习方法对接。","authors":"Ameya Harmalkar, Lee-Shin Chu, Samuel W Canner, Rituparna Samanta, Rahel Frick, Fatima A Davila-Hernandez, Sudeep Sarma, Fatima Hitawala, Jeffrey J Gray","doi":"10.1002/prot.70016","DOIUrl":null,"url":null,"abstract":"<p><p>Critical Assessment of PRediction of Interactions (CAPRI) rounds 47 through 55 introduced 49 targets comprising multistage assemblies, antibody-antigen complexes, and flexible interfaces. For these rounds, we combined various Rosetta docking approaches (RosettaDock, ReplicaDock, and SymDock) with deep learning approaches (AlphaFold2, IgFold, and AlphaRED). Since prior CAPRI rounds, we have developed methods to better capture conformational changes, updated our scoring function, and integrated structure prediction tools such as AlphaFold2 in our docking routines. Here, we highlight several notable CAPRI targets and address the major challenges in the blind prediction of protein-protein interactions, including binding-induced conformational changes, large multimeric proteins, and antibody-antigen interactions. Although predictors have achieved modest improvements in accuracy for simpler targets post-AlphaFold2, performance for more flexible complexes remains limited. We employed RosettaDock 4.0, ReplicaDock 2.0, and AlphaRED to enhance backbone conformational sampling for flexible complexes. Our docking routines improved the DockQ score (0.77 vs. 0.62 for AF2-multimer) for a GP2 bacteriophage protein (T194), effectively capturing binding-induced conformational changes. Additionally, we introduce a fold-and-dock approach for predicting the assembly of a surface-layer SAP protein derived from Bacillus anthracis (T160), a large hetero-multimer comprising six distinct sub-units. For large symmetric complexes, we used Rosetta-based SymDock 2.0, successfully predicting a human DNA repair protein complex with A10 stoichiometry (T230) with high CAPRI-quality ranking. We also address the challenges in modeling antibody/nanobody-antigen interactions, particularly through the integration of deep learning tools and docking methods. Despite advances with tools like IgFold and AlphaFold2, accurately predicting CDR H3 loops and antibody-antigen binding interfaces remains challenging. Combining ReplicaDock 2.0 with deep learning highlights these difficulties and underscores the need for extensive sampling and CDR-focused strategies to improve prediction accuracy.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462888/pdf/","citationCount":"0","resultStr":"{\"title\":\"Docking With Rosetta and Deep Learning Approaches in CAPRI Rounds 47-55.\",\"authors\":\"Ameya Harmalkar, Lee-Shin Chu, Samuel W Canner, Rituparna Samanta, Rahel Frick, Fatima A Davila-Hernandez, Sudeep Sarma, Fatima Hitawala, Jeffrey J Gray\",\"doi\":\"10.1002/prot.70016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Critical Assessment of PRediction of Interactions (CAPRI) rounds 47 through 55 introduced 49 targets comprising multistage assemblies, antibody-antigen complexes, and flexible interfaces. For these rounds, we combined various Rosetta docking approaches (RosettaDock, ReplicaDock, and SymDock) with deep learning approaches (AlphaFold2, IgFold, and AlphaRED). Since prior CAPRI rounds, we have developed methods to better capture conformational changes, updated our scoring function, and integrated structure prediction tools such as AlphaFold2 in our docking routines. Here, we highlight several notable CAPRI targets and address the major challenges in the blind prediction of protein-protein interactions, including binding-induced conformational changes, large multimeric proteins, and antibody-antigen interactions. Although predictors have achieved modest improvements in accuracy for simpler targets post-AlphaFold2, performance for more flexible complexes remains limited. We employed RosettaDock 4.0, ReplicaDock 2.0, and AlphaRED to enhance backbone conformational sampling for flexible complexes. Our docking routines improved the DockQ score (0.77 vs. 0.62 for AF2-multimer) for a GP2 bacteriophage protein (T194), effectively capturing binding-induced conformational changes. Additionally, we introduce a fold-and-dock approach for predicting the assembly of a surface-layer SAP protein derived from Bacillus anthracis (T160), a large hetero-multimer comprising six distinct sub-units. For large symmetric complexes, we used Rosetta-based SymDock 2.0, successfully predicting a human DNA repair protein complex with A10 stoichiometry (T230) with high CAPRI-quality ranking. We also address the challenges in modeling antibody/nanobody-antigen interactions, particularly through the integration of deep learning tools and docking methods. Despite advances with tools like IgFold and AlphaFold2, accurately predicting CDR H3 loops and antibody-antigen binding interfaces remains challenging. Combining ReplicaDock 2.0 with deep learning highlights these difficulties and underscores the need for extensive sampling and CDR-focused strategies to improve prediction accuracy.</p>\",\"PeriodicalId\":56271,\"journal\":{\"name\":\"Proteins-Structure Function and Bioinformatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462888/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proteins-Structure Function and Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/prot.70016\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70016","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
相互作用预测的关键评估(CAPRI)第47至55轮介绍了49个靶标,包括多级组装、抗体-抗原复合物和灵活的界面。在这几轮中,我们将各种Rosetta对接方法(RosettaDock、ReplicaDock和SymDock)与深度学习方法(AlphaFold2、IgFold和AlphaRED)结合起来。自之前的CAPRI轮以来,我们开发了更好地捕获构象变化的方法,更新了评分功能,并在对接例程中集成了结构预测工具,如AlphaFold2。在这里,我们强调了几个值得注意的CAPRI靶点,并解决了蛋白质相互作用盲目预测的主要挑战,包括结合诱导的构象变化、大的多聚体蛋白质和抗体-抗原相互作用。虽然预测器在alphafold2之后对更简单的目标的准确性有了适度的提高,但对更灵活的复合物的性能仍然有限。我们使用RosettaDock 4.0、ReplicaDock 2.0和AlphaRED来增强柔性配合物的主链构象采样。我们的对接程序提高了GP2噬菌体蛋白(T194)的DockQ评分(0.77 vs. af2多聚体0.62),有效地捕获了结合诱导的构象变化。此外,我们介绍了一种折叠-dock方法,用于预测来自炭疽芽孢杆菌(T160)的表层SAP蛋白的组装,这是一种由六个不同亚基组成的大型异多聚体。对于大型对称复合物,我们使用基于rosetta的SymDock 2.0,成功预测了具有A10化学计量(T230)的人类DNA修复蛋白复合物,具有较高的capri质量排名。我们还解决了建模抗体/纳米体-抗原相互作用的挑战,特别是通过深度学习工具和对接方法的集成。尽管IgFold和AlphaFold2等工具取得了进步,但准确预测CDR H3环和抗体-抗原结合界面仍然具有挑战性。将ReplicaDock 2.0与深度学习相结合,突出了这些困难,并强调需要广泛的采样和以cdr为重点的策略来提高预测准确性。
Docking With Rosetta and Deep Learning Approaches in CAPRI Rounds 47-55.
Critical Assessment of PRediction of Interactions (CAPRI) rounds 47 through 55 introduced 49 targets comprising multistage assemblies, antibody-antigen complexes, and flexible interfaces. For these rounds, we combined various Rosetta docking approaches (RosettaDock, ReplicaDock, and SymDock) with deep learning approaches (AlphaFold2, IgFold, and AlphaRED). Since prior CAPRI rounds, we have developed methods to better capture conformational changes, updated our scoring function, and integrated structure prediction tools such as AlphaFold2 in our docking routines. Here, we highlight several notable CAPRI targets and address the major challenges in the blind prediction of protein-protein interactions, including binding-induced conformational changes, large multimeric proteins, and antibody-antigen interactions. Although predictors have achieved modest improvements in accuracy for simpler targets post-AlphaFold2, performance for more flexible complexes remains limited. We employed RosettaDock 4.0, ReplicaDock 2.0, and AlphaRED to enhance backbone conformational sampling for flexible complexes. Our docking routines improved the DockQ score (0.77 vs. 0.62 for AF2-multimer) for a GP2 bacteriophage protein (T194), effectively capturing binding-induced conformational changes. Additionally, we introduce a fold-and-dock approach for predicting the assembly of a surface-layer SAP protein derived from Bacillus anthracis (T160), a large hetero-multimer comprising six distinct sub-units. For large symmetric complexes, we used Rosetta-based SymDock 2.0, successfully predicting a human DNA repair protein complex with A10 stoichiometry (T230) with high CAPRI-quality ranking. We also address the challenges in modeling antibody/nanobody-antigen interactions, particularly through the integration of deep learning tools and docking methods. Despite advances with tools like IgFold and AlphaFold2, accurately predicting CDR H3 loops and antibody-antigen binding interfaces remains challenging. Combining ReplicaDock 2.0 with deep learning highlights these difficulties and underscores the need for extensive sampling and CDR-focused strategies to improve prediction accuracy.
期刊介绍:
PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.