Ameya Harmalkar, Lee-Shin Chu, Samuel W Canner, Rituparna Samanta, Rahel Frick, Fatima A Davila-Hernandez, Sudeep Sarma, Fatima Hitawala, Jeffrey J Gray
{"title":"Docking With Rosetta and Deep Learning Approaches in CAPRI Rounds 47-55.","authors":"Ameya Harmalkar, Lee-Shin Chu, Samuel W Canner, Rituparna Samanta, Rahel Frick, Fatima A Davila-Hernandez, Sudeep Sarma, Fatima Hitawala, Jeffrey J Gray","doi":"10.1002/prot.70016","DOIUrl":null,"url":null,"abstract":"<p><p>Critical Assessment of PRediction of Interactions (CAPRI) rounds 47 through 55 introduced 49 targets comprising multistage assemblies, antibody-antigen complexes, and flexible interfaces. For these rounds, we combined various Rosetta docking approaches (RosettaDock, ReplicaDock, and SymDock) with deep learning approaches (AlphaFold2, IgFold, and AlphaRED). Since prior CAPRI rounds, we have developed methods to better capture conformational changes, updated our scoring function, and integrated structure prediction tools such as AlphaFold2 in our docking routines. Here, we highlight several notable CAPRI targets and address the major challenges in the blind prediction of protein-protein interactions, including binding-induced conformational changes, large multimeric proteins, and antibody-antigen interactions. Although predictors have achieved modest improvements in accuracy for simpler targets post-AlphaFold2, performance for more flexible complexes remains limited. We employed RosettaDock 4.0, ReplicaDock 2.0, and AlphaRED to enhance backbone conformational sampling for flexible complexes. Our docking routines improved the DockQ score (0.77 vs. 0.62 for AF2-multimer) for a GP2 bacteriophage protein (T194), effectively capturing binding-induced conformational changes. Additionally, we introduce a fold-and-dock approach for predicting the assembly of a surface-layer SAP protein derived from Bacillus anthracis (T160), a large hetero-multimer comprising six distinct sub-units. For large symmetric complexes, we used Rosetta-based SymDock 2.0, successfully predicting a human DNA repair protein complex with A10 stoichiometry (T230) with high CAPRI-quality ranking. We also address the challenges in modeling antibody/nanobody-antigen interactions, particularly through the integration of deep learning tools and docking methods. Despite advances with tools like IgFold and AlphaFold2, accurately predicting CDR H3 loops and antibody-antigen binding interfaces remains challenging. Combining ReplicaDock 2.0 with deep learning highlights these difficulties and underscores the need for extensive sampling and CDR-focused strategies to improve prediction accuracy.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462888/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70016","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Critical Assessment of PRediction of Interactions (CAPRI) rounds 47 through 55 introduced 49 targets comprising multistage assemblies, antibody-antigen complexes, and flexible interfaces. For these rounds, we combined various Rosetta docking approaches (RosettaDock, ReplicaDock, and SymDock) with deep learning approaches (AlphaFold2, IgFold, and AlphaRED). Since prior CAPRI rounds, we have developed methods to better capture conformational changes, updated our scoring function, and integrated structure prediction tools such as AlphaFold2 in our docking routines. Here, we highlight several notable CAPRI targets and address the major challenges in the blind prediction of protein-protein interactions, including binding-induced conformational changes, large multimeric proteins, and antibody-antigen interactions. Although predictors have achieved modest improvements in accuracy for simpler targets post-AlphaFold2, performance for more flexible complexes remains limited. We employed RosettaDock 4.0, ReplicaDock 2.0, and AlphaRED to enhance backbone conformational sampling for flexible complexes. Our docking routines improved the DockQ score (0.77 vs. 0.62 for AF2-multimer) for a GP2 bacteriophage protein (T194), effectively capturing binding-induced conformational changes. Additionally, we introduce a fold-and-dock approach for predicting the assembly of a surface-layer SAP protein derived from Bacillus anthracis (T160), a large hetero-multimer comprising six distinct sub-units. For large symmetric complexes, we used Rosetta-based SymDock 2.0, successfully predicting a human DNA repair protein complex with A10 stoichiometry (T230) with high CAPRI-quality ranking. We also address the challenges in modeling antibody/nanobody-antigen interactions, particularly through the integration of deep learning tools and docking methods. Despite advances with tools like IgFold and AlphaFold2, accurately predicting CDR H3 loops and antibody-antigen binding interfaces remains challenging. Combining ReplicaDock 2.0 with deep learning highlights these difficulties and underscores the need for extensive sampling and CDR-focused strategies to improve prediction accuracy.
期刊介绍:
PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.