AI-Assisted Protein–Peptide Complex Prediction in a Practical Setting

IF 3.4 3区化学 Q2 CHEMISTRY, MULTIDISCIPLINARY

Journal of Computational Chemistry Pub Date : 2025-05-22 DOI:10.1002/jcc.70137

Darren Y. Wang, Luxuan Wang, Andrew Mi, Junmei Wang

{"title":"AI-Assisted Protein–Peptide Complex Prediction in a Practical Setting","authors":"Darren Y. Wang, Luxuan Wang, Andrew Mi, Junmei Wang","doi":"10.1002/jcc.70137","DOIUrl":null,"url":null,"abstract":"<p>Accurate prediction of protein–peptide complex structures plays a critical role in structure-based drug design, including antibody design. Most peptide-docking benchmark studies were conducted using crystal structures of protein–peptide complexes; as such, the performance of the current peptide docking tools in the practical setting is unknown. Here, the practical setting implies there are no crystal or other experimental structures for the complex, nor for the receptor and peptide. In this work, we have developed a practical docking protocol that incorporated two famous machine learning models, AlphaFold 2 for structural prediction and ANI-2x for ab initio potential prediction, to achieve a high success rate in modeling protein–peptide complex structures. The docking protocol consists of three major stages. In the first stage, the 3D structure of the receptor is predicted by AlphaFold 2 using the monomer mode, and that of the peptide is predicted by AlphaFold 2 using the multimer mode. We found that it is essential to include the receptor information to generate a high-quality 3D structure of the peptide. In the second stage, rigid protein–peptide docking is performed using ZDOCK software. In the last stage, the top 10 docking poses are relaxed and refined by ANI-2x in conjunction with our in-house geometry optimization algorithm—conjugate gradient with backtracking line search (CG-BS). CG-BS was developed by us to more efficiently perform geometry optimization, which takes the potential and force directly from ANI-2x machine learning models. The docking protocol achieved a very encouraging performance for a set of 62 very challenging protein–peptide systems which had an overall success rate of 34% if only the top 1 docking poses were considered. This success rate increased to 45% if the top 3 docking poses were considered. It is emphasized that this encouraging protein–peptide docking performance was achieved without using any crystal or experimental structures.</p>","PeriodicalId":188,"journal":{"name":"Journal of Computational Chemistry","volume":"46 14","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jcc.70137","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70137","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate prediction of protein–peptide complex structures plays a critical role in structure-based drug design, including antibody design. Most peptide-docking benchmark studies were conducted using crystal structures of protein–peptide complexes; as such, the performance of the current peptide docking tools in the practical setting is unknown. Here, the practical setting implies there are no crystal or other experimental structures for the complex, nor for the receptor and peptide. In this work, we have developed a practical docking protocol that incorporated two famous machine learning models, AlphaFold 2 for structural prediction and ANI-2x for ab initio potential prediction, to achieve a high success rate in modeling protein–peptide complex structures. The docking protocol consists of three major stages. In the first stage, the 3D structure of the receptor is predicted by AlphaFold 2 using the monomer mode, and that of the peptide is predicted by AlphaFold 2 using the multimer mode. We found that it is essential to include the receptor information to generate a high-quality 3D structure of the peptide. In the second stage, rigid protein–peptide docking is performed using ZDOCK software. In the last stage, the top 10 docking poses are relaxed and refined by ANI-2x in conjunction with our in-house geometry optimization algorithm—conjugate gradient with backtracking line search (CG-BS). CG-BS was developed by us to more efficiently perform geometry optimization, which takes the potential and force directly from ANI-2x machine learning models. The docking protocol achieved a very encouraging performance for a set of 62 very challenging protein–peptide systems which had an overall success rate of 34% if only the top 1 docking poses were considered. This success rate increased to 45% if the top 3 docking poses were considered. It is emphasized that this encouraging protein–peptide docking performance was achieved without using any crystal or experimental structures.

查看原文本刊更多论文

人工智能辅助蛋白肽复合物预测在实际设置

蛋白质-肽复合物结构的准确预测在基于结构的药物设计（包括抗体设计）中起着至关重要的作用。大多数肽对接基准研究都是利用蛋白-肽复合物的晶体结构进行的；因此，目前的肽对接工具在实际设置中的性能是未知的。在这里，实际设置意味着没有晶体或其他实验结构的复合体，也没有受体和肽。在这项工作中，我们开发了一个实用的对接协议，该协议结合了两个著名的机器学习模型，用于结构预测的AlphaFold 2和用于从头算电位预测的ANI-2x，以实现对蛋白质-肽复合物结构建模的高成功率。对接协议包括三个主要阶段。在第一阶段，AlphaFold 2使用单体模式预测受体的三维结构，AlphaFold 2使用多聚体模式预测肽的三维结构。我们发现包含受体信息对于生成高质量的肽3D结构至关重要。第二阶段，使用ZDOCK软件进行刚性蛋白肽对接。在最后阶段，ANI-2x结合我们内部的几何优化算法-共轭梯度回溯线搜索（CG-BS），对前10个对接姿势进行放松和优化。CG-BS是由我们开发的，可以更有效地执行几何优化，直接从ANI-2x机器学习模型中获取潜力和力量。对接方案对于一组62个非常具有挑战性的蛋白质-肽系统取得了非常令人鼓舞的性能，如果只考虑前1个对接姿势，则总体成功率为34%。如果考虑到前3个对接姿势，成功率增加到45%。值得强调的是，这种令人鼓舞的蛋白肽对接性能是在没有使用任何晶体或实验结构的情况下实现的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Computational Chemistry 化学-化学综合

CiteScore

6.60

自引率

3.30%

发文量

247

审稿时长

1.7 months

期刊介绍： This distinguished journal publishes articles concerned with all aspects of computational chemistry: analytical, biological, inorganic, organic, physical, and materials. The Journal of Computational Chemistry presents original research, contemporary developments in theory and methodology, and state-of-the-art applications. Computational areas that are featured in the journal include ab initio and semiempirical quantum mechanics, density functional theory, molecular mechanics, molecular dynamics, statistical mechanics, cheminformatics, biomolecular structure prediction, molecular design, and bioinformatics.