{"title":"Protein–peptide docking with a rational and accurate diffusion generative model","authors":"Huifeng Zhao, Odin Zhang, Dejun Jiang, Zhenxing Wu, Hongyan Du, Xiaorui Wang, Yihao Zhao, Yuansheng Huang, Jingxuan Ge, Tingjun Hou, Yu Kang","doi":"10.1038/s42256-025-01077-9","DOIUrl":null,"url":null,"abstract":"Therapeutic peptides represent the forefront of drug discovery, offering potent and safe alternatives to traditional small molecules. However, their weak and context-dependent nature complicates the efficient virtual screening and structural characterization of protein–peptide patterns. Here we introduce RAPiDock, a diffusion generative model designed for rational, accurate and rapid protein–peptide docking at an all-atomic level. RAPiDock efficiently reduces the sampling space by incorporating physical constraints and uses a bi-scale graph to effectively capture multidimensional structural information while balancing efficiency. In addition, the model uses a Clebsch–Gordan tensor product-based architecture to ensure physical symmetry. RAPiDock outperforms existing tools in prediction of protein–peptide-binding patterns, achieving a 93.7% success rate at top-25 predictions (13.4% higher than AlphaFold2-Multimer), with an execution speed of 0.35 seconds per complex (~270 times faster than AlphaFold2-Multimer). Extensive experiments demonstrate RAPiDock’s remarkable ability to handle 92 types of residue including posttranslational modifications, accurately predict subtle docking patterns, successfully identify multiple potential peptide-binding sites in global docking and serve as a powerful tool for high-throughput virtual screening with structural precision. All these push the boundaries of efficient protein–peptide docking in multiple real-application scenarios. Zhao et al. present RAPiDock, an all-atom diffusion model that predicts peptide–protein binding patterns across 92 amino acid types, enabling high-throughput virtual screening for advancing therapeutic peptide design.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 8","pages":"1308-1321"},"PeriodicalIF":23.9000,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Machine Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.nature.com/articles/s42256-025-01077-9","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Therapeutic peptides represent the forefront of drug discovery, offering potent and safe alternatives to traditional small molecules. However, their weak and context-dependent nature complicates the efficient virtual screening and structural characterization of protein–peptide patterns. Here we introduce RAPiDock, a diffusion generative model designed for rational, accurate and rapid protein–peptide docking at an all-atomic level. RAPiDock efficiently reduces the sampling space by incorporating physical constraints and uses a bi-scale graph to effectively capture multidimensional structural information while balancing efficiency. In addition, the model uses a Clebsch–Gordan tensor product-based architecture to ensure physical symmetry. RAPiDock outperforms existing tools in prediction of protein–peptide-binding patterns, achieving a 93.7% success rate at top-25 predictions (13.4% higher than AlphaFold2-Multimer), with an execution speed of 0.35 seconds per complex (~270 times faster than AlphaFold2-Multimer). Extensive experiments demonstrate RAPiDock’s remarkable ability to handle 92 types of residue including posttranslational modifications, accurately predict subtle docking patterns, successfully identify multiple potential peptide-binding sites in global docking and serve as a powerful tool for high-throughput virtual screening with structural precision. All these push the boundaries of efficient protein–peptide docking in multiple real-application scenarios. Zhao et al. present RAPiDock, an all-atom diffusion model that predicts peptide–protein binding patterns across 92 amino acid types, enabling high-throughput virtual screening for advancing therapeutic peptide design.
期刊介绍:
Nature Machine Intelligence is a distinguished publication that presents original research and reviews on various topics in machine learning, robotics, and AI. Our focus extends beyond these fields, exploring their profound impact on other scientific disciplines, as well as societal and industrial aspects. We recognize limitless possibilities wherein machine intelligence can augment human capabilities and knowledge in domains like scientific exploration, healthcare, medical diagnostics, and the creation of safe and sustainable cities, transportation, and agriculture. Simultaneously, we acknowledge the emergence of ethical, social, and legal concerns due to the rapid pace of advancements.
To foster interdisciplinary discussions on these far-reaching implications, Nature Machine Intelligence serves as a platform for dialogue facilitated through Comments, News Features, News & Views articles, and Correspondence. Our goal is to encourage a comprehensive examination of these subjects.
Similar to all Nature-branded journals, Nature Machine Intelligence operates under the guidance of a team of skilled editors. We adhere to a fair and rigorous peer-review process, ensuring high standards of copy-editing and production, swift publication, and editorial independence.