Elton J F Chaves, João Sartori, Whendel M Santos, Carlos H B Cruz, Emmanuel N Mhrous, Manassés F Nacimento-Filho, Matheus V F Ferraz, Roberto D Lins
{"title":"用超级学习器模型估计蛋白质的绝对结合自由能。","authors":"Elton J F Chaves, João Sartori, Whendel M Santos, Carlos H B Cruz, Emmanuel N Mhrous, Manassés F Nacimento-Filho, Matheus V F Ferraz, Roberto D Lins","doi":"10.1021/acs.jcim.4c01641","DOIUrl":null,"url":null,"abstract":"<p><p>Protein-protein binding is central to most biochemical processes of all living beings. Its importance underlies mechanisms ranging from cell interactions to metabolic control, but also to <i>ex vivo</i> biotechnology, such as the development of therapeutic monoclonal antibodies, the engineering of enzymes for industrial biocatalysis, the development of biosensors for disease detection, and the assembly of artificial protein complexes for drug screening. Therefore, predicting the strength of their association allows for understanding the molecular mechanisms and ultimately controlling them. We devised a machine learning ensemble model that uses Rosetta-based quantities to predict binding free energies of protein-protein complexes with accuracy rivaling both computationally demanding methods and currently available ML/DL tools. The method was encoded into an application Python pipeline named PBEE, which stands for Protein Binding Energy Estimator, allowing a rapid calculation of the absolute binding free energies of protein complexes from their PDB coordinates.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"2602-2609"},"PeriodicalIF":5.3000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11898044/pdf/","citationCount":"0","resultStr":"{\"title\":\"Estimating Absolute Protein-Protein Binding Free Energies by a Super Learner Model.\",\"authors\":\"Elton J F Chaves, João Sartori, Whendel M Santos, Carlos H B Cruz, Emmanuel N Mhrous, Manassés F Nacimento-Filho, Matheus V F Ferraz, Roberto D Lins\",\"doi\":\"10.1021/acs.jcim.4c01641\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Protein-protein binding is central to most biochemical processes of all living beings. Its importance underlies mechanisms ranging from cell interactions to metabolic control, but also to <i>ex vivo</i> biotechnology, such as the development of therapeutic monoclonal antibodies, the engineering of enzymes for industrial biocatalysis, the development of biosensors for disease detection, and the assembly of artificial protein complexes for drug screening. Therefore, predicting the strength of their association allows for understanding the molecular mechanisms and ultimately controlling them. We devised a machine learning ensemble model that uses Rosetta-based quantities to predict binding free energies of protein-protein complexes with accuracy rivaling both computationally demanding methods and currently available ML/DL tools. The method was encoded into an application Python pipeline named PBEE, which stands for Protein Binding Energy Estimator, allowing a rapid calculation of the absolute binding free energies of protein complexes from their PDB coordinates.</p>\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\" \",\"pages\":\"2602-2609\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11898044/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.4c01641\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01641","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/20 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
摘要
蛋白质与蛋白质的结合是所有生物大多数生化过程的核心。从细胞相互作用到代谢控制,以及离体生物技术,如治疗性单克隆抗体的开发、用于工业生物催化的酶工程、用于疾病检测的生物传感器的开发,以及用于药物筛选的人工蛋白质复合物的组装,都是其重要性的基础。因此,预测它们的关联强度有助于理解分子机制并最终控制它们。我们设计了一个机器学习集成模型,该模型使用基于rosetta的量来预测蛋白质-蛋白质复合物的结合自由能,其精度可与计算要求高的方法和当前可用的ML/DL工具相媲美。该方法被编码到一个名为PBEE (Protein Binding Energy Estimator)的Python应用程序管道中,可以根据蛋白质复合物的PDB坐标快速计算出它们的绝对结合自由能。
Estimating Absolute Protein-Protein Binding Free Energies by a Super Learner Model.
Protein-protein binding is central to most biochemical processes of all living beings. Its importance underlies mechanisms ranging from cell interactions to metabolic control, but also to ex vivo biotechnology, such as the development of therapeutic monoclonal antibodies, the engineering of enzymes for industrial biocatalysis, the development of biosensors for disease detection, and the assembly of artificial protein complexes for drug screening. Therefore, predicting the strength of their association allows for understanding the molecular mechanisms and ultimately controlling them. We devised a machine learning ensemble model that uses Rosetta-based quantities to predict binding free energies of protein-protein complexes with accuracy rivaling both computationally demanding methods and currently available ML/DL tools. The method was encoded into an application Python pipeline named PBEE, which stands for Protein Binding Energy Estimator, allowing a rapid calculation of the absolute binding free energies of protein complexes from their PDB coordinates.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.