当模拟遇到机器学习:重新定义蛋白质-糖胺聚糖系统的分子对接

IF 4.8 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Krzysztof K. Bojarski
{"title":"当模拟遇到机器学习:重新定义蛋白质-糖胺聚糖系统的分子对接","authors":"Krzysztof K. Bojarski","doi":"10.1002/jcc.70161","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Glycosaminoglycans (GAGs) are linear, negatively charged carbohydrates that modulate enzymatic activity in the extracellular matrix. Their high flexibility and specificity in protein-GAG interactions pose challenges for both experimental and computational studies. Here, the repulsive scaling replica exchange molecular dynamics (RS-REMD) method, combined with molecular mechanics generalized born surface area (MM-GBSA), was implemented using the CHARMM36m force field to evaluate its ability to guide ligands to their native binding sites in seven protein-GAG/carbohydrate complexes. A five machine learning (ML)-based models including fully connected neural network (FCNN), linear regression, LightGBM, random forest and support vector regressor (SVR) were also trained to predict binding accuracy (RMSatd) based on MM-GBSA energy components, protein-GAG distances, and hydrogen bond counts. While MM-GBSA values showed weak to moderate correlations with RMSatd, most of the trained AI models significantly improved the selection of native-like binding poses with Random Forest model providing most accurate predictions. This study highlights the potential of integrating simulations with ML to refine molecular docking for flexible ligands like GAGs.</p>\n </div>","PeriodicalId":188,"journal":{"name":"Journal of Computational Chemistry","volume":"46 17","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"When Simulations Meet Machine Learning: Redefining Molecular Docking for Protein-Glycosaminoglycan Systems\",\"authors\":\"Krzysztof K. Bojarski\",\"doi\":\"10.1002/jcc.70161\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Glycosaminoglycans (GAGs) are linear, negatively charged carbohydrates that modulate enzymatic activity in the extracellular matrix. Their high flexibility and specificity in protein-GAG interactions pose challenges for both experimental and computational studies. Here, the repulsive scaling replica exchange molecular dynamics (RS-REMD) method, combined with molecular mechanics generalized born surface area (MM-GBSA), was implemented using the CHARMM36m force field to evaluate its ability to guide ligands to their native binding sites in seven protein-GAG/carbohydrate complexes. A five machine learning (ML)-based models including fully connected neural network (FCNN), linear regression, LightGBM, random forest and support vector regressor (SVR) were also trained to predict binding accuracy (RMSatd) based on MM-GBSA energy components, protein-GAG distances, and hydrogen bond counts. While MM-GBSA values showed weak to moderate correlations with RMSatd, most of the trained AI models significantly improved the selection of native-like binding poses with Random Forest model providing most accurate predictions. This study highlights the potential of integrating simulations with ML to refine molecular docking for flexible ligands like GAGs.</p>\\n </div>\",\"PeriodicalId\":188,\"journal\":{\"name\":\"Journal of Computational Chemistry\",\"volume\":\"46 17\",\"pages\":\"\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70161\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70161","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

糖胺聚糖(GAGs)是线性的,带负电荷的碳水化合物,可调节细胞外基质中的酶活性。它们在蛋白质- gag相互作用中的高灵活性和特异性对实验和计算研究都提出了挑战。本研究采用排斥尺度复制交换分子动力学(RS-REMD)方法,结合分子力学广义出生表面积(MM-GBSA),利用CHARMM36m力场评价其在7种蛋白质- gag /碳水化合物复合物中引导配体到达天然结合位点的能力。基于全连接神经网络(FCNN)、线性回归、LightGBM、随机森林和支持向量回归(SVR)等5种机器学习(ML)模型进行训练,以预测基于MM-GBSA能量成分、蛋白质- gag距离和氢键计数的结合精度(RMSatd)。虽然MM-GBSA值与RMSatd呈弱至中度相关性,但大多数训练后的人工智能模型显著改善了对原生类结合姿势的选择,其中随机森林模型提供了最准确的预测。这项研究强调了将模拟与ML相结合以改进柔性配体(如GAGs)的分子对接的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

When Simulations Meet Machine Learning: Redefining Molecular Docking for Protein-Glycosaminoglycan Systems

When Simulations Meet Machine Learning: Redefining Molecular Docking for Protein-Glycosaminoglycan Systems

Glycosaminoglycans (GAGs) are linear, negatively charged carbohydrates that modulate enzymatic activity in the extracellular matrix. Their high flexibility and specificity in protein-GAG interactions pose challenges for both experimental and computational studies. Here, the repulsive scaling replica exchange molecular dynamics (RS-REMD) method, combined with molecular mechanics generalized born surface area (MM-GBSA), was implemented using the CHARMM36m force field to evaluate its ability to guide ligands to their native binding sites in seven protein-GAG/carbohydrate complexes. A five machine learning (ML)-based models including fully connected neural network (FCNN), linear regression, LightGBM, random forest and support vector regressor (SVR) were also trained to predict binding accuracy (RMSatd) based on MM-GBSA energy components, protein-GAG distances, and hydrogen bond counts. While MM-GBSA values showed weak to moderate correlations with RMSatd, most of the trained AI models significantly improved the selection of native-like binding poses with Random Forest model providing most accurate predictions. This study highlights the potential of integrating simulations with ML to refine molecular docking for flexible ligands like GAGs.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.60
自引率
3.30%
发文量
247
审稿时长
1.7 months
期刊介绍: This distinguished journal publishes articles concerned with all aspects of computational chemistry: analytical, biological, inorganic, organic, physical, and materials. The Journal of Computational Chemistry presents original research, contemporary developments in theory and methodology, and state-of-the-art applications. Computational areas that are featured in the journal include ab initio and semiempirical quantum mechanics, density functional theory, molecular mechanics, molecular dynamics, statistical mechanics, cheminformatics, biomolecular structure prediction, molecular design, and bioinformatics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信