{"title":"When Simulations Meet Machine Learning: Redefining Molecular Docking for Protein-Glycosaminoglycan Systems","authors":"Krzysztof K. Bojarski","doi":"10.1002/jcc.70161","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Glycosaminoglycans (GAGs) are linear, negatively charged carbohydrates that modulate enzymatic activity in the extracellular matrix. Their high flexibility and specificity in protein-GAG interactions pose challenges for both experimental and computational studies. Here, the repulsive scaling replica exchange molecular dynamics (RS-REMD) method, combined with molecular mechanics generalized born surface area (MM-GBSA), was implemented using the CHARMM36m force field to evaluate its ability to guide ligands to their native binding sites in seven protein-GAG/carbohydrate complexes. A five machine learning (ML)-based models including fully connected neural network (FCNN), linear regression, LightGBM, random forest and support vector regressor (SVR) were also trained to predict binding accuracy (RMSatd) based on MM-GBSA energy components, protein-GAG distances, and hydrogen bond counts. While MM-GBSA values showed weak to moderate correlations with RMSatd, most of the trained AI models significantly improved the selection of native-like binding poses with Random Forest model providing most accurate predictions. This study highlights the potential of integrating simulations with ML to refine molecular docking for flexible ligands like GAGs.</p>\n </div>","PeriodicalId":188,"journal":{"name":"Journal of Computational Chemistry","volume":"46 17","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70161","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Glycosaminoglycans (GAGs) are linear, negatively charged carbohydrates that modulate enzymatic activity in the extracellular matrix. Their high flexibility and specificity in protein-GAG interactions pose challenges for both experimental and computational studies. Here, the repulsive scaling replica exchange molecular dynamics (RS-REMD) method, combined with molecular mechanics generalized born surface area (MM-GBSA), was implemented using the CHARMM36m force field to evaluate its ability to guide ligands to their native binding sites in seven protein-GAG/carbohydrate complexes. A five machine learning (ML)-based models including fully connected neural network (FCNN), linear regression, LightGBM, random forest and support vector regressor (SVR) were also trained to predict binding accuracy (RMSatd) based on MM-GBSA energy components, protein-GAG distances, and hydrogen bond counts. While MM-GBSA values showed weak to moderate correlations with RMSatd, most of the trained AI models significantly improved the selection of native-like binding poses with Random Forest model providing most accurate predictions. This study highlights the potential of integrating simulations with ML to refine molecular docking for flexible ligands like GAGs.
期刊介绍:
This distinguished journal publishes articles concerned with all aspects of computational chemistry: analytical, biological, inorganic, organic, physical, and materials. The Journal of Computational Chemistry presents original research, contemporary developments in theory and methodology, and state-of-the-art applications. Computational areas that are featured in the journal include ab initio and semiempirical quantum mechanics, density functional theory, molecular mechanics, molecular dynamics, statistical mechanics, cheminformatics, biomolecular structure prediction, molecular design, and bioinformatics.