Aleksei Kabedev, Christel A. S. Bergström, Per Larsson
{"title":"Molecular dynamics study on micelle-small molecule interactions: developing a strategy for an extensive comparison","authors":"Aleksei Kabedev, Christel A. S. Bergström, Per Larsson","doi":"10.1007/s10822-023-00541-1","DOIUrl":"10.1007/s10822-023-00541-1","url":null,"abstract":"<div><p>Theoretical predictions of the solubilizing capacity of micelles and vesicles present in intestinal fluid are important for the development of new delivery techniques and bioavailability improvement. A balance between accuracy and computational cost is a key factor for an extensive study of numerous compounds in diverse environments. In this study, we aimed to determine an optimal molecular dynamics (MD) protocol to evaluate small-molecule interactions with micelles composed of bile salts and phospholipids. MD simulations were used to produce free energy profiles for three drug molecules (danazol, probucol, and prednisolone) and one surfactant molecule (sodium caprate) as a function of the distance from the colloid center of mass. To address the challenges associated with such tasks, we compared different simulation setups, including freely assembled colloids versus pre-organized spherical micelles, full free energy profiles versus only a few points of interest, and a coarse-grained model versus an all-atom model. Our findings demonstrate that combining these techniques is advantageous for achieving optimal performance and accuracy when evaluating the solubilization capacity of micelles.</p><h3>Graphical abstract</h3><p>All-atom (AA) and coarse-grained (CG) umbrella sampling (US) simulations and point-wise free energy (FE) calculations were compared to their efficiency to computationally analyze the solubilization of active pharmaceutical ingredients in intestinal fluid colloids.</p>\u0000<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-023-00541-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138689379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"QM assisted ML for 19F NMR chemical shift prediction","authors":"Patrick Penner, Anna Vulpetti","doi":"10.1007/s10822-023-00542-0","DOIUrl":"10.1007/s10822-023-00542-0","url":null,"abstract":"<div><div><h3>Background</h3><p>Ligand-observed 19F NMR detection is an efficient method for screening libraries of fluorinated molecules in fragment-based drug design campaigns. Screening fluorinated molecules in large mixtures makes 19F NMR a high-throughput method. Typically, these mixtures are generated from pools of well-characterized fragments. By predicting 19F NMR chemical shift, mixtures could be generated for arbitrary fluorinated molecules facilitating for example focused screens.</p><h3>Methods</h3><p>In a previous publication, we introduced a method to predict 19F NMR chemical shift using rooted fluorine fingerprints and machine learning (ML) methods. Having observed that the quality of the prediction depends on similarity to the training set, we here propose to assist the prediction with quantum mechanics (QM) based methods in cases where compounds are not well covered by a training set.</p><h3>Results</h3><p>Beyond similarity, the performance of ML methods could be associated with individual features in compounds. A combination of both could be used as a procedure to split input data sets into those that could be predicted by ML and those that required QM processing. We could show on a proprietary fluorinated fragment library, known as LEF (Local Environment of Fluorine), and a public Enamine data set of 19F NMR chemical shifts that ML and QM methods could synergize to outperform either method individually. Models built on Enamine data, as well as model building and QM workflow tools, can be found at https://github.com/PatrickPenner/lefshift and https://github.com/PatrickPenner/lefqm.</p></div></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138570832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Open-ComBind: harnessing unlabeled data for improved binding pose prediction","authors":"Andrew T. McNutt, David Ryan Koes","doi":"10.1007/s10822-023-00544-y","DOIUrl":"10.1007/s10822-023-00544-y","url":null,"abstract":"<div><p>Determination of the bound pose of a ligand is a critical first step in many in silico drug discovery tasks. Molecular docking is the main tool for the prediction of non-covalent binding of a protein and ligand system. Molecular docking pipelines often only utilize the information of one ligand binding to the protein despite the commonly held hypothesis that different ligands share binding interactions when bound to the same receptor. Here we describe Open-ComBind, an easy-to-use, open-source version of the ComBind molecular docking pipeline that leverages information from multiple ligands without known bound structures to enhance pose selection. We first create distributions of feature similarities between ligand pose pairs, comparing near-native poses with all sampled docked poses. These distributions capture the likelihood of observing similar features, such as hydrogen bonds or hydrophobic contacts, in different pose configurations. These similarity distributions are then combined with a per-ligand docking score to enhance overall pose selection by 5% and 4.5% for high-affinity and congeneric series helper ligands, respectively. Open-ComBind reduces the average RMSD of ligands in our benchmark dataset by 9.0%. We provide Open-ComBind as an easy-to-use command line and Python API to increase pose prediction performance at www.github.com/drewnutt/open_combind.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-023-00544-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138552568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Binding of small molecule inhibitors to RNA polymerase-Spt5 complex impacts RNA and DNA stability","authors":"Adan Gallardo, Bercem Dutagaci","doi":"10.1007/s10822-023-00543-z","DOIUrl":"10.1007/s10822-023-00543-z","url":null,"abstract":"<div><p>Spt5 is an elongation factor that associates with RNA polymerase II (Pol II) during transcription and has important functions in promoter-proximal pausing and elongation processivity. Spt5 was also recognized for its roles in the transcription of expanded-repeat genes that are related to neurodegenerative diseases. Recently, a set of Spt5-Pol II small molecule inhibitors (SPIs) were reported, which selectively inhibit mutant huntingtin gene transcription. Inhibition mechanisms as well as interaction sites of these SPIs with Pol II and Spt5 are not entirely known. In this study, we predicted the binding sites of three selected SPIs at the Pol II-Spt5 interface by docking and molecular dynamics simulations. Two molecules out of three demonstrated strong binding with Spt5 and Pol II, while the other molecule was more loosely bound and sampled multiple binding sites. Strongly bound SPIs indirectly affected RNA and DNA dynamics at the exit site as DNA became more flexible while RNA was stabilized by increased interactions with Spt5. Our results suggest that the transcription inhibition mechanism induced by SPIs can be related to Spt5-nucleic acid interactions, which were altered to some extent with strong binding of SPIs.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138175193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sadra Kashef Ol Gheta, Anne Bonin, Thomas Gerlach, Andreas H. Göller
{"title":"Predicting absolute aqueous solubility by applying a machine learning model for an artificially liquid-state as proxy for the solid-state","authors":"Sadra Kashef Ol Gheta, Anne Bonin, Thomas Gerlach, Andreas H. Göller","doi":"10.1007/s10822-023-00538-w","DOIUrl":"10.1007/s10822-023-00538-w","url":null,"abstract":"<div><p>In this study, we use machine learning algorithms with QM-derived COSMO-RS descriptors, along with Morgan fingerprints, to predict the absolute solubility of drug-like compounds. The QM-derived descriptors account for the molecular properties of the solute, i.e., the solute–solute interactions in an artificial-liquid-state (super-cooled liquid), and the solute–solvent interactions in solution. We employ two main approaches to predict solubility: (i) a hypothetical pathway that involves melting the solute at room temperature T = T¯ (<span>({Delta }_{fus}{G}_{A}^{ominus })</span>) and mixing the artificially liquid solute into the solvent (<span>({Delta }_{m}{G}_{left(A:Bright)}^{ominus })</span>). In this approach <span>({Delta }_{fus}{G}_{A}^{ominus })</span> is predicted using machine learning models, and the <span>({Delta }_{m}{G}_{left(A:Bright)}^{ominus })</span> is obtained from COSMO-RS calculations; (ii) direct solubility prediction using machine learning algorithms. The models were trained on a large number of Bayer in-house compounds for which water solubility data is available at physiological pH of 6.5 and ambient temperature. We also evaluated our models using external datasets from a solubility challenge. Our models present great improvements compared to the absolute solubility prediction with the QSAR model for the artificial liquid state as implemented in the COSMO<i>therm</i> software, for both in-house and external datasets. We are furthermore able to demonstrate the superiority of QM-derived descriptors compared to cheminformatics descriptors. We finally present low-cost alternative models using fragment-based COSMO<i>quick</i> calculations with only marginal reduction in the quality of predicted solubility.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"37 12","pages":"765 - 789"},"PeriodicalIF":3.5,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50156780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nour Jamal Jaradat, Mamon Hatmal, Dana Alqudah, Mutasem Omar Taha
{"title":"Correction to: Computational workflow for discovering small molecular binders for shallow binding sites by integrating molecular dynamics simulation, pharmacophore modeling, and machine learning: STAT3 as case study","authors":"Nour Jamal Jaradat, Mamon Hatmal, Dana Alqudah, Mutasem Omar Taha","doi":"10.1007/s10822-023-00540-2","DOIUrl":"10.1007/s10822-023-00540-2","url":null,"abstract":"","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"37 12","pages":"679 - 679"},"PeriodicalIF":3.5,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49673023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tiago O. Pereira, Maryam Abbasi, Rita I. Oliveira, Romina A. Guedes, Jorge A. R. Salvador, Joel P. Arrais
{"title":"Artificial intelligence for prediction of biological activities and generation of molecular hits using stereochemical information","authors":"Tiago O. Pereira, Maryam Abbasi, Rita I. Oliveira, Romina A. Guedes, Jorge A. R. Salvador, Joel P. Arrais","doi":"10.1007/s10822-023-00539-9","DOIUrl":"10.1007/s10822-023-00539-9","url":null,"abstract":"<div><p>In this work, we develop a method for generating targeted hit compounds by applying deep reinforcement learning and attention mechanisms to predict binding affinity against a biological target while considering stereochemical information. The novelty of this work is a deep model Predictor that can establish the relationship between chemical structures and their corresponding <span>(pIC_{50})</span> values. We thoroughly study the effect of different molecular descriptors such as ECFP4, ECFP6, SMILES and RDKFingerprint. Also, we demonstrated the importance of attention mechanisms to capture long-range dependencies in molecular sequences. Due to the importance of stereochemical information for the binding mechanism, this information was employed both in the prediction and generation processes. To identify the most promising hits, we apply the self-adaptive multi-objective optimization strategy. Moreover, to ensure the existence of stereochemical information, we consider all the possible enumerated stereoisomers to provide the most appropriate 3D structures. We evaluated this approach against the Ubiquitin-Specific Protease 7 (USP7) by generating putative inhibitors for this target. The predictor with SMILES notations as descriptor plus bidirectional recurrent neural network using attention mechanism has the best performance. Additionally, our methodology identify the regions of the generated molecules that are important for the interaction with the receptor’s active site. Also, the obtained results demonstrate that it is possible to discover synthesizable molecules with high biological affinity for the target, containing the indication of their optimal stereochemical conformation.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"37 12","pages":"791 - 806"},"PeriodicalIF":3.5,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41231479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gabriel Corrêa Veríssimo, Simone Queiroz Pantaleão, Philipe de Olveira Fernandes, Jadson Castro Gertrudes, Thales Kronenberger, Kathia Maria Honorio, Vinícius Gonçalves Maltarollo
{"title":"MASSA Algorithm: an automated rational sampling of training and test subsets for QSAR modeling","authors":"Gabriel Corrêa Veríssimo, Simone Queiroz Pantaleão, Philipe de Olveira Fernandes, Jadson Castro Gertrudes, Thales Kronenberger, Kathia Maria Honorio, Vinícius Gonçalves Maltarollo","doi":"10.1007/s10822-023-00536-y","DOIUrl":"10.1007/s10822-023-00536-y","url":null,"abstract":"<div><p>QSAR models capable of predicting biological, toxicity, and pharmacokinetic properties were widely used to search lead bioactive molecules in chemical databases. The dataset’s preparation to build these models has a strong influence on the quality of the generated models, and sampling requires that the original dataset be divided into training (for model training) and test (for statistical evaluation) sets. This sampling can be done randomly or rationally, but the rational division is superior. In this paper, we present MASSA, a Python tool that can be used to automatically sample datasets by exploring the biological, physicochemical, and structural spaces of molecules using PCA, HCA, and K-modes. The proposed algorithm is very useful when the variables used for QSAR are not available or to construct multiple QSAR models with the same training and test sets, producing models with lower variability and better values for validation metrics. These results were obtained even when the descriptors used in the QSAR/QSPR were different from those used in the separation of training and test sets, indicating that this tool can be used to build models for more than one QSAR/QSPR technique. Finally, this tool also generates useful graphical representations that can provide insights into the data.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"37 12","pages":"735 - 754"},"PeriodicalIF":3.5,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41094801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Martina Hrast Rambaher, Irena Zdovc, Nina Kočevar Glavač, Stanislav Gobec, Rok Frlan
{"title":"Mur ligase F as a new target for the flavonoids quercitrin, myricetin, and (–)-epicatechin","authors":"Martina Hrast Rambaher, Irena Zdovc, Nina Kočevar Glavač, Stanislav Gobec, Rok Frlan","doi":"10.1007/s10822-023-00535-z","DOIUrl":"10.1007/s10822-023-00535-z","url":null,"abstract":"<div><p>MurC, D, E, and F are ATP-dependent ligases involved in the stepwise assembly of the tetrapeptide stem of forming peptidoglycan. As highly conserved targets found exclusively in bacterial cells, they are of significant interest for antibacterial drug discovery. In this study, we employed a computer-aided molecular design approach to identify potential inhibitors of MurF. A biochemical inhibition assay was conducted, screening twenty-four flavonoids and related compounds against MurC-F, resulting in the identification of quercitrin, myricetin, and (–)-epicatechin as MurF inhibitors with IC<sub>50</sub> values of 143 µM, 139 µM, and 92 µM, respectively. Notably, (–)-epicatechin demonstrated mixed type inhibition with ATP and uncompetitive inhibition with <span>d</span>-Ala-<span>d</span>-Ala dipeptide and UM3DAP substrates. Furthermore, <i>in silico</i> analysis using Sitemap and subsequent docking analysis using Glide revealed two plausible binding sites for (–)-epicatechin. The study also investigated the crucial structural features required for activity, with a particular focus on the substitution pattern and hydroxyl group positions, which were found to be important for the activity. The study highlights the significance of computational approaches in targeting essential enzymes involved in bacterial peptidoglycan synthesis.</p><h3>Graphical abstract</h3>\u0000 <div><figure><div><div><picture><source><img></source></picture></div></div></figure></div>\u0000 </div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"37 12","pages":"721 - 733"},"PeriodicalIF":3.5,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41097969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}