Abdallah Abou Hajal, Richard A Bryce, Boulbaba Ben Amor, Noor Atatreh, Mohammad A Ghattas
{"title":"Boosting the Accuracy and Chemical Space Coverage of the Detection of Small Colloidal Aggregating Molecules Using the BAD Molecule Filter.","authors":"Abdallah Abou Hajal, Richard A Bryce, Boulbaba Ben Amor, Noor Atatreh, Mohammad A Ghattas","doi":"10.1021/acs.jcim.4c00363","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00363","url":null,"abstract":"<p><p>The ability to conduct effective high throughput screening (HTS) campaigns in drug discovery is often hampered by the detection of false positives in these assays due to small colloidally aggregating molecules (SCAMs). SCAMs can produce artifactual hits in HTS by nonspecific inhibition of the protein target. In this work, we present a new computational prediction tool for detecting SCAMs based on their 2D chemical structure. The tool, called the boosted aggregation detection (BAD) molecule filter, employs decision tree ensemble methods, namely, the CatBoost classifier and the light gradient-boosting machine, to significantly improve the detection of SCAMs. In developing the filter, we explore models trained on individual data sets, a consensus approach using these models, and, third, a merged data set approach, each tailored for specific drug discovery needs. The individual data set method emerged as most effective, achieving 93% sensitivity and 90% specificity, outperforming existing state-of-the-art models by 20 and 5%, respectively. The consensus models offer broader chemical space coverage, exceeding 90% for all testing sets. This feature is an important aspect particularly for early stage medicinal chemistry projects, and provides information on applicability domain. Meanwhile, the merged data set models demonstrated robust performance, with a notable sensitivity of 79% in the comprehensive 10-fold cross-validation test set. A SHAP analysis of model features indicates the importance of hydrophobicity and molecular complexity as primary factors influencing the aggregation propensity. The BAD molecule filter is readily accessible for the public usage on https://molmodlab-aau.com/Tools.html. This filter provides a new, more robust tool for aggregate prediction in the early stages of drug discovery to optimize hit rates and reduce associated testing and validation overheads.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"TransfIGN: A Structure-Based Deep Learning Method for Modeling the Interaction between HLA-A*02:01 and Antigen Peptides.","authors":"Nanqi Hong, Dejun Jiang, Zhe Wang, Huiyong Sun, Hao Luo, Lingjie Bao, Mingli Song, Yu Kang, Tingjun Hou","doi":"10.1021/acs.jcim.4c00678","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00678","url":null,"abstract":"<p><p>The intricate interaction between major histocompatibility complexes (MHCs) and antigen peptides with diverse amino acid sequences plays a pivotal role in immune responses and T cell activity. In recent years, deep learning (DL)-based models have emerged as promising tools for accelerating antigen peptide screening. However, most of these models solely rely on one-dimensional amino acid sequences, overlooking crucial information required for the three-dimensional (3-D) space binding process. In this study, we propose TransfIGN, a structure-based DL model that is inspired by our previously developed framework, Interaction Graph Network (IGN), and incorporates sequence information from transformers to predict the interactions between HLA-A*02:01 and antigen peptides. Our model, trained on a comprehensive data set containing 61,816 sequences with 9051 binding affinity labels and 56,848 eluted ligand labels, achieves an area under the curve (AUC) of 0.893 on the binary data set, better than state-of-the-art sequence-based models trained on larger data sets such as NetMHCpan4.1, ANN, and TransPHLA. Furthermore, when evaluated on the IEDB weekly benchmark data sets, our predictions (AUC = 0.816) are better than those of the recommended methods like the IEDB consensus (AUC = 0.795). Notably, the interaction weight matrices generated by our method highlight the strong interactions at specific positions within peptides, emphasizing the model's ability to provide physical interpretability. This capability to unveil binding mechanisms through intricate structural features holds promise for new immunotherapeutic avenues.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Petra Kührová, Vojtěch Mlýnský, Michal Otyepka, Jiří Šponer, Pavel Banáš
{"title":"Correction to \"Sensitivity of the RNA Structure to Ion Conditions as Probed by Molecular Dynamics Simulations of Common Canonical RNA Duplexes\".","authors":"Petra Kührová, Vojtěch Mlýnský, Michal Otyepka, Jiří Šponer, Pavel Banáš","doi":"10.1021/acs.jcim.4c01067","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c01067","url":null,"abstract":"","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141453732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DrugFlow: An AI-Driven One-Stop Platform for Innovative Drug Discovery.","authors":"Chao Shen, Jianfei Song, Chang-Yu Hsieh, Dongsheng Cao, Yu Kang, Wenling Ye, Zhenxing Wu, Jike Wang, Odin Zhang, Xujun Zhang, Hao Zeng, Heng Cai, Yu Chen, Linkang Chen, Hao Luo, Xinda Zhao, Tianye Jian, Tong Chen, Dejun Jiang, Mingyang Wang, Qing Ye, Jialu Wu, Hongyan Du, Hui Shi, Yafeng Deng, Tingjun Hou","doi":"10.1021/acs.jcim.4c00621","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00621","url":null,"abstract":"<p><p>Artificial intelligence (AI)-aided drug design has demonstrated unprecedented effects on modern drug discovery, but there is still an urgent need for user-friendly interfaces that bridge the gap between these sophisticated tools and scientists, particularly those who are less computer savvy. Herein, we present DrugFlow, an AI-driven one-stop platform that offers a clean, convenient, and cloud-based interface to streamline early drug discovery workflows. By seamlessly integrating a range of innovative AI algorithms, covering molecular docking, quantitative structure-activity relationship modeling, molecular generation, ADMET (absorption, distribution, metabolism, excretion and toxicity) prediction, and virtual screening, DrugFlow can offer effective AI solutions for almost all crucial stages in early drug discovery, including hit identification and hit/lead optimization. We hope that the platform can provide sufficiently valuable guidance to aid real-word drug design and discovery. The platform is available at https://drugflow.com.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Computational Investigation of Coaggregation and Cross-Seeding between Aβ and hIAPP Underpinning the Cross-Talk in Alzheimer's Disease and Type 2 Diabetes.","authors":"Xinjie Fan, Xiaohan Zhang, Jiajia Yan, Huan Xu, Wenhui Zhao, Feng Ding, Fengjuan Huang, Yunxiang Sun","doi":"10.1021/acs.jcim.4c00859","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00859","url":null,"abstract":"<p><p>The coexistence of amyloid-β (Aβ) and human islet amyloid polypeptide (hIAPP) in the brain and pancreas is associated with an increased risk of Alzheimer's disease (AD) and type 2 diabetes (T2D) due to their coaggregation and cross-seeding. Despite this, the molecular mechanisms underlying their interaction remain elusive. Here, we systematically investigated the cross-talk between Aβ and hIAPP using atomistic discrete molecular dynamics (DMD) simulations. Our results revealed that the amyloidogenic core regions of both Aβ (Aβ<sub>10-21</sub> and Aβ<sub>30-41</sub>) and hIAPP (hIAPP<sub>8-20</sub> and hIAPP<sub>22-29</sub>), driving their self-aggregation, also exhibited a strong tendency for cross-interaction. This propensity led to the formation of β-sheet-rich heterocomplexes, including potentially toxic β-barrel oligomers. The formation of Aβ and hIAPP heteroaggregates did not impede the recruitment of additional peptides to grow into larger aggregates. Our cross-seeding simulations demonstrated that both Aβ and hIAPP fibrils could mutually act as seeds, assisting each other's monomers in converting into β-sheets at the exposed fibril elongation ends. The amyloidogenic core regions of Aβ and hIAPP, in both oligomeric and fibrillar states, exhibited the ability to recruit isolated peptides, thereby extending the β-sheet edges, with limited sensitivity to the amino acid sequence. These findings suggest that targeting these regions by capping them with amyloid-resistant peptide drugs may hold potential as a therapeutic approach for addressing AD, T2D, and their copathologies.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Josh Kemppainen, Jacob R Gissinger, S Gowtham, Gregory M Odegard
{"title":"LUNAR: Automated Input Generation and Analysis for Reactive LAMMPS Simulations.","authors":"Josh Kemppainen, Jacob R Gissinger, S Gowtham, Gregory M Odegard","doi":"10.1021/acs.jcim.4c00730","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00730","url":null,"abstract":"<p><p>Generating simulation-ready molecular models for the LAMMPS molecular dynamics (MD) simulation software package is a difficult task and impedes the more widespread and efficient use of MD in materials design and development. Fixed-bond force fields generally require manual assignment of atom types, bonded interactions, charges, and simulation domain sizes. A new LAMMPS pre- and postprocessing toolkit (LUNAR) is presented that efficiently builds molecular systems for LAMMPS. LUNAR automatically assigns atom types, generates bonded interactions, assigns charges, and provides initial configuration methods to generate large molecular systems. LUNAR can also incorporate chemical reactivity into simulations by facilitating the use of the REACTER protocol. Additionally, LUNAR provides postprocessing for free volume calculations, cure characterization calculations, and property predictions from LAMMPS thermodynamic outputs. LUNAR has been validated via building and simulation of pure epoxy and cyanate ester polymer systems with a comparison of the corresponding predicted structures and properties to benchmark values, including experimental results from the literature. LUNAR provides the tools for the computationally driven development of next-generation composite materials in the Integrated Computational Materials Engineering (ICME) and Materials Genome Initiative (MGI) frameworks. LUNAR is written in Python with the usage of NumPy and can be used via a graphical user interface, a command line interface, or an integrated design environment. LUNAR is freely available via GitHub.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141453733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classifying and Predicting the Thermal Expansion Properties of Metal-Organic Frameworks: A Data-Driven Approach.","authors":"Yifei Yue, Saad Aldin Mohamed, Jianwen Jiang","doi":"10.1021/acs.jcim.4c00057","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00057","url":null,"abstract":"<p><p>Metal-organic frameworks (MOFs) are versatile materials for a wide variety of potential applications. Tunable thermal expansion properties promote the application of MOFs in thermally sensitive composite materials; however, they are currently available only in a handful of structures. Herein, we report the first data set for thermal expansion properties of 33,131 diverse MOFs generated from molecular simulations and subsequently develop machine learning (ML) models to (1) classify different thermal expansion behaviors and (2) predict volumetric thermal expansion coefficients (α<sub>V</sub>). The random forest model trained on hybrid descriptors combining geometric, chemical, and topological features exhibits the best performance among different ML models. Based on feature importance analysis, linker chemistry and topological arrangement are revealed to have a dominant impact on thermal expansion. Furthermore, we identify common building blocks in MOFs with exceptional thermal expansion properties. This data-driven study is the first of its kind, not only constructing a useful data set to facilitate future studies on this important topic but also providing design guidelines for advancing new MOFs with desired thermal expansion properties.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dissecting the Role of the Hydroxyl Moiety at C14 in (+)-Opioid-Based TLR4 Antagonists via Wet-Lab Experiments and Molecular Dynamics Simulations.","authors":"Jingwei Gao, Cong Zhang, Hangyu Xu, Tianshu Zhang, Hongshuang Wang, Yibo Wang, Xiaohui Wang","doi":"10.1021/acs.jcim.4c00692","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00692","url":null,"abstract":"<p><p>Toll-like receptor 4 (TLR4) is pivotal as an innate immune receptor, playing a critical role in mediating neuropathic pain and drug addiction through its regulation of the neuroinflammatory response. The nonclassical (+)-opioid isomers represent a unique subset of TLR4 antagonists known for their effective blood-brain barrier permeability. Despite growing interest in the structure-activity relationship of these (+)-opioid-based TLR4 antagonists, the specific impact of heteroatoms on their TLR4 antagonistic activities has not been fully explored. This study investigated the influence of the hydroxyl group at C14 in six (+)-opioid TLR4 antagonists (<b>1</b>-<b>6</b>) using wet-lab experiments and <i>in silico</i> simulations. The corresponding C14-deoxy derivatives (<b>7</b>-<b>12</b>) were synthesized, and upon comparison with their corresponding counterparts (<b>1</b>-<b>6</b>), it was discovered that their TLR4 antagonistic activities were significantly diminished. Molecular dynamics simulations showed that the (+)-opioid TLR4 antagonists (<b>1</b>-<b>6</b>) possessed more negative binding free energies to the TLR4 coreceptor MD2, which was responsible for ligand recognition. This was primarily attributed to the formation of a hydrogen bond between the hydroxyl group at the C-14 position of the antagonists (<b>1</b>-<b>6</b>) and the R90 residue of MD2 during the binding process. Such an interaction facilitated the entry and subsequent binding of these molecules within the MD2 cavity. In contrast, the C14-deoxy derivatives (<b>7</b>-<b>12</b>), lacking the hydroxyl group at the C-14 position, missed this crucial hydrogen bond interaction with the R90 residue of MD2, leading to their egression from the MD2 cavity during simulations. This study underscores the significant role of the C14 hydroxyl moiety in enhancing the effectiveness of (+)-opioid TLR4 antagonists, which provides insightful guidance for designing future (+)-isomer opioid-derived TLR4 antagonists.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Subpocket Similarity-Based Hit Identification for Challenging Targets: Application to the WDR Domain of LRRK2.","authors":"Merveille Eguida, Guillaume Bret, François Sindt, Fengling Li, Irene Chau, Suzanne Ackloo, Cheryl Arrowsmith, Albina Bolotokova, Pegah Ghiabi, Elisa Gibson, Levon Halabelian, Scott Houliston, Rachel J Harding, Ashley Hutchinson, Peter Loppnau, Sumera Perveen, Almagul Seitova, Hong Zeng, Matthieu Schapira, Didier Rognan","doi":"10.1021/acs.jcim.4c00601","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00601","url":null,"abstract":"<p><p>We herewith applied <i>a priori</i> a generic hit identification method (POEM) for difficult targets of known three-dimensional structure, relying on the simple knowledge of physicochemical and topological properties of a user-selected cavity. Searching for local similarity to a set of fragment-bound protein microenvironments of known structure, a point cloud registration algorithm is first applied to align known subpockets to the target cavity. The resulting alignment then permits us to directly pose the corresponding seed fragments in a target cavity space not typically amenable to classical docking approaches. Last, linking potentially connectable atoms by a deep generative linker enables full ligand enumeration. When applied to the WD40 repeat (WDR) central cavity of leucine-rich repeat kinase 2 (LRRK2), an unprecedented binding site, POEM was able to quickly propose 94 potential hits, five of which were subsequently confirmed to bind <i>in vitro</i> to LRRK2-WDR.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141445566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Magnus Lundborg, Christian Wennberg, Erik Lindahl, Lars Norlén
{"title":"Simulating the Skin Permeation Process of Ionizable Molecules.","authors":"Magnus Lundborg, Christian Wennberg, Erik Lindahl, Lars Norlén","doi":"10.1021/acs.jcim.4c00722","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c00722","url":null,"abstract":"<p><p>It is commonly assumed that ionizable molecules, such as drugs, permeate through the skin barrier in their neutral form. By using molecular dynamics simulations of the charged and neutral states separately, we can study the dynamic protonation behavior during the permeation process. We have studied three weak acids and three weak bases and conclude that the acids are ionized to a larger extent than the bases, when passing through the headgroup region of the lipid barrier structure, at pH values close to their p<i>K</i><sub>a</sub>. It can also be observed that even if these dynamic protonation simulations are informative, in the cases studied herein they are not necessary for the calculation of permeability coefficients. It is sufficient to base the calculations only on the neutral form, as is commonly done.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141449042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}