Donya Ohadi, Kiran Kumar, Suchitra Ravula, Renee L DesJarlais, Mark J Seierstad, Amy Y Shih, Michael D Hack, Jamie M Schiffer
{"title":"输入姿势是自由能扰动性能的关键:以单酰甘油脂肪酶为基准。","authors":"Donya Ohadi, Kiran Kumar, Suchitra Ravula, Renee L DesJarlais, Mark J Seierstad, Amy Y Shih, Michael D Hack, Jamie M Schiffer","doi":"10.1021/acs.jcim.4c01223","DOIUrl":null,"url":null,"abstract":"<p><p>Free energy perturbation (FEP) methodologies have become commonplace methods for modeling potency in hit-to-lead and lead optimization stages of drug discovery. The conformational states of the initial poses of compounds for FEP+ calculations are often set up by alignment to a cocrystal structure ligand, but it is not clear if this method provides the best result for all proteins or all ligands. Not only are ligand conformational states potential variables in modeling compound potency in FEP but also the selection of crystallographic water molecules for inclusion in the FEP input structures can impact FEP models. Here, we report the results of FEP calculations using FEP+ from Schrödinger and starting from maximum common substructure alignment and docked poses generated with an array of docking methodologies. As a benchmark data set, we use monoacylglycerol lipase (MAGL), an important clinical drug target in cancer malignancy, neurological diseases, and metabolic disorders, and a set of 17 MAGL inhibitors. We found a large variation among FEP+ correlations to experimental IC<sub>50</sub> values depending on the method used to generate the input pose and that the inclusion of ligand-based information in the docking process, with some methods, increases the correlation between FEP+ free energies and IC<sub>50</sub> values. Upon analysis of the initial poses, we found that the differences in FEP+ correlations stemmed from rotation around a tertiary amide bond as well as translation of the compound toward the more hydrophobic side of the MAGL pocket. FEP+ estimation improved across all pose modeling methods when hydrogen bond constraint information was added. However, simple maximum common substructure alignment in the presence of all crystallographic water molecules outperformed all other methods in correlation between estimated and experimental IC<sub>50</sub> values. Taken together, these findings suggest that pose selection and crystallographic water inclusion greatly impact how well FEP+ estimated IC<sub>50</sub> values align with experimental IC<sub>50</sub> values and that modelers should benchmark a few different pose generation methodologies and different water inclusion strategies for their hit-to-lead and lead optimization drug discovery projects.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"8859-8869"},"PeriodicalIF":5.6000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Input Pose is Key to Performance of Free Energy Perturbation: Benchmarking with Monoacylglycerol Lipase.\",\"authors\":\"Donya Ohadi, Kiran Kumar, Suchitra Ravula, Renee L DesJarlais, Mark J Seierstad, Amy Y Shih, Michael D Hack, Jamie M Schiffer\",\"doi\":\"10.1021/acs.jcim.4c01223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Free energy perturbation (FEP) methodologies have become commonplace methods for modeling potency in hit-to-lead and lead optimization stages of drug discovery. The conformational states of the initial poses of compounds for FEP+ calculations are often set up by alignment to a cocrystal structure ligand, but it is not clear if this method provides the best result for all proteins or all ligands. Not only are ligand conformational states potential variables in modeling compound potency in FEP but also the selection of crystallographic water molecules for inclusion in the FEP input structures can impact FEP models. Here, we report the results of FEP calculations using FEP+ from Schrödinger and starting from maximum common substructure alignment and docked poses generated with an array of docking methodologies. As a benchmark data set, we use monoacylglycerol lipase (MAGL), an important clinical drug target in cancer malignancy, neurological diseases, and metabolic disorders, and a set of 17 MAGL inhibitors. We found a large variation among FEP+ correlations to experimental IC<sub>50</sub> values depending on the method used to generate the input pose and that the inclusion of ligand-based information in the docking process, with some methods, increases the correlation between FEP+ free energies and IC<sub>50</sub> values. Upon analysis of the initial poses, we found that the differences in FEP+ correlations stemmed from rotation around a tertiary amide bond as well as translation of the compound toward the more hydrophobic side of the MAGL pocket. FEP+ estimation improved across all pose modeling methods when hydrogen bond constraint information was added. However, simple maximum common substructure alignment in the presence of all crystallographic water molecules outperformed all other methods in correlation between estimated and experimental IC<sub>50</sub> values. Taken together, these findings suggest that pose selection and crystallographic water inclusion greatly impact how well FEP+ estimated IC<sub>50</sub> values align with experimental IC<sub>50</sub> values and that modelers should benchmark a few different pose generation methodologies and different water inclusion strategies for their hit-to-lead and lead optimization drug discovery projects.</p>\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\" \",\"pages\":\"8859-8869\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2024-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.4c01223\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/19 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01223","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
Input Pose is Key to Performance of Free Energy Perturbation: Benchmarking with Monoacylglycerol Lipase.
Free energy perturbation (FEP) methodologies have become commonplace methods for modeling potency in hit-to-lead and lead optimization stages of drug discovery. The conformational states of the initial poses of compounds for FEP+ calculations are often set up by alignment to a cocrystal structure ligand, but it is not clear if this method provides the best result for all proteins or all ligands. Not only are ligand conformational states potential variables in modeling compound potency in FEP but also the selection of crystallographic water molecules for inclusion in the FEP input structures can impact FEP models. Here, we report the results of FEP calculations using FEP+ from Schrödinger and starting from maximum common substructure alignment and docked poses generated with an array of docking methodologies. As a benchmark data set, we use monoacylglycerol lipase (MAGL), an important clinical drug target in cancer malignancy, neurological diseases, and metabolic disorders, and a set of 17 MAGL inhibitors. We found a large variation among FEP+ correlations to experimental IC50 values depending on the method used to generate the input pose and that the inclusion of ligand-based information in the docking process, with some methods, increases the correlation between FEP+ free energies and IC50 values. Upon analysis of the initial poses, we found that the differences in FEP+ correlations stemmed from rotation around a tertiary amide bond as well as translation of the compound toward the more hydrophobic side of the MAGL pocket. FEP+ estimation improved across all pose modeling methods when hydrogen bond constraint information was added. However, simple maximum common substructure alignment in the presence of all crystallographic water molecules outperformed all other methods in correlation between estimated and experimental IC50 values. Taken together, these findings suggest that pose selection and crystallographic water inclusion greatly impact how well FEP+ estimated IC50 values align with experimental IC50 values and that modelers should benchmark a few different pose generation methodologies and different water inclusion strategies for their hit-to-lead and lead optimization drug discovery projects.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.