Generative adversarial network (GAN) model-based design of potent SARS-CoV-2 Mpro inhibitors using the electron density of ligands and 3D binding pockets: insights from molecular docking, dynamics simulation, and MM-GBSA analysis.
Authors: Annesha Chakraborty, Vignesh Krishnan, Subbiah Thamotharan
Journal: Molecular Diversity (impact factor 3.9; JCR Q2, Chemistry, Applied)
Published: 2024-11-30
DOI: https://doi.org/10.1007/s11030-024-11047-9
Citations: 0
Abstract
Deep learning-based generative adversarial network (GAN) frameworks have recently been developed to expedite the drug discovery process. These models generate novel molecules from scratch and validate them through molecular docking simulations to identify the most promising candidates for a given drug target. In this study, the SARS-CoV-2 main protease (Mpro) was selected as the drug target. Two distinct GAN algorithms were employed to generate novel small molecules. The first (ED-based) approach was trained on experimental electron density data of ligands to generate drug-like molecules, while the second leveraged the target binding pocket to capture the spatial and bonding relationships between atoms within the pocket. The ED-based approach generated approximately 26,000 molecules, whereas the binding pocket-based method produced around 100 molecules. The generated molecules were then ranked by molecular docking using the Glide XP score (both flexible and rigid docking) and AutoDock Vina. To provide a reference for identifying the most potent GAN-derived molecules, molecular docking was also performed on co-crystallized inhibitors of Mpro. The six most promising molecules from these GAN approaches were further evaluated for stability, interactions, and MM-GBSA binding free energy through molecular dynamics simulations. This analysis led to the identification of four potent Mpro inhibitor molecules, all featuring a 2-benzyl-6-bromophenol scaffold. The binding free energies of these compounds were compared with those of other Mpro inhibitors, revealing that our compounds demonstrated better affinity for Mpro than some broad-spectrum protease inhibitors. The dynamic cross-correlation matrix plot indicated strongly correlated and anti-correlated regions, potentially linked to ligand binding.
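The abstract does not say how the dynamic cross-correlation matrix was computed. As a minimal sketch, the standard definition, C_ij = <Δr_i · Δr_j> / sqrt(<Δr_i²><Δr_j²>), can be written with NumPy as below. The function name `dccm`, the synthetic trajectory, and the assumption that frames are already aligned to a common reference (for example on C-alpha atoms) are illustrative choices, not details taken from the paper.

```python
import numpy as np

def dccm(coords):
    """Dynamic cross-correlation matrix (hypothetical helper, not from the paper).

    coords: array of shape (n_frames, n_atoms, 3), assumed to be already
    superimposed on a reference structure.
    Returns an (n_atoms, n_atoms) matrix with values in [-1, 1].
    """
    coords = np.asarray(coords, dtype=float)
    # Displacement of every atom from its time-averaged position.
    disp = coords - coords.mean(axis=0)
    # Time average of the dot product of displacement vectors: <dr_i . dr_j>.
    cov = np.einsum("fik,fjk->ij", disp, disp) / coords.shape[0]
    # Normalise by the root-mean-square fluctuation of each atom.
    # Note: an atom that never moves would give a zero here and yield NaN.
    rmsf = np.sqrt(np.diag(cov))
    return cov / np.outer(rmsf, rmsf)

# Synthetic three-atom trajectory (illustration only, not real simulation data):
# atom 1 moves in lockstep with atom 0, atom 2 moves in the opposite direction.
rng = np.random.default_rng(0)
step = rng.normal(size=(200, 3))
traj = np.stack([step, step + 5.0, -step], axis=1)  # shape (200, 3, 3)
c = dccm(traj)
print(np.round(c, 2))
```

In such a matrix, entries near +1 mark atoms that move together and entries near -1 mark atoms that move in opposite directions, which is the kind of correlated and anti-correlated pattern the abstract refers to.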
About the journal:
Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including:
combinatorial chemistry and parallel synthesis;
small molecule libraries;
microwave synthesis;
flow synthesis;
fluorous synthesis;
diversity oriented synthesis (DOS);
nanoreactors;
click chemistry;
multiplex technologies;
fragment- and ligand-based design;
structure/function/SAR;
computational chemistry and molecular design;
chemoinformatics;
screening techniques and screening interfaces;
analytical and purification methods;
robotics, automation and miniaturization;
targeted libraries;
display libraries;
peptides and peptoids;
proteins;
oligonucleotides;
carbohydrates;
natural diversity;
new methods of library formulation and deconvolution;
directed evolution, origin of life and recombination;
search techniques, landscapes, random chemistry and more.