Computational Screening Using a Combination of Ligand-Based Machine Learning and Molecular Docking Methods for the Repurposing of Antivirals Targeting the SARS-CoV-2 Main Protease.
Gusti Putu Wahyunanda Crista Yuda, Naufa Hanif, Adam Hermawan
{"title":"Computational Screening Using a Combination of Ligand-Based Machine Learning and Molecular Docking Methods for the Repurposing of Antivirals Targeting the SARS-CoV-2 Main Protease.","authors":"Gusti Putu Wahyunanda Crista Yuda, Naufa Hanif, Adam Hermawan","doi":"10.1007/s40199-023-00484-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>COVID-19 is an infectious disease caused by SARS-CoV-2, a close relative of SARS-CoV. Several studies have searched for COVID-19 therapies. The topics of these works ranged from vaccine discovery to natural products targeting the SARS-CoV-2 main protease (M<sup>pro</sup>), a potential therapeutic target due to its essential role in replication and conserved sequences. However, published research on this target is limited, presenting an opportunity for drug discovery and development.</p><p><strong>Method: </strong>This study aims to repurpose 10692 drugs in DrugBank by using ligand-based virtual screening (LBVS) machine learning (ML) with Konstanz Information Miner (KNIME) to seek potential therapeutics based on M<sup>pro</sup> inhibitors. The top candidate compounds, the native ligand (GC-376) of the M<sup>pro</sup> inhibitor, and the positive control boceprevir were then subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET) characterization, drug-likeness prediction, and molecular docking (MD). Protein-protein interaction (PPI) network analysis was added to provide accurate information about the M<sup>pro</sup> regulatory network.</p><p><strong>Results: </strong>This study identified 3,166 compound candidates inhibiting M<sup>pro</sup>. The random forest (RF) molecular access system ML model provided the highest confidence score of 0.95 (bromo-7-nitroindazole) and identified the top 22 candidate compounds. Subjecting the 22 candidate compounds, the native ligand GC-376, and boceprevir to further ADMET property characterization and drug-likeness predictions revealed that one compound had two violations of Lipinski's rule. Additional MD results showed that only five compounds had more negative binding energies than the native ligand (- 12.25 kcal/mol). Among these compounds, CCX-140 exhibited the lowest score of - 13.64 kcal/mol. Through literature analysis, six compound classes with potential activity for M<sup>pro</sup> were discovered. They included benzopyrazole, azole, pyrazolopyrimidine, carboxylic acids and derivatives, benzene and substituted derivatives, and diazine. Four pathologies were also discovered on the basis of the M<sup>pro</sup> PPI network.</p><p><strong>Conclusion: </strong>Results demonstrated the efficiency of LBVS combined with MD. This combined strategy provided positive evidence showing that the top screened drugs, including CCX-140, which had the lowest MD score, can be reasonably advanced to the in vitro phase. This combined method may accelerate the discovery of therapies for novel or orphan diseases from existing drugs.</p>","PeriodicalId":10888,"journal":{"name":"DARU Journal of Pharmaceutical Sciences","volume":" ","pages":"47-65"},"PeriodicalIF":2.5000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11087449/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DARU Journal of Pharmaceutical Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40199-023-00484-w","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/10/31 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: COVID-19 is an infectious disease caused by SARS-CoV-2, a close relative of SARS-CoV. Several studies have searched for COVID-19 therapies. The topics of these works ranged from vaccine discovery to natural products targeting the SARS-CoV-2 main protease (Mpro), a potential therapeutic target due to its essential role in replication and conserved sequences. However, published research on this target is limited, presenting an opportunity for drug discovery and development.
Method: This study aims to repurpose 10692 drugs in DrugBank by using ligand-based virtual screening (LBVS) machine learning (ML) with Konstanz Information Miner (KNIME) to seek potential therapeutics based on Mpro inhibitors. The top candidate compounds, the native ligand (GC-376) of the Mpro inhibitor, and the positive control boceprevir were then subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET) characterization, drug-likeness prediction, and molecular docking (MD). Protein-protein interaction (PPI) network analysis was added to provide accurate information about the Mpro regulatory network.
Results: This study identified 3,166 compound candidates inhibiting Mpro. The random forest (RF) molecular access system ML model provided the highest confidence score of 0.95 (bromo-7-nitroindazole) and identified the top 22 candidate compounds. Subjecting the 22 candidate compounds, the native ligand GC-376, and boceprevir to further ADMET property characterization and drug-likeness predictions revealed that one compound had two violations of Lipinski's rule. Additional MD results showed that only five compounds had more negative binding energies than the native ligand (- 12.25 kcal/mol). Among these compounds, CCX-140 exhibited the lowest score of - 13.64 kcal/mol. Through literature analysis, six compound classes with potential activity for Mpro were discovered. They included benzopyrazole, azole, pyrazolopyrimidine, carboxylic acids and derivatives, benzene and substituted derivatives, and diazine. Four pathologies were also discovered on the basis of the Mpro PPI network.
Conclusion: Results demonstrated the efficiency of LBVS combined with MD. This combined strategy provided positive evidence showing that the top screened drugs, including CCX-140, which had the lowest MD score, can be reasonably advanced to the in vitro phase. This combined method may accelerate the discovery of therapies for novel or orphan diseases from existing drugs.
期刊介绍:
DARU Journal of Pharmaceutical Sciences is a peer-reviewed journal published on behalf of Tehran University of Medical Sciences. The journal encompasses all fields of the pharmaceutical sciences and presents timely research on all areas of drug conception, design, manufacture, classification and assessment.
The term DARU is derived from the Persian name meaning drug or medicine. This journal is a unique platform to improve the knowledge of researchers and scientists by publishing novel articles including basic and clinical investigations from members of the global scientific community in the forms of original articles, systematic or narrative reviews, meta-analyses, letters, and short communications.