QSARINS Based Computational Identification of Sars-Cov-2 Main Protease Inhibitors

J. Castillo-Garit, Y. Cañizares-Carmenate, H. Pham-The, F. Torrens, F. Pérez-Giménez
{"title":"QSARINS Based Computational Identification of Sars-Cov-2 Main Protease Inhibitors","authors":"J. Castillo-Garit, Y. Cañizares-Carmenate, H. Pham-The, F. Torrens, F. Pérez-Giménez","doi":"10.3390/mol2net-07-11622","DOIUrl":null,"url":null,"abstract":"Abstract. The novel coronavirus SARS-CoV-2 responsible for COVID-19, for which there is no vaccine or any known effective treatment created a sense of urgency for novel drug discovery approaches. One of the most important COVID-19 protein targets is the 3C-like (main) protease for which the crystal structure is known. In this study, we used QSAR methodology to identify compounds with potential inhibition activity for 3C-like protease. First we collect a dataset of 204 compounds, with experimental report of inhibition against SARS-CoV main protease, to develop a predictive model, using Multiple Linear Regression and a Genetic Algorithm for the selection of variables, implemented in the QSARINS software. The model was assessed and validated using the OECDs principles. The best model showed good value for the determination coefficient (R 2 =0.61), and others parameters were appropriate for fitting ( s =0.47 and RMSE tr =0.45). The validation results confirmed that the model has good robustness (Q 2LOO =0.53) and stability (R 2 –Q 2LOO =0.08) with low correlation between the descriptors (K XX =0.41), an excellent predictive power (R 2ext =0.54) and was product of a non-random correlation (R 2Yscr =0.06). This model is employed for the virtual screening of the Drug Bank database and several compounds, which belong to the applicability domain of the models, were identified as potential 3C-like protease inhibitors and proposed to further experiments to corroborate the predicted activity.","PeriodicalId":136053,"journal":{"name":"Proceedings of MOL2NET'21, Conference on Molecular, Biomedical & Computational Sciences and Engineering, 7th ed.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of MOL2NET'21, Conference on Molecular, Biomedical & Computational Sciences and Engineering, 7th ed.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/mol2net-07-11622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract. The novel coronavirus SARS-CoV-2 responsible for COVID-19, for which there is no vaccine or any known effective treatment created a sense of urgency for novel drug discovery approaches. One of the most important COVID-19 protein targets is the 3C-like (main) protease for which the crystal structure is known. In this study, we used QSAR methodology to identify compounds with potential inhibition activity for 3C-like protease. First we collect a dataset of 204 compounds, with experimental report of inhibition against SARS-CoV main protease, to develop a predictive model, using Multiple Linear Regression and a Genetic Algorithm for the selection of variables, implemented in the QSARINS software. The model was assessed and validated using the OECDs principles. The best model showed good value for the determination coefficient (R 2 =0.61), and others parameters were appropriate for fitting ( s =0.47 and RMSE tr =0.45). The validation results confirmed that the model has good robustness (Q 2LOO =0.53) and stability (R 2 –Q 2LOO =0.08) with low correlation between the descriptors (K XX =0.41), an excellent predictive power (R 2ext =0.54) and was product of a non-random correlation (R 2Yscr =0.06). This model is employed for the virtual screening of the Drug Bank database and several compounds, which belong to the applicability domain of the models, were identified as potential 3C-like protease inhibitors and proposed to further experiments to corroborate the predicted activity.
基于QSARINS的Sars-Cov-2主要蛋白酶抑制剂的计算鉴定
摘要导致COVID-19的新型冠状病毒SARS-CoV-2没有疫苗或任何已知的有效治疗方法,这使人们对新药物发现方法产生了紧迫感。最重要的COVID-19蛋白靶点之一是已知晶体结构的3c样(主要)蛋白酶。在这项研究中,我们使用QSAR方法鉴定了具有潜在抑制活性的3c样蛋白酶化合物。首先,我们收集了204个化合物的数据集,并对SARS-CoV主要蛋白酶进行了抑制实验报告,利用多元线性回归和遗传算法选择变量,建立了预测模型,并在QSARINS软件中实现。使用经合组织的原则对该模型进行了评估和验证。最佳模型的决定系数值较好(r2 =0.61),其他参数的拟合效果较好(s =0.47, RMSE tr =0.45)。验证结果表明,该模型具有较好的鲁棒性(Q 2LOO =0.53)和稳定性(R 2 -Q 2LOO =0.08),描述符之间的相关性较低(K XX =0.41),预测能力较好(R 2ext =0.54),是非随机相关的产物(R 2Yscr =0.06)。利用该模型对Drug Bank数据库进行虚拟筛选,确定了几种属于该模型适用范围的化合物为潜在的3c样蛋白酶抑制剂,并提出进一步的实验来证实预测的活性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信