J. Castillo-Garit, Y. Cañizares-Carmenate, H. Pham-The, F. Torrens, F. Pérez-Giménez
{"title":"基于QSARINS的Sars-Cov-2主要蛋白酶抑制剂的计算鉴定","authors":"J. Castillo-Garit, Y. Cañizares-Carmenate, H. Pham-The, F. Torrens, F. Pérez-Giménez","doi":"10.3390/mol2net-07-11622","DOIUrl":null,"url":null,"abstract":"Abstract. The novel coronavirus SARS-CoV-2 responsible for COVID-19, for which there is no vaccine or any known effective treatment created a sense of urgency for novel drug discovery approaches. One of the most important COVID-19 protein targets is the 3C-like (main) protease for which the crystal structure is known. In this study, we used QSAR methodology to identify compounds with potential inhibition activity for 3C-like protease. First we collect a dataset of 204 compounds, with experimental report of inhibition against SARS-CoV main protease, to develop a predictive model, using Multiple Linear Regression and a Genetic Algorithm for the selection of variables, implemented in the QSARINS software. The model was assessed and validated using the OECDs principles. The best model showed good value for the determination coefficient (R 2 =0.61), and others parameters were appropriate for fitting ( s =0.47 and RMSE tr =0.45). The validation results confirmed that the model has good robustness (Q 2LOO =0.53) and stability (R 2 –Q 2LOO =0.08) with low correlation between the descriptors (K XX =0.41), an excellent predictive power (R 2ext =0.54) and was product of a non-random correlation (R 2Yscr =0.06). This model is employed for the virtual screening of the Drug Bank database and several compounds, which belong to the applicability domain of the models, were identified as potential 3C-like protease inhibitors and proposed to further experiments to corroborate the predicted activity.","PeriodicalId":136053,"journal":{"name":"Proceedings of MOL2NET'21, Conference on Molecular, Biomedical & Computational Sciences and Engineering, 7th ed.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"QSARINS Based Computational Identification of Sars-Cov-2 Main Protease Inhibitors\",\"authors\":\"J. Castillo-Garit, Y. Cañizares-Carmenate, H. Pham-The, F. Torrens, F. Pérez-Giménez\",\"doi\":\"10.3390/mol2net-07-11622\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract. The novel coronavirus SARS-CoV-2 responsible for COVID-19, for which there is no vaccine or any known effective treatment created a sense of urgency for novel drug discovery approaches. One of the most important COVID-19 protein targets is the 3C-like (main) protease for which the crystal structure is known. In this study, we used QSAR methodology to identify compounds with potential inhibition activity for 3C-like protease. First we collect a dataset of 204 compounds, with experimental report of inhibition against SARS-CoV main protease, to develop a predictive model, using Multiple Linear Regression and a Genetic Algorithm for the selection of variables, implemented in the QSARINS software. The model was assessed and validated using the OECDs principles. The best model showed good value for the determination coefficient (R 2 =0.61), and others parameters were appropriate for fitting ( s =0.47 and RMSE tr =0.45). The validation results confirmed that the model has good robustness (Q 2LOO =0.53) and stability (R 2 –Q 2LOO =0.08) with low correlation between the descriptors (K XX =0.41), an excellent predictive power (R 2ext =0.54) and was product of a non-random correlation (R 2Yscr =0.06). This model is employed for the virtual screening of the Drug Bank database and several compounds, which belong to the applicability domain of the models, were identified as potential 3C-like protease inhibitors and proposed to further experiments to corroborate the predicted activity.\",\"PeriodicalId\":136053,\"journal\":{\"name\":\"Proceedings of MOL2NET'21, Conference on Molecular, Biomedical & Computational Sciences and Engineering, 7th ed.\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of MOL2NET'21, Conference on Molecular, Biomedical & Computational Sciences and Engineering, 7th ed.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/mol2net-07-11622\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of MOL2NET'21, Conference on Molecular, Biomedical & Computational Sciences and Engineering, 7th ed.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/mol2net-07-11622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
QSARINS Based Computational Identification of Sars-Cov-2 Main Protease Inhibitors
Abstract. The novel coronavirus SARS-CoV-2 responsible for COVID-19, for which there is no vaccine or any known effective treatment created a sense of urgency for novel drug discovery approaches. One of the most important COVID-19 protein targets is the 3C-like (main) protease for which the crystal structure is known. In this study, we used QSAR methodology to identify compounds with potential inhibition activity for 3C-like protease. First we collect a dataset of 204 compounds, with experimental report of inhibition against SARS-CoV main protease, to develop a predictive model, using Multiple Linear Regression and a Genetic Algorithm for the selection of variables, implemented in the QSARINS software. The model was assessed and validated using the OECDs principles. The best model showed good value for the determination coefficient (R 2 =0.61), and others parameters were appropriate for fitting ( s =0.47 and RMSE tr =0.45). The validation results confirmed that the model has good robustness (Q 2LOO =0.53) and stability (R 2 –Q 2LOO =0.08) with low correlation between the descriptors (K XX =0.41), an excellent predictive power (R 2ext =0.54) and was product of a non-random correlation (R 2Yscr =0.06). This model is employed for the virtual screening of the Drug Bank database and several compounds, which belong to the applicability domain of the models, were identified as potential 3C-like protease inhibitors and proposed to further experiments to corroborate the predicted activity.