qsar驱动的虚拟筛选实际应用中良好实践和错误命中的重要性:SARS-CoV-2主蛋白酶(Mpro)案例研究

M. Serafim, S. Q. Pantaleão, E. B. da Silva, J. McKerrow, A. O’Donoghue, B. E. Mota, K. M. Honório, V. Maltarollo
{"title":"qsar驱动的虚拟筛选实际应用中良好实践和错误命中的重要性:SARS-CoV-2主蛋白酶(Mpro)案例研究","authors":"M. Serafim, S. Q. Pantaleão, E. B. da Silva, J. McKerrow, A. O’Donoghue, B. E. Mota, K. M. Honório, V. Maltarollo","doi":"10.3389/fddsv.2023.1237655","DOIUrl":null,"url":null,"abstract":"Computer-Aided Drug Design (CADD) approaches, such as those employing quantitative structure-activity relationship (QSAR) methods, are known for their ability to uncover novel data from large databases. These approaches can help alleviate the lack of biological and chemical data, but some predictions do not generate sufficient positive information to be useful for biological screenings. QSAR models are often employed to explain biological data of chemicals and to design new chemicals based on their predictions. In this review, we discuss the importance of data set size with a focus on false hits for QSAR approaches. We assess the challenges and reliability of an initial in silico strategy for the virtual screening of bioactive molecules. Lastly, we present a case study reporting a combination approach of hologram-based quantitative structure-activity relationship (HQSAR) models and random forest-based QSAR (RF-QSAR), based on the 3D structures of 25 synthetic SARS-CoV-2 Mpro inhibitors, to virtually screen new compounds for potential inhibitors of enzyme activity. In this study, optimal models were selected and employed to predict Mpro inhibitors from the database Brazilian Compound Library (BraCoLi). Twenty-four compounds were then assessed against SARS-CoV-2 Mpro at 10 µM. At the time of this study (March 2021), the availability of varied and different Mpro inhibitors that were reported definitely affected the reliability of our work. Since no hits were obtained, the data set size, parameters employed, external validations, as well as the applicability domain (AD) could be considered regarding false hits data contribution, aiming to enhance the design and discovery of new bioactive molecules.","PeriodicalId":73080,"journal":{"name":"Frontiers in drug discovery","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The importance of good practices and false hits for QSAR-driven virtual screening real application: a SARS-CoV-2 main protease (Mpro) case study\",\"authors\":\"M. Serafim, S. Q. Pantaleão, E. B. da Silva, J. McKerrow, A. O’Donoghue, B. E. Mota, K. M. Honório, V. Maltarollo\",\"doi\":\"10.3389/fddsv.2023.1237655\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer-Aided Drug Design (CADD) approaches, such as those employing quantitative structure-activity relationship (QSAR) methods, are known for their ability to uncover novel data from large databases. These approaches can help alleviate the lack of biological and chemical data, but some predictions do not generate sufficient positive information to be useful for biological screenings. QSAR models are often employed to explain biological data of chemicals and to design new chemicals based on their predictions. In this review, we discuss the importance of data set size with a focus on false hits for QSAR approaches. We assess the challenges and reliability of an initial in silico strategy for the virtual screening of bioactive molecules. Lastly, we present a case study reporting a combination approach of hologram-based quantitative structure-activity relationship (HQSAR) models and random forest-based QSAR (RF-QSAR), based on the 3D structures of 25 synthetic SARS-CoV-2 Mpro inhibitors, to virtually screen new compounds for potential inhibitors of enzyme activity. In this study, optimal models were selected and employed to predict Mpro inhibitors from the database Brazilian Compound Library (BraCoLi). Twenty-four compounds were then assessed against SARS-CoV-2 Mpro at 10 µM. At the time of this study (March 2021), the availability of varied and different Mpro inhibitors that were reported definitely affected the reliability of our work. Since no hits were obtained, the data set size, parameters employed, external validations, as well as the applicability domain (AD) could be considered regarding false hits data contribution, aiming to enhance the design and discovery of new bioactive molecules.\",\"PeriodicalId\":73080,\"journal\":{\"name\":\"Frontiers in drug discovery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in drug discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fddsv.2023.1237655\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in drug discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fddsv.2023.1237655","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

计算机辅助药物设计(CADD)方法,如采用定量构效关系(QSAR)方法的方法,以其从大型数据库中发现新数据的能力而闻名。这些方法可以帮助缓解生物和化学数据的缺乏,但一些预测并没有产生足够的积极信息,对生物筛查有用。QSAR模型通常用于解释化学品的生物数据,并根据其预测设计新的化学品。在这篇综述中,我们讨论了数据集大小的重要性,重点是QSAR方法的错误命中。我们评估了虚拟筛选生物活性分子的初始计算机策略的挑战和可靠性。最后,我们提出了一个案例研究,报告了基于全息图的定量构效关系(HQSAR)模型和基于随机森林的QSAR(RF-QSAR)的组合方法,该方法基于25种合成的严重急性呼吸系统综合征冠状病毒2 Mpro抑制剂的3D结构,以实际筛选潜在的酶活性抑制剂的新化合物。在本研究中,从巴西化合物库(BraCoLi)数据库中选择并使用最佳模型来预测Mpro抑制剂。然后在10µM下对24种化合物进行了抗严重急性呼吸系统综合征冠状病毒2 Mpro的评估。在本研究进行时(2021年3月),报告的各种不同Mpro抑制剂的可用性肯定影响了我们工作的可靠性。由于没有获得命中率,可以考虑数据集大小、使用的参数、外部验证以及适用域(AD)对虚假命中率数据的贡献,旨在加强新生物活性分子的设计和发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The importance of good practices and false hits for QSAR-driven virtual screening real application: a SARS-CoV-2 main protease (Mpro) case study
Computer-Aided Drug Design (CADD) approaches, such as those employing quantitative structure-activity relationship (QSAR) methods, are known for their ability to uncover novel data from large databases. These approaches can help alleviate the lack of biological and chemical data, but some predictions do not generate sufficient positive information to be useful for biological screenings. QSAR models are often employed to explain biological data of chemicals and to design new chemicals based on their predictions. In this review, we discuss the importance of data set size with a focus on false hits for QSAR approaches. We assess the challenges and reliability of an initial in silico strategy for the virtual screening of bioactive molecules. Lastly, we present a case study reporting a combination approach of hologram-based quantitative structure-activity relationship (HQSAR) models and random forest-based QSAR (RF-QSAR), based on the 3D structures of 25 synthetic SARS-CoV-2 Mpro inhibitors, to virtually screen new compounds for potential inhibitors of enzyme activity. In this study, optimal models were selected and employed to predict Mpro inhibitors from the database Brazilian Compound Library (BraCoLi). Twenty-four compounds were then assessed against SARS-CoV-2 Mpro at 10 µM. At the time of this study (March 2021), the availability of varied and different Mpro inhibitors that were reported definitely affected the reliability of our work. Since no hits were obtained, the data set size, parameters employed, external validations, as well as the applicability domain (AD) could be considered regarding false hits data contribution, aiming to enhance the design and discovery of new bioactive molecules.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信