基于高通量化学生物学筛选数据集的分子体外抗结核活性计算模型。

IF 2.9 3区 医学 Q2 Medicine
Vinita Periwal, Shireesha Kishtapuram, Vinod Scaria
{"title":"基于高通量化学生物学筛选数据集的分子体外抗结核活性计算模型。","authors":"Vinita Periwal,&nbsp;Shireesha Kishtapuram,&nbsp;Vinod Scaria","doi":"10.1186/1471-2210-12-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The emergence of Multi-drug resistant tuberculosis in pandemic proportions throughout the world and the paucity of novel therapeutics for tuberculosis have re-iterated the need to accelerate the discovery of novel molecules with anti-tubercular activity. Though high-throughput screens for anti-tubercular activity are available, they are expensive, tedious and time-consuming to be performed on large scales. Thus, there remains an unmet need to prioritize the molecules that are taken up for biological screens to save on cost and time. Computational methods including Machine Learning have been widely employed to build classifiers for high-throughput virtual screens to prioritize molecules for further analysis. The availability of datasets based on high-throughput biological screens or assays in public domain makes computational methods a plausible proposition for building predictive models. In addition, this approach would save significantly on the cost, effort and time required to run high throughput screens.</p><p><strong>Results: </strong>We show that by using four supervised state-of-the-art classifiers (SMO, Random Forest, Naive Bayes and J48) we are able to generate in-silico predictive models on an extremely imbalanced (minority class ratio: 0.6%) large dataset of anti-tubercular molecules with reasonable AROC (0.6-0.75) and BCR (60-66%) values. Moreover, these models are able to provide 3-4 fold enrichment over random selection.</p><p><strong>Conclusions: </strong>In the present study, we have used the data from in-vitro screens for anti-tubercular activity from a high-throughput screen available in public domain to build highly accurate classifiers based on molecular descriptors of the molecules. We show that Machine Learning tools can be used to build highly effective predictive models for virtual high-throughput screens to prioritize molecules from large molecular libraries.</p>","PeriodicalId":48846,"journal":{"name":"BMC Pharmacology & Toxicology","volume":"12 ","pages":"1"},"PeriodicalIF":2.9000,"publicationDate":"2012-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1471-2210-12-1","citationCount":"34","resultStr":"{\"title\":\"Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets.\",\"authors\":\"Vinita Periwal,&nbsp;Shireesha Kishtapuram,&nbsp;Vinod Scaria\",\"doi\":\"10.1186/1471-2210-12-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The emergence of Multi-drug resistant tuberculosis in pandemic proportions throughout the world and the paucity of novel therapeutics for tuberculosis have re-iterated the need to accelerate the discovery of novel molecules with anti-tubercular activity. Though high-throughput screens for anti-tubercular activity are available, they are expensive, tedious and time-consuming to be performed on large scales. Thus, there remains an unmet need to prioritize the molecules that are taken up for biological screens to save on cost and time. Computational methods including Machine Learning have been widely employed to build classifiers for high-throughput virtual screens to prioritize molecules for further analysis. The availability of datasets based on high-throughput biological screens or assays in public domain makes computational methods a plausible proposition for building predictive models. In addition, this approach would save significantly on the cost, effort and time required to run high throughput screens.</p><p><strong>Results: </strong>We show that by using four supervised state-of-the-art classifiers (SMO, Random Forest, Naive Bayes and J48) we are able to generate in-silico predictive models on an extremely imbalanced (minority class ratio: 0.6%) large dataset of anti-tubercular molecules with reasonable AROC (0.6-0.75) and BCR (60-66%) values. Moreover, these models are able to provide 3-4 fold enrichment over random selection.</p><p><strong>Conclusions: </strong>In the present study, we have used the data from in-vitro screens for anti-tubercular activity from a high-throughput screen available in public domain to build highly accurate classifiers based on molecular descriptors of the molecules. We show that Machine Learning tools can be used to build highly effective predictive models for virtual high-throughput screens to prioritize molecules from large molecular libraries.</p>\",\"PeriodicalId\":48846,\"journal\":{\"name\":\"BMC Pharmacology & Toxicology\",\"volume\":\"12 \",\"pages\":\"1\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2012-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/1471-2210-12-1\",\"citationCount\":\"34\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Pharmacology & Toxicology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/1471-2210-12-1\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pharmacology & Toxicology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/1471-2210-12-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 34

摘要

背景:耐多药结核病在世界范围内的大流行和结核病新疗法的缺乏再次表明需要加速发现具有抗结核活性的新分子。虽然抗结核活性的高通量筛查是可用的,但它们昂贵,繁琐且耗时,无法大规模进行。因此,为了节省成本和时间,仍然需要优先考虑用于生物筛选的分子。包括机器学习在内的计算方法已被广泛用于构建高通量虚拟屏幕的分类器,以优先考虑分子以进行进一步分析。基于公共领域的高通量生物筛选或分析的数据集的可用性使得计算方法成为构建预测模型的合理主张。此外,这种方法将大大节省运行高吞吐量屏幕所需的成本、精力和时间。结果:我们表明,通过使用四个有监督的最先进的分类器(SMO,随机森林,朴素贝叶斯和J48),我们能够在具有合理的AROC(0.6-0.75)和BCR(60-66%)值的极度不平衡(少数类比率:0.6%)的抗结核分子大数据集上生成计算机预测模型。此外,这些模型能够提供3-4倍的富集比随机选择。结论:在本研究中,我们利用公共领域高通量筛选的体外抗结核活性筛选数据,基于分子描述符构建了高度准确的分类器。我们表明,机器学习工具可用于为虚拟高通量筛选构建高效的预测模型,以优先考虑大型分子库中的分子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets.

Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets.

Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets.

Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets.

Background: The emergence of Multi-drug resistant tuberculosis in pandemic proportions throughout the world and the paucity of novel therapeutics for tuberculosis have re-iterated the need to accelerate the discovery of novel molecules with anti-tubercular activity. Though high-throughput screens for anti-tubercular activity are available, they are expensive, tedious and time-consuming to be performed on large scales. Thus, there remains an unmet need to prioritize the molecules that are taken up for biological screens to save on cost and time. Computational methods including Machine Learning have been widely employed to build classifiers for high-throughput virtual screens to prioritize molecules for further analysis. The availability of datasets based on high-throughput biological screens or assays in public domain makes computational methods a plausible proposition for building predictive models. In addition, this approach would save significantly on the cost, effort and time required to run high throughput screens.

Results: We show that by using four supervised state-of-the-art classifiers (SMO, Random Forest, Naive Bayes and J48) we are able to generate in-silico predictive models on an extremely imbalanced (minority class ratio: 0.6%) large dataset of anti-tubercular molecules with reasonable AROC (0.6-0.75) and BCR (60-66%) values. Moreover, these models are able to provide 3-4 fold enrichment over random selection.

Conclusions: In the present study, we have used the data from in-vitro screens for anti-tubercular activity from a high-throughput screen available in public domain to build highly accurate classifiers based on molecular descriptors of the molecules. We show that Machine Learning tools can be used to build highly effective predictive models for virtual high-throughput screens to prioritize molecules from large molecular libraries.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Pharmacology & Toxicology
BMC Pharmacology & Toxicology PHARMACOLOGY & PHARMACY-TOXICOLOGY
CiteScore
4.40
自引率
0.00%
发文量
0
审稿时长
12 weeks
期刊介绍: BMC Pharmacology and Toxicology is an open access, peer-reviewed journal that considers articles on all aspects of chemically defined therapeutic and toxic agents. The journal welcomes submissions from all fields of experimental and clinical pharmacology including clinical trials and toxicology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信