基于机器学习和分子模拟的规程,以识别新型潜在的逆转录酶抑制剂,对抗艾滋病病毒感染。

IF 2.4 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Muhammad Shahab, Guojun Zheng, Yousef A Bin Jardan, Mohammed Bourhia
{"title":"基于机器学习和分子模拟的规程,以识别新型潜在的逆转录酶抑制剂,对抗艾滋病病毒感染。","authors":"Muhammad Shahab, Guojun Zheng, Yousef A Bin Jardan, Mohammed Bourhia","doi":"10.1080/07391102.2024.2319112","DOIUrl":null,"url":null,"abstract":"<p><p>Acquired immunodeficiency syndrome (AIDS) is a potentially fatal condition affecting the human immune system, which is attributed to the human immunodeficiency virus (HIV). The suppression of reverse transcriptase activity is a promising and feasible strategy for the therapeutic management of AIDS. In this study, we employed machine learning algorithms, such as support vector machines (SVM), k-nearest neighbor (k-NN), random forest (RF), and Gaussian naive base (GNB), which are fast and effective tools commonly used in drug design. For model training, we initially obtained a dataset of 5,159 compounds from BindingDB. The models were assessed using tenfold cross-validation to ensure their accuracy and reliability. Among these compounds, 1,645 compounds were labeled as active, having an IC<sub>50</sub> below 0.49 µM, while 3,514 compounds were labeled \"inactive against reverse transcriptase. Random forest achieved 86% accuracy on the train and test set among the different machine learning algorithms. Random forest model was then applied to an external ZINC dataset. Subsequently, only three hits-ZINC1359750464, ZINC1435357562, and ZINC1545719422-were selected based on the Lipinski Rule, docking score, and good interaction. The stability of these molecules was further evaluated by deploying molecular dynamics simulation and MM/GBSA, which were found to be -38.6013 ± 0.1103 kcal/mol for the Zidovudine/RT complex, -59.1761 ± 2.2926 kcal/mol for the ZINC1359750464/RT complex, -47.6292 ± 2.4206 kcal/mol for the ZINC1435357562/RT complex, and -50.7334 ± 2.5713 kcal/mol for the ZINC1545719422/RT complex.</p>","PeriodicalId":15272,"journal":{"name":"Journal of Biomolecular Structure & Dynamics","volume":" ","pages":"6109-6122"},"PeriodicalIF":2.4000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning and molecular simulation-based protocols to identify novel potential inhibitors for reverse transcriptase against HIV infections.\",\"authors\":\"Muhammad Shahab, Guojun Zheng, Yousef A Bin Jardan, Mohammed Bourhia\",\"doi\":\"10.1080/07391102.2024.2319112\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Acquired immunodeficiency syndrome (AIDS) is a potentially fatal condition affecting the human immune system, which is attributed to the human immunodeficiency virus (HIV). The suppression of reverse transcriptase activity is a promising and feasible strategy for the therapeutic management of AIDS. In this study, we employed machine learning algorithms, such as support vector machines (SVM), k-nearest neighbor (k-NN), random forest (RF), and Gaussian naive base (GNB), which are fast and effective tools commonly used in drug design. For model training, we initially obtained a dataset of 5,159 compounds from BindingDB. The models were assessed using tenfold cross-validation to ensure their accuracy and reliability. Among these compounds, 1,645 compounds were labeled as active, having an IC<sub>50</sub> below 0.49 µM, while 3,514 compounds were labeled \\\"inactive against reverse transcriptase. Random forest achieved 86% accuracy on the train and test set among the different machine learning algorithms. Random forest model was then applied to an external ZINC dataset. Subsequently, only three hits-ZINC1359750464, ZINC1435357562, and ZINC1545719422-were selected based on the Lipinski Rule, docking score, and good interaction. The stability of these molecules was further evaluated by deploying molecular dynamics simulation and MM/GBSA, which were found to be -38.6013 ± 0.1103 kcal/mol for the Zidovudine/RT complex, -59.1761 ± 2.2926 kcal/mol for the ZINC1359750464/RT complex, -47.6292 ± 2.4206 kcal/mol for the ZINC1435357562/RT complex, and -50.7334 ± 2.5713 kcal/mol for the ZINC1545719422/RT complex.</p>\",\"PeriodicalId\":15272,\"journal\":{\"name\":\"Journal of Biomolecular Structure & Dynamics\",\"volume\":\" \",\"pages\":\"6109-6122\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Biomolecular Structure & Dynamics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1080/07391102.2024.2319112\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomolecular Structure & Dynamics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1080/07391102.2024.2319112","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/20 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

获得性免疫缺陷综合征(艾滋病)是一种影响人类免疫系统的潜在致命疾病,是由人类免疫缺陷病毒(HIV)引起的。抑制逆转录酶的活性是治疗艾滋病的一种有希望且可行的策略。在本研究中,我们采用了机器学习算法,如支持向量机(SVM)、k-近邻(k-NN)、随机森林(RF)和高斯天真基础(GNB),这些算法是药物设计中常用的快速有效的工具。为了训练模型,我们首先从 BindingDB 中获得了 5,159 个化合物的数据集。为了确保模型的准确性和可靠性,我们对模型进行了十倍交叉验证。在这些化合物中,1645 个化合物被标记为活性化合物,其 IC50 低于 0.49 µM,而 3514 个化合物被标记为 "对逆转录酶无活性"。在不同的机器学习算法中,随机森林在训练集和测试集上的准确率达到了 86%。然后将随机森林模型应用于外部 ZINC 数据集。随后,根据利平斯基规则、对接得分和良好的相互作用,只选出了三个命中分子-ZINC1359750464、ZINC1435357562 和 ZINC1545719422。通过分子动力学模拟和 MM/GBSA 进一步评估了这些分子的稳定性,发现齐多夫定/RT 复合物的稳定性为 -38.6013 ± 0.1103 kcal/mol,ZINC1545719422 的稳定性为 -59.1761 ± 2.2926 kcal/mol。2926 kcal/mol,ZINC1435357562/RT复合物为-47.6292 ± 2.4206 kcal/mol,ZINC1545719422/RT复合物为-50.7334 ± 2.5713 kcal/mol。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine learning and molecular simulation-based protocols to identify novel potential inhibitors for reverse transcriptase against HIV infections.

Acquired immunodeficiency syndrome (AIDS) is a potentially fatal condition affecting the human immune system, which is attributed to the human immunodeficiency virus (HIV). The suppression of reverse transcriptase activity is a promising and feasible strategy for the therapeutic management of AIDS. In this study, we employed machine learning algorithms, such as support vector machines (SVM), k-nearest neighbor (k-NN), random forest (RF), and Gaussian naive base (GNB), which are fast and effective tools commonly used in drug design. For model training, we initially obtained a dataset of 5,159 compounds from BindingDB. The models were assessed using tenfold cross-validation to ensure their accuracy and reliability. Among these compounds, 1,645 compounds were labeled as active, having an IC50 below 0.49 µM, while 3,514 compounds were labeled "inactive against reverse transcriptase. Random forest achieved 86% accuracy on the train and test set among the different machine learning algorithms. Random forest model was then applied to an external ZINC dataset. Subsequently, only three hits-ZINC1359750464, ZINC1435357562, and ZINC1545719422-were selected based on the Lipinski Rule, docking score, and good interaction. The stability of these molecules was further evaluated by deploying molecular dynamics simulation and MM/GBSA, which were found to be -38.6013 ± 0.1103 kcal/mol for the Zidovudine/RT complex, -59.1761 ± 2.2926 kcal/mol for the ZINC1359750464/RT complex, -47.6292 ± 2.4206 kcal/mol for the ZINC1435357562/RT complex, and -50.7334 ± 2.5713 kcal/mol for the ZINC1545719422/RT complex.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Biomolecular Structure & Dynamics
Journal of Biomolecular Structure & Dynamics 生物-生化与分子生物学
CiteScore
8.90
自引率
9.10%
发文量
597
审稿时长
2 months
期刊介绍: The Journal of Biomolecular Structure and Dynamics welcomes manuscripts on biological structure, dynamics, interactions and expression. The Journal is one of the leading publications in high end computational science, atomic structural biology, bioinformatics, virtual drug design, genomics and biological networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信