A machine learning-based virtual screening for natural compounds capable of inhibiting the HIV-1 integrase

L. A. Machado, Eduardo Krempser, A. Guimarães
{"title":"A machine learning-based virtual screening for natural compounds capable of inhibiting the HIV-1 integrase","authors":"L. A. Machado, Eduardo Krempser, A. Guimarães","doi":"10.3389/fddsv.2022.954911","DOIUrl":null,"url":null,"abstract":"HIV-1 integrase is an essential enzyme for the HIV-1 replication cycle, and currently, integrase inhibitors are in the first line of treatment in many guidelines. Despite the discovery of new inhibitors, including a new class of molecules with different mechanisms of action, resistance is still a relevant problem, and adding new options to the therapeutic arsenal to fight viral resistance is a Sisyphean task. Because of the difficulty and cost of in vitro screenings, machine learning-driven ligand-based virtual screenings are an alternative that can not only cut costs but also use valuable information about active compounds with yet unknown mechanisms of action. In this work, we describe a thorough model exploration and hyperparameter tuning procedure in a dataset with class imbalance and show several models capable of distinguishing between compounds that are active or inactive against the HIV-1 integrase. The best of the models was then used to screen the natural product atlas for active compounds, resulting in a myriad of molecules that share features with known integrase inhibitors. Here we also explore the strengths and shortcomings of our models and discuss the use of the applicability domain to guide in vitro screenings and differentiate between the “predictable” and “unknown” regions of the chemical space.","PeriodicalId":73080,"journal":{"name":"Frontiers in drug discovery","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in drug discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fddsv.2022.954911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

HIV-1 integrase is an essential enzyme for the HIV-1 replication cycle, and currently, integrase inhibitors are in the first line of treatment in many guidelines. Despite the discovery of new inhibitors, including a new class of molecules with different mechanisms of action, resistance is still a relevant problem, and adding new options to the therapeutic arsenal to fight viral resistance is a Sisyphean task. Because of the difficulty and cost of in vitro screenings, machine learning-driven ligand-based virtual screenings are an alternative that can not only cut costs but also use valuable information about active compounds with yet unknown mechanisms of action. In this work, we describe a thorough model exploration and hyperparameter tuning procedure in a dataset with class imbalance and show several models capable of distinguishing between compounds that are active or inactive against the HIV-1 integrase. The best of the models was then used to screen the natural product atlas for active compounds, resulting in a myriad of molecules that share features with known integrase inhibitors. Here we also explore the strengths and shortcomings of our models and discuss the use of the applicability domain to guide in vitro screenings and differentiate between the “predictable” and “unknown” regions of the chemical space.
基于机器学习的抑制HIV-1整合酶天然化合物的虚拟筛选
HIV-1整合酶是HIV-1复制周期的必需酶,目前,整合酶抑制剂在许多指南中处于治疗的第一线。尽管发现了新的抑制剂,包括一类具有不同作用机制的新分子,但耐药性仍然是一个相关的问题,为对抗病毒耐药性的治疗库增加新的选择是一项西西弗的任务。由于体外筛选的难度和成本,机器学习驱动的基于配体的虚拟筛选是一种替代方案,它不仅可以降低成本,还可以使用关于具有未知作用机制的活性化合物的有价值信息。在这项工作中,我们描述了在类别不平衡的数据集中进行的彻底的模型探索和超参数调整过程,并展示了几种能够区分对HIV-1整合酶有活性或无活性的化合物的模型。然后,使用最好的模型来筛选天然产物图谱中的活性化合物,从而产生与已知整合酶抑制剂具有相同特征的无数分子。在这里,我们还探讨了我们的模型的优点和缺点,并讨论了使用适用性领域来指导体外筛选,并区分化学空间的“可预测”和“未知”区域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信