摩根指纹在基于结构的虚拟配体筛选中的实用性。

IF 2.9 2区化学 Q3 CHEMISTRY, PHYSICAL

The Journal of Physical Chemistry B Pub Date : 2024-05-23 DOI:10.1021/acs.jpcb.4c01875

Hongyi Zhou, and , Jeffrey Skolnick*,

{"title":"摩根指纹在基于结构的虚拟配体筛选中的实用性。","authors":"Hongyi Zhou,  and , Jeffrey Skolnick*, ","doi":"10.1021/acs.jpcb.4c01875","DOIUrl":null,"url":null,"abstract":"In modern drug discovery, virtual ligand screening (VLS) is frequently applied to identify possible hits before experimental testing and refinement due to its cost-effective nature for large compound libraries. For decades, efforts have been devoted to developing VLS methods with high accuracy. These include the state-of-the-art FINDSITE suite of approaches FINDSITEcomb2.0, FRAGSITE, and FRAGSITE2 and the meta version FRAGSITEcomb that were developed in our lab. These methods combine ligand homology modeling (LHM), traditional ligand similarity methods, and more recently machine learning approaches to rank ligands and have proven to be superior to most recent deep learning and large language model-based approaches. Here, we describe further improvements to our previous best methods by combining the Morgan fingerprint (MF) with the originally used PubChem fingerprint and FP2 fingerprint. We then benchmarked FINDSITEcomb2.0M, FRAGSITEM, FRAGSITE2M, and the composite meta-approach FRAGSITEcombM. On the 102 target DUD-E set, the 1% enrichment factor (EF1%) and area under the precision-recall curve (AUPR) of FRAGSITEcomb increased from 42.0/0.59 to 47.6/0.72. This 0.72 AUPR is significantly better than that of the state-of-the-art deep learning-based method DenseFS’s AUPR of 0.443. An independent test on the 81 targets DEKOIS2.0 set shows that EF1%/AUPR increases from 18.3/0.520 to 23.1/0.683. An ablation investigation shows that the MF contributes to most of the improvement of all four approaches. Thus, the MF is a useful addition to structure-based VLS.","PeriodicalId":60,"journal":{"name":"The Journal of Physical Chemistry B","volume":"128 22","pages":"5363–5370"},"PeriodicalIF":2.9000,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acs.jpcb.4c01875","citationCount":"0","resultStr":"{\"title\":\"Utility of the Morgan Fingerprint in Structure-Based Virtual Ligand Screening\",\"authors\":\"Hongyi Zhou,  and , Jeffrey Skolnick*, \",\"doi\":\"10.1021/acs.jpcb.4c01875\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In modern drug discovery, virtual ligand screening (VLS) is frequently applied to identify possible hits before experimental testing and refinement due to its cost-effective nature for large compound libraries. For decades, efforts have been devoted to developing VLS methods with high accuracy. These include the state-of-the-art FINDSITE suite of approaches FINDSITEcomb2.0, FRAGSITE, and FRAGSITE2 and the meta version FRAGSITEcomb that were developed in our lab. These methods combine ligand homology modeling (LHM), traditional ligand similarity methods, and more recently machine learning approaches to rank ligands and have proven to be superior to most recent deep learning and large language model-based approaches. Here, we describe further improvements to our previous best methods by combining the Morgan fingerprint (MF) with the originally used PubChem fingerprint and FP2 fingerprint. We then benchmarked FINDSITEcomb2.0M, FRAGSITEM, FRAGSITE2M, and the composite meta-approach FRAGSITEcombM. On the 102 target DUD-E set, the 1% enrichment factor (EF1%) and area under the precision-recall curve (AUPR) of FRAGSITEcomb increased from 42.0/0.59 to 47.6/0.72. This 0.72 AUPR is significantly better than that of the state-of-the-art deep learning-based method DenseFS’s AUPR of 0.443. An independent test on the 81 targets DEKOIS2.0 set shows that EF1%/AUPR increases from 18.3/0.520 to 23.1/0.683. An ablation investigation shows that the MF contributes to most of the improvement of all four approaches. Thus, the MF is a useful addition to structure-based VLS.\",\"PeriodicalId\":60,\"journal\":{\"name\":\"The Journal of Physical Chemistry B\",\"volume\":\"128 22\",\"pages\":\"5363–5370\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/epdf/10.1021/acs.jpcb.4c01875\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Physical Chemistry B\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.jpcb.4c01875\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry B","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jpcb.4c01875","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}

引用次数: 0

摘要

在现代药物发现中，虚拟配体筛选（VLS）因其对大型化合物库的成本效益高，经常被用于在实验测试和改进之前识别可能的命中物。几十年来，人们一直致力于开发高精度的虚拟配体筛选方法。其中包括最先进的 FINDSITE 方法套件 FINDSITEcomb2.0、FRAGSITE 和 FRAGSITE2 以及我们实验室开发的元版本 FRAGSITEcomb。这些方法结合了配体同源性建模 (LHM)、传统配体相似性方法以及最近的机器学习方法来对配体进行排序，并已证明优于最近的深度学习和基于大型语言模型的方法。在此，我们介绍了通过将摩根指纹（MF）与最初使用的 PubChem 指纹和 FP2 指纹相结合，对之前最佳方法的进一步改进。然后，我们对 FINDSITEcomb2.0M、FRAGSITEM、FRAGSITE2M 和复合元方法 FRAGSITEcombM 进行了基准测试。在 102 个目标 DUD-E 集上，FRAGSITEcomb 的 1% 富集因子 (EF1%) 和精度-召回曲线下面积 (AUPR) 从 42.0/0.59 增加到 47.6/0.72。0.72 的 AUPR 明显优于基于深度学习的最先进方法 DenseFS 的 0.443 AUPR。对 81 个目标 DEKOIS2.0 集的独立测试表明，EF1%/AUPR 从 18.3/0.520 增加到 23.1/0.683。一项消融调查显示，MF 对所有四种方法的大部分改进都做出了贡献。因此，MF 是对基于结构的 VLS 的有益补充。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Utility of the Morgan Fingerprint in Structure-Based Virtual Ligand Screening

查看原文本刊更多论文

Utility of the Morgan Fingerprint in Structure-Based Virtual Ligand Screening

In modern drug discovery, virtual ligand screening (VLS) is frequently applied to identify possible hits before experimental testing and refinement due to its cost-effective nature for large compound libraries. For decades, efforts have been devoted to developing VLS methods with high accuracy. These include the state-of-the-art FINDSITE suite of approaches FINDSITE^comb2.0, FRAGSITE, and FRAGSITE2 and the meta version FRAGSITE^comb that were developed in our lab. These methods combine ligand homology modeling (LHM), traditional ligand similarity methods, and more recently machine learning approaches to rank ligands and have proven to be superior to most recent deep learning and large language model-based approaches. Here, we describe further improvements to our previous best methods by combining the Morgan fingerprint (MF) with the originally used PubChem fingerprint and FP2 fingerprint. We then benchmarked FINDSITE^comb2.0M, FRAGSITE^M, FRAGSITE2^M, and the composite meta-approach FRAGSITE^combM. On the 102 target DUD-E set, the 1% enrichment factor (EF_1%) and area under the precision-recall curve (AUPR) of FRAGSITE^comb increased from 42.0/0.59 to 47.6/0.72. This 0.72 AUPR is significantly better than that of the state-of-the-art deep learning-based method DenseFS’s AUPR of 0.443. An independent test on the 81 targets DEKOIS2.0 set shows that EF_1%/AUPR increases from 18.3/0.520 to 23.1/0.683. An ablation investigation shows that the MF contributes to most of the improvement of all four approaches. Thus, the MF is a useful addition to structure-based VLS.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Journal of Physical Chemistry B 化学-物理化学

CiteScore

5.80

自引率

9.10%

发文量

965

审稿时长

1.6 months

期刊介绍： An essential criterion for acceptance of research articles in the journal is that they provide new physical insight. Please refer to the New Physical Insights virtual issue on what constitutes new physical insight. Manuscripts that are essentially reporting data or applications of data are, in general, not suitable for publication in JPC B.