Quality over quantity: how to get the best results when using docking for repurposing.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in bioinformatics Pub Date : 2025-05-26 eCollection Date: 2025-01-01 DOI:10.3389/fbinf.2025.1536504
Lenin Domínguez-Ramírez, Maricruz Anaya-Ruiz, Paulina Cortés-Hernández
{"title":"Quality over quantity: how to get the best results when using docking for repurposing.","authors":"Lenin Domínguez-Ramírez, Maricruz Anaya-Ruiz, Paulina Cortés-Hernández","doi":"10.3389/fbinf.2025.1536504","DOIUrl":null,"url":null,"abstract":"<p><p>Molecular docking is among the fastest and most readily available computational tools to explore protein-ligand interactions. However, little effort has been put into assessing the quality of its results. In this paper, we compared eight free license docking programs to screen a drug library against the human target, phosphodiesterase 5A (PDE5A), to evaluate their ability to find its known ligand, sildenafil, and other ligands that became erectile dysfunction drugs because they inhibit this target. GNINA was superior at identifying the known target because it offers a convolutional neural network (CNN) score that ranks the quality of docking results. Using this CNN score improved the ranking of known positives. Receiver operating characteristic (ROC) analysis revealed that all docking suites lack specificity; that is, they often misidentify true negatives. Employing a CNN score cutoff before ranking by docking affinity raised specificity with a small loss in sensitivity. After the cutoff, datasets became smaller but of higher quality. We propose a heuristic to produce relevant docking results, which includes an overall evaluation of the target on docking performance through ROC and an improvement of candidate binder selection using a CNN score cutoff of 0.9.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1536504"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12146287/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2025.1536504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Molecular docking is among the fastest and most readily available computational tools to explore protein-ligand interactions. However, little effort has been put into assessing the quality of its results. In this paper, we compared eight free license docking programs to screen a drug library against the human target, phosphodiesterase 5A (PDE5A), to evaluate their ability to find its known ligand, sildenafil, and other ligands that became erectile dysfunction drugs because they inhibit this target. GNINA was superior at identifying the known target because it offers a convolutional neural network (CNN) score that ranks the quality of docking results. Using this CNN score improved the ranking of known positives. Receiver operating characteristic (ROC) analysis revealed that all docking suites lack specificity; that is, they often misidentify true negatives. Employing a CNN score cutoff before ranking by docking affinity raised specificity with a small loss in sensitivity. After the cutoff, datasets became smaller but of higher quality. We propose a heuristic to produce relevant docking results, which includes an overall evaluation of the target on docking performance through ROC and an improvement of candidate binder selection using a CNN score cutoff of 0.9.

质重于量:如何在对接再利用时获得最佳效果。
分子对接是探索蛋白质-配体相互作用的最快和最容易获得的计算工具之一。然而,评估其结果质量的努力却很少。在本文中,我们比较了8个免费的许可证对接程序来筛选针对人类靶标磷酸二酯酶5A (PDE5A)的药物库,以评估它们找到已知配体的能力,西地那非和其他配体由于抑制该靶标而成为勃起功能障碍药物。GNINA在识别已知目标方面表现优异,因为它提供了一个卷积神经网络(CNN)评分,对对接结果的质量进行排名。使用CNN评分提高了已知阳性的排名。受试者工作特征(ROC)分析显示,所有对接组都缺乏特异性;也就是说,他们经常错误地识别真正的消极因素。在对接亲和度排序前采用CNN评分截止提高了特异性,敏感性损失较小。截止后,数据集变得更小,但质量更高。我们提出了一种启发式方法来产生相关的对接结果,其中包括通过ROC对目标的对接性能进行总体评估,并使用0.9的CNN分数截断来改进候选绑定器的选择。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信