Discriminating High from Low Energy Conformers of Druglike Molecules: An Assessment of Machine Learning Potentials and Quantum Chemical Methods

IF 2.3 3区 化学 Q3 CHEMISTRY, PHYSICAL
Linghan Kong, Richard A. Bryce
{"title":"Discriminating High from Low Energy Conformers of Druglike Molecules: An Assessment of Machine Learning Potentials and Quantum Chemical Methods","authors":"Linghan Kong,&nbsp;Richard A. Bryce","doi":"10.1002/cphc.202400992","DOIUrl":null,"url":null,"abstract":"<p>Accurate and efficient prediction of high energy ligand conformations is important in structure-based drug discovery for the exclusion of unrealistic structures in docking-based virtual screening and <i>de novo</i> design approaches. In this work, we constructed a database of 140 solution conformers from 20 druglike molecules of varying size and chemical complexity, with energetics evaluated at the DLPNO-CCSD(T)/complete basis set (CBS) level. We then assessed a selection of machine learning potentials and semiempirical quantum mechanical models for their ability to predict conformational energetics. The GFN2-xTB tight binding density functional method correlates with reference conformer energies, yielding a Kendall's τ of 0.63 and mean absolute error of 2.2 kcal/mol. As putative internal energy filters for screening, we find that the GFN2-xTB, ANI-2x and MACE-OFF23(L) models perform well in identifying low energy conformer geometries, with sensitivities of 95 %, 89 % and 95 % respectively, but display a reduced ability to exclude high energy conformers, with respective specificities of 80 %, 61 % and 63 %. The GFN2-xTB method therefore exhibited the best overall performance and appears currently the most suitable of the three methods to act as an internal energy filter for integration into drug discovery workflows. Enrichment of high energy conformers in the training of machine learning potentials could improve their performance as conformational filters.</p>","PeriodicalId":9819,"journal":{"name":"Chemphyschem","volume":"26 8","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cphc.202400992","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemphyschem","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cphc.202400992","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate and efficient prediction of high energy ligand conformations is important in structure-based drug discovery for the exclusion of unrealistic structures in docking-based virtual screening and de novo design approaches. In this work, we constructed a database of 140 solution conformers from 20 druglike molecules of varying size and chemical complexity, with energetics evaluated at the DLPNO-CCSD(T)/complete basis set (CBS) level. We then assessed a selection of machine learning potentials and semiempirical quantum mechanical models for their ability to predict conformational energetics. The GFN2-xTB tight binding density functional method correlates with reference conformer energies, yielding a Kendall's τ of 0.63 and mean absolute error of 2.2 kcal/mol. As putative internal energy filters for screening, we find that the GFN2-xTB, ANI-2x and MACE-OFF23(L) models perform well in identifying low energy conformer geometries, with sensitivities of 95 %, 89 % and 95 % respectively, but display a reduced ability to exclude high energy conformers, with respective specificities of 80 %, 61 % and 63 %. The GFN2-xTB method therefore exhibited the best overall performance and appears currently the most suitable of the three methods to act as an internal energy filter for integration into drug discovery workflows. Enrichment of high energy conformers in the training of machine learning potentials could improve their performance as conformational filters.

Abstract Image

区分类药物分子的高能与低能构象:对机器学习潜力和量子化学方法的评估。
在基于对接的虚拟筛选和从头设计方法中,准确有效地预测高能配体构象对基于结构的药物发现非常重要,可以排除不切实际的结构。在这项工作中,我们构建了一个包含20种不同大小和化学复杂性的药物样分子的140种溶液构象的数据库,并在DLPNO-CCSD(T)/完全基集(CBS)水平上评估了能量学。然后,我们评估了机器学习潜力和半经验量子力学模型的选择,以评估它们预测构象能量学的能力。GFN2-xTB紧密结合密度泛函方法与参考构象能量相关,得到Kendall's τ为0.63,平均绝对误差为2.2 kcal/mol。作为假定的筛选内部能量过滤器,我们发现GFN2-xTB, ANI-2x和mce - off23 (L)模型在识别低能量构象几何形状方面表现良好,灵敏度分别为95%,89%和95%,但排除高能量构象的能力较低,特异性分别为80%,61%和63%。因此,GFN2-xTB方法表现出最佳的整体性能,并且是目前三种方法中最适合作为内部能量过滤器整合到药物发现工作流程中的方法。在机器学习电位的训练中富集高能构象可以提高它们作为构象滤波器的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Chemphyschem
Chemphyschem 化学-物理:原子、分子和化学物理
CiteScore
4.60
自引率
3.40%
发文量
425
审稿时长
1.1 months
期刊介绍: ChemPhysChem is one of the leading chemistry/physics interdisciplinary journals (ISI Impact Factor 2018: 3.077) for physical chemistry and chemical physics. It is published on behalf of Chemistry Europe, an association of 16 European chemical societies. ChemPhysChem is an international source for important primary and critical secondary information across the whole field of physical chemistry and chemical physics. It integrates this wide and flourishing field ranging from Solid State and Soft-Matter Research, Electro- and Photochemistry, Femtochemistry and Nanotechnology, Complex Systems, Single-Molecule Research, Clusters and Colloids, Catalysis and Surface Science, Biophysics and Physical Biochemistry, Atmospheric and Environmental Chemistry, and many more topics. ChemPhysChem is peer-reviewed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信