Exploring isofunctional molecules: Design of a benchmark and evaluation of prediction performance.

IF 2.8 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics Pub Date : 2023-04-01 DOI:10.1002/minf.202200216

Philippe Pinel, Gwenn Guichaoua, Matthieu Najm, Stéphanie Labouille, Nicolas Drizard, Yann Gaston-Mathé, Brice Hoffmann, Véronique Stoven

{"title":"Exploring isofunctional molecules: Design of a benchmark and evaluation of prediction performance.","authors":"Philippe Pinel, Gwenn Guichaoua, Matthieu Najm, Stéphanie Labouille, Nicolas Drizard, Yann Gaston-Mathé, Brice Hoffmann, Véronique Stoven","doi":"10.1002/minf.202200216","DOIUrl":null,"url":null,"abstract":"<p><p>Identification of novel chemotypes with biological activity similar to a known active molecule is an important challenge in drug discovery called 'scaffold hopping'. Small-, medium-, and large-step scaffold hopping efforts may lead to increasing degrees of chemical structure novelty with respect to the parent compound. In the present paper, we focus on the problem of large-step scaffold hopping. We assembled a high quality and well characterized dataset of scaffold hopping examples comprising pairs of active molecules and including a variety of protein targets. This dataset was used to build a benchmark corresponding to the setting of real-life applications: one active molecule is known, and the second active is searched among a set of decoys chosen in a way to avoid statistical bias. This allowed us to evaluate the performance of computational methods for solving large-step scaffold hopping problems. In particular, we assessed how difficult these problems are, particularly for classical 2D and 3D ligand-based methods. We also showed that a machine-learning chemogenomic algorithm outperforms classical methods and we provided some useful hints for future improvements.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/minf.202200216","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}

引用次数: 0

Abstract

Identification of novel chemotypes with biological activity similar to a known active molecule is an important challenge in drug discovery called 'scaffold hopping'. Small-, medium-, and large-step scaffold hopping efforts may lead to increasing degrees of chemical structure novelty with respect to the parent compound. In the present paper, we focus on the problem of large-step scaffold hopping. We assembled a high quality and well characterized dataset of scaffold hopping examples comprising pairs of active molecules and including a variety of protein targets. This dataset was used to build a benchmark corresponding to the setting of real-life applications: one active molecule is known, and the second active is searched among a set of decoys chosen in a way to avoid statistical bias. This allowed us to evaluate the performance of computational methods for solving large-step scaffold hopping problems. In particular, we assessed how difficult these problems are, particularly for classical 2D and 3D ligand-based methods. We also showed that a machine-learning chemogenomic algorithm outperforms classical methods and we provided some useful hints for future improvements.

Abstract Image

查看原文本刊更多论文

探索同功能分子:一个基准的设计和预测性能的评估。

鉴定具有与已知活性分子相似生物活性的新化学型是药物发现中的一个重要挑战，称为“支架跳跃”。小、中、大台阶支架跳跃的努力可能导致相对于母体化合物的化学结构新颖性程度的增加。本文主要研究大台阶脚手架的跳跃问题。我们组装了一个高质量和特征良好的支架跳跃例子数据集，包括活性分子对，包括各种蛋白质靶点。该数据集用于建立与现实应用设置相对应的基准:已知一种活性分子，在一组以避免统计偏差的方式选择的诱饵中搜索第二种活性分子。这使我们能够评估求解大台阶脚手架跳跃问题的计算方法的性能。特别是，我们评估了这些问题的难度，特别是对于经典的基于二维和三维配体的方法。我们还展示了机器学习化学基因组算法优于经典方法，并为未来的改进提供了一些有用的提示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Molecular Informatics CHEMISTRY, MEDICINAL-MATHEMATICAL & COMPUTATIONAL BIOLOGY

CiteScore

7.30

自引率

2.80%

发文量

审稿时长

3 months

期刊介绍： Molecular Informatics is a peer-reviewed, international forum for publication of high-quality, interdisciplinary research on all molecular aspects of bio/cheminformatics and computer-assisted molecular design. Molecular Informatics succeeded QSAR & Combinatorial Science in 2010. Molecular Informatics presents methodological innovations that will lead to a deeper understanding of ligand-receptor interactions, macromolecular complexes, molecular networks, design concepts and processes that demonstrate how ideas and design concepts lead to molecules with a desired structure or function, preferably including experimental validation. The journal''s scope includes but is not limited to the fields of drug discovery and chemical biology, protein and nucleic acid engineering and design, the design of nanomolecular structures, strategies for modeling of macromolecular assemblies, molecular networks and systems, pharmaco- and chemogenomics, computer-assisted screening strategies, as well as novel technologies for the de novo design of biologically active molecules. As a unique feature Molecular Informatics publishes so-called "Methods Corner" review-type articles which feature important technological concepts and advances within the scope of the journal.