通过将生成式人工智能与基于物理的主动学习框架相结合来优化药物设计。

IF 6.2 2区化学 Q1 CHEMISTRY, MULTIDISCIPLINARY

Communications Chemistry Pub Date : 2025-08-08 DOI:10.1038/s42004-025-01635-7

Isaac Filella-Merce, Alexis Molina, Lucía Díaz, Marek Orzechowski, Yamina A Berchiche, Yang Ming Zhu, Júlia Vilalta-Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar

{"title":"通过将生成式人工智能与基于物理的主动学习框架相结合来优化药物设计。","authors":"Isaac Filella-Merce, Alexis Molina, Lucía Díaz, Marek Orzechowski, Yamina A Berchiche, Yang Ming Zhu, Júlia Vilalta-Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar","doi":"10.1038/s42004-025-01635-7","DOIUrl":null,"url":null,"abstract":"Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.","PeriodicalId":10529,"journal":{"name":"Communications Chemistry","volume":"8 1","pages":"238"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12334747/pdf/","citationCount":"0","resultStr":"{\"title\":\"Optimizing drug design by merging generative AI with a physics-based active learning framework.\",\"authors\":\"Isaac Filella-Merce, Alexis Molina, Lucía Díaz, Marek Orzechowski, Yamina A Berchiche, Yang Ming Zhu, Júlia Vilalta-Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar\",\"doi\":\"10.1038/s42004-025-01635-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.\",\"PeriodicalId\":10529,\"journal\":{\"name\":\"Communications Chemistry\",\"volume\":\"8 1\",\"pages\":\"238\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12334747/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1038/s42004-025-01635-7\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1038/s42004-025-01635-7","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

机器学习正在改变药物发现，生成模型（GMs）因其设计具有特定性质的分子的能力而受到关注。然而，gm经常在目标用户粘性、综合可访问性或泛化方面挣扎。为了解决这些问题，我们开发了一个GM工作流，将变分自动编码器与两个嵌套的主动学习周期集成在一起。这些迭代改进他们的预测使用化学信息学和分子模型预测。我们在CDK2和KRAS两种系统上测试了我们的工作流程，成功地生成了具有高预测亲和力和合成可及性的多种药物样分子。值得注意的是，我们为每个目标生成了不同于已知支架的新支架。对于CDK2，我们合成了9个具有8个体外活性的分子，其中一个具有纳摩尔效价。对于KRAS，经CDK2检测验证的硅方法鉴定出4个具有潜在活性的分子。这些发现表明，我们的转基因工作流程有能力探索针对特定靶点量身定制的新型化学空间，从而为药物发现开辟了新的途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optimizing drug design by merging generative AI with a physics-based active learning framework.

Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Communications Chemistry Chemistry-General Chemistry

CiteScore

7.70

自引率

1.70%

发文量

146

审稿时长

13 weeks

期刊介绍： Communications Chemistry is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the chemical sciences. Research papers published by the journal represent significant advances bringing new chemical insight to a specialized area of research. We also aim to provide a community forum for issues of importance to all chemists, regardless of sub-discipline.