Isaac Filella-Merce, Alexis Molina, Lucía Díaz, Marek Orzechowski, Yamina A Berchiche, Yang Ming Zhu, Júlia Vilalta-Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar
{"title":"通过将生成式人工智能与基于物理的主动学习框架相结合来优化药物设计。","authors":"Isaac Filella-Merce, Alexis Molina, Lucía Díaz, Marek Orzechowski, Yamina A Berchiche, Yang Ming Zhu, Júlia Vilalta-Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar","doi":"10.1038/s42004-025-01635-7","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.</p>","PeriodicalId":10529,"journal":{"name":"Communications Chemistry","volume":"8 1","pages":"238"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12334747/pdf/","citationCount":"0","resultStr":"{\"title\":\"Optimizing drug design by merging generative AI with a physics-based active learning framework.\",\"authors\":\"Isaac Filella-Merce, Alexis Molina, Lucía Díaz, Marek Orzechowski, Yamina A Berchiche, Yang Ming Zhu, Júlia Vilalta-Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar\",\"doi\":\"10.1038/s42004-025-01635-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.</p>\",\"PeriodicalId\":10529,\"journal\":{\"name\":\"Communications Chemistry\",\"volume\":\"8 1\",\"pages\":\"238\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12334747/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1038/s42004-025-01635-7\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1038/s42004-025-01635-7","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Optimizing drug design by merging generative AI with a physics-based active learning framework.
Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.
期刊介绍:
Communications Chemistry is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the chemical sciences. Research papers published by the journal represent significant advances bringing new chemical insight to a specialized area of research. We also aim to provide a community forum for issues of importance to all chemists, regardless of sub-discipline.