{"title":"Directly Optimizing for Synthesizability in Generative Molecular Design using Retrosynthesis Models","authors":"Jeff Guo, Philippe Schwaller","doi":"arxiv-2407.12186","DOIUrl":null,"url":null,"abstract":"Synthesizability in generative molecular design remains a pressing challenge.\nExisting methods to assess synthesizability span heuristics-based methods,\nretrosynthesis models, and synthesizability-constrained molecular generation.\nThe latter has become increasingly prevalent and proceeds by defining a set of\npermitted actions a model can take when generating molecules, such that all\ngenerations are anchored in \"synthetically-feasible\" chemical transformations.\nTo date, retrosynthesis models have been mostly used as a post-hoc filtering\ntool as their inference cost remains prohibitive to use directly in an\noptimization loop. In this work, we show that with a sufficiently\nsample-efficient generative model, it is straightforward to directly optimize\nfor synthesizability using retrosynthesis models in goal-directed generation.\nUnder a heavily-constrained computational budget, our model can generate\nmolecules satisfying a multi-parameter drug discovery optimization task while\nbeing synthesizable, as deemed by the retrosynthesis model.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"26 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.12186","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Synthesizability in generative molecular design remains a pressing challenge.
Existing methods to assess synthesizability span heuristics-based methods,
retrosynthesis models, and synthesizability-constrained molecular generation.
The latter has become increasingly prevalent and proceeds by defining a set of
permitted actions a model can take when generating molecules, such that all
generations are anchored in "synthetically-feasible" chemical transformations.
To date, retrosynthesis models have been mostly used as a post-hoc filtering
tool as their inference cost remains prohibitive to use directly in an
optimization loop. In this work, we show that with a sufficiently
sample-efficient generative model, it is straightforward to directly optimize
for synthesizability using retrosynthesis models in goal-directed generation.
Under a heavily-constrained computational budget, our model can generate
molecules satisfying a multi-parameter drug discovery optimization task while
being synthesizable, as deemed by the retrosynthesis model.