Amina Mollaysa, Heath Arthur-Loui, Michael Krauthammer
{"title":"整合潜变量和自回归模型,实现目标导向的分子生成","authors":"Amina Mollaysa, Heath Arthur-Loui, Michael Krauthammer","doi":"arxiv-2409.00046","DOIUrl":null,"url":null,"abstract":"De novo molecule design has become a highly active research area, advanced\nsignificantly through the use of state-of-the-art generative models. Despite\nthese advances, several fundamental questions remain unanswered as the field\nincreasingly focuses on more complex generative models and sophisticated\nmolecular representations as an answer to the challenges of drug design. In\nthis paper, we return to the simplest representation of molecules, and\ninvestigate overlooked limitations of classical generative approaches,\nparticularly Variational Autoencoders (VAEs) and auto-regressive models. We\npropose a hybrid model in the form of a novel regularizer that leverages the\nstrengths of both to improve validity, conditional generation, and style\ntransfer of molecular sequences. Additionally, we provide an in depth\ndiscussion of overlooked assumptions of these models' behaviour.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating Latent Variable and Auto-Regressive Models for Goal-directed Molecule Generation\",\"authors\":\"Amina Mollaysa, Heath Arthur-Loui, Michael Krauthammer\",\"doi\":\"arxiv-2409.00046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"De novo molecule design has become a highly active research area, advanced\\nsignificantly through the use of state-of-the-art generative models. Despite\\nthese advances, several fundamental questions remain unanswered as the field\\nincreasingly focuses on more complex generative models and sophisticated\\nmolecular representations as an answer to the challenges of drug design. In\\nthis paper, we return to the simplest representation of molecules, and\\ninvestigate overlooked limitations of classical generative approaches,\\nparticularly Variational Autoencoders (VAEs) and auto-regressive models. We\\npropose a hybrid model in the form of a novel regularizer that leverages the\\nstrengths of both to improve validity, conditional generation, and style\\ntransfer of molecular sequences. Additionally, we provide an in depth\\ndiscussion of overlooked assumptions of these models' behaviour.\",\"PeriodicalId\":501022,\"journal\":{\"name\":\"arXiv - QuanBio - Biomolecules\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Biomolecules\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.00046\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.00046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Integrating Latent Variable and Auto-Regressive Models for Goal-directed Molecule Generation
De novo molecule design has become a highly active research area, advanced
significantly through the use of state-of-the-art generative models. Despite
these advances, several fundamental questions remain unanswered as the field
increasingly focuses on more complex generative models and sophisticated
molecular representations as an answer to the challenges of drug design. In
this paper, we return to the simplest representation of molecules, and
investigate overlooked limitations of classical generative approaches,
particularly Variational Autoencoders (VAEs) and auto-regressive models. We
propose a hybrid model in the form of a novel regularizer that leverages the
strengths of both to improve validity, conditional generation, and style
transfer of molecular sequences. Additionally, we provide an in depth
discussion of overlooked assumptions of these models' behaviour.