Joseph M. Cavanagh, Kunyang Sun, Andrew Gritsevskiy, Dorian Bagni, Thomas D. Bannister, Teresa Head-Gordon
{"title":"SmileyLlama:为定向化学空间探索修改大型语言模型","authors":"Joseph M. Cavanagh, Kunyang Sun, Andrew Gritsevskiy, Dorian Bagni, Thomas D. Bannister, Teresa Head-Gordon","doi":"arxiv-2409.02231","DOIUrl":null,"url":null,"abstract":"Here we show that a Large Language Model (LLM) can serve as a foundation\nmodel for a Chemical Language Model (CLM) which performs at or above the level\nof CLMs trained solely on chemical SMILES string data. Using supervised\nfine-tuning (SFT) and direct preference optimization (DPO) on the open-source\nLlama LLM, we demonstrate that we can train an LLM to respond to prompts such\nas generating molecules with properties of interest to drug development. This\noverall framework allows an LLM to not just be a chatbot client for chemistry\nand materials tasks, but can be adapted to speak more directly as a CLM which\ncan generate molecules with user-specified properties.","PeriodicalId":501304,"journal":{"name":"arXiv - PHYS - Chemical Physics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SmileyLlama: Modifying Large Language Models for Directed Chemical Space Exploration\",\"authors\":\"Joseph M. Cavanagh, Kunyang Sun, Andrew Gritsevskiy, Dorian Bagni, Thomas D. Bannister, Teresa Head-Gordon\",\"doi\":\"arxiv-2409.02231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Here we show that a Large Language Model (LLM) can serve as a foundation\\nmodel for a Chemical Language Model (CLM) which performs at or above the level\\nof CLMs trained solely on chemical SMILES string data. Using supervised\\nfine-tuning (SFT) and direct preference optimization (DPO) on the open-source\\nLlama LLM, we demonstrate that we can train an LLM to respond to prompts such\\nas generating molecules with properties of interest to drug development. This\\noverall framework allows an LLM to not just be a chatbot client for chemistry\\nand materials tasks, but can be adapted to speak more directly as a CLM which\\ncan generate molecules with user-specified properties.\",\"PeriodicalId\":501304,\"journal\":{\"name\":\"arXiv - PHYS - Chemical Physics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - PHYS - Chemical Physics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.02231\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Chemical Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SmileyLlama: Modifying Large Language Models for Directed Chemical Space Exploration
Here we show that a Large Language Model (LLM) can serve as a foundation
model for a Chemical Language Model (CLM) which performs at or above the level
of CLMs trained solely on chemical SMILES string data. Using supervised
fine-tuning (SFT) and direct preference optimization (DPO) on the open-source
Llama LLM, we demonstrate that we can train an LLM to respond to prompts such
as generating molecules with properties of interest to drug development. This
overall framework allows an LLM to not just be a chatbot client for chemistry
and materials tasks, but can be adapted to speak more directly as a CLM which
can generate molecules with user-specified properties.