Yifei Yang, Runhan Shi, Zuchao Li, Shu Jiang, Bao-Liang Lu, Qibin Zhao, Yang Yang, Hai Zhao
{"title":"BatGPT-Chem:化学工程的基础大型模型。","authors":"Yifei Yang, Runhan Shi, Zuchao Li, Shu Jiang, Bao-Liang Lu, Qibin Zhao, Yang Yang, Hai Zhao","doi":"10.34133/research.0827","DOIUrl":null,"url":null,"abstract":"<p><p>Large language models (LLMs) have showcased remarkable capabilities in the realm of AI for Science, and chemistry has greatly benefited from the advancement of AI tools. With a strong capacity for learning sequential data like natural language, LLMs offer immense potential. Despite this promise, the application of LLMs in chemistry remains limited, with few models specifically designed for chemical data and tasks. Hence, we propose leveraging LLMs to comprehensively model both chemical sequences and natural language sequences, aiming to tackle diverse chemical tasks. We introduce BatGPT-Chem, a general foundation large-scale model with 15 billion parameters tailored for chemical engineering. Built on a corpus of over 100 million chemical instances, BatGPT-Chem specializes in 5 core tasks: retrosynthesis prediction, molecule design, molecule description, product inference, and yield prediction. BatGPT-Chem comprehensively models the information flow between chemical language and natural language, enabling full-spectrum prediction across chemical tasks. It is one of the largest bilingual chemistry-specific LLMs, supporting both English and Chinese for input and output. BatGPT-Chem is also the first automated retrosynthesis tool capable of explicitly predicting reaction conditions, a critical but often overlooked aspect in previous models. Through rigorous zero-shot evaluations, BatGPT-Chem demonstrates state-of-the-art performance, surpassing both existing chemical LLMs and general-purpose models in accuracy and validity across a diverse range of tasks. Notably, it demonstrates superior ability in predicting both reactants and reaction conditions, as well as strong generalization in low-data settings. These results suggest that BatGPT-Chem is among the most advanced and practical chemical LLMs, with strong potential to support real-world applications in synthesis planning, drug discovery, and materials design.</p>","PeriodicalId":21120,"journal":{"name":"Research","volume":"8 ","pages":"0827"},"PeriodicalIF":10.7000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12421729/pdf/","citationCount":"0","resultStr":"{\"title\":\"BatGPT-Chem: A Foundation Large Model for Chemical Engineering.\",\"authors\":\"Yifei Yang, Runhan Shi, Zuchao Li, Shu Jiang, Bao-Liang Lu, Qibin Zhao, Yang Yang, Hai Zhao\",\"doi\":\"10.34133/research.0827\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Large language models (LLMs) have showcased remarkable capabilities in the realm of AI for Science, and chemistry has greatly benefited from the advancement of AI tools. With a strong capacity for learning sequential data like natural language, LLMs offer immense potential. Despite this promise, the application of LLMs in chemistry remains limited, with few models specifically designed for chemical data and tasks. Hence, we propose leveraging LLMs to comprehensively model both chemical sequences and natural language sequences, aiming to tackle diverse chemical tasks. We introduce BatGPT-Chem, a general foundation large-scale model with 15 billion parameters tailored for chemical engineering. Built on a corpus of over 100 million chemical instances, BatGPT-Chem specializes in 5 core tasks: retrosynthesis prediction, molecule design, molecule description, product inference, and yield prediction. BatGPT-Chem comprehensively models the information flow between chemical language and natural language, enabling full-spectrum prediction across chemical tasks. It is one of the largest bilingual chemistry-specific LLMs, supporting both English and Chinese for input and output. BatGPT-Chem is also the first automated retrosynthesis tool capable of explicitly predicting reaction conditions, a critical but often overlooked aspect in previous models. Through rigorous zero-shot evaluations, BatGPT-Chem demonstrates state-of-the-art performance, surpassing both existing chemical LLMs and general-purpose models in accuracy and validity across a diverse range of tasks. Notably, it demonstrates superior ability in predicting both reactants and reaction conditions, as well as strong generalization in low-data settings. These results suggest that BatGPT-Chem is among the most advanced and practical chemical LLMs, with strong potential to support real-world applications in synthesis planning, drug discovery, and materials design.</p>\",\"PeriodicalId\":21120,\"journal\":{\"name\":\"Research\",\"volume\":\"8 \",\"pages\":\"0827\"},\"PeriodicalIF\":10.7000,\"publicationDate\":\"2025-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12421729/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.34133/research.0827\",\"RegionNum\":1,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Multidisciplinary\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.34133/research.0827","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
BatGPT-Chem: A Foundation Large Model for Chemical Engineering.
Large language models (LLMs) have showcased remarkable capabilities in the realm of AI for Science, and chemistry has greatly benefited from the advancement of AI tools. With a strong capacity for learning sequential data like natural language, LLMs offer immense potential. Despite this promise, the application of LLMs in chemistry remains limited, with few models specifically designed for chemical data and tasks. Hence, we propose leveraging LLMs to comprehensively model both chemical sequences and natural language sequences, aiming to tackle diverse chemical tasks. We introduce BatGPT-Chem, a general foundation large-scale model with 15 billion parameters tailored for chemical engineering. Built on a corpus of over 100 million chemical instances, BatGPT-Chem specializes in 5 core tasks: retrosynthesis prediction, molecule design, molecule description, product inference, and yield prediction. BatGPT-Chem comprehensively models the information flow between chemical language and natural language, enabling full-spectrum prediction across chemical tasks. It is one of the largest bilingual chemistry-specific LLMs, supporting both English and Chinese for input and output. BatGPT-Chem is also the first automated retrosynthesis tool capable of explicitly predicting reaction conditions, a critical but often overlooked aspect in previous models. Through rigorous zero-shot evaluations, BatGPT-Chem demonstrates state-of-the-art performance, surpassing both existing chemical LLMs and general-purpose models in accuracy and validity across a diverse range of tasks. Notably, it demonstrates superior ability in predicting both reactants and reaction conditions, as well as strong generalization in low-data settings. These results suggest that BatGPT-Chem is among the most advanced and practical chemical LLMs, with strong potential to support real-world applications in synthesis planning, drug discovery, and materials design.
期刊介绍:
Research serves as a global platform for academic exchange, collaboration, and technological advancements. This journal welcomes high-quality research contributions from any domain, with open arms to authors from around the globe.
Comprising fundamental research in the life and physical sciences, Research also highlights significant findings and issues in engineering and applied science. The journal proudly features original research articles, reviews, perspectives, and editorials, fostering a diverse and dynamic scholarly environment.