Xiting Peng , Yi Shen Tew , Kai Zhao , Chi Wang , Ren'ai Li , Shanying Hu , Xiaonan Wang
{"title":"通过大型语言模型驱动框架和交互式AI代理解锁深度共晶溶剂知识","authors":"Xiting Peng , Yi Shen Tew , Kai Zhao , Chi Wang , Ren'ai Li , Shanying Hu , Xiaonan Wang","doi":"10.1016/j.gce.2025.05.006","DOIUrl":null,"url":null,"abstract":"<div><div>Artificial intelligence (AI) is playing an important role in advancing green chemical engineering, while the lack of data remains a primary challenge in many fields. Deep eutectic solvents (DESs) are a promising alternative to traditional organic solvents. However, the exploration of new DES formulations has long been constrained by trial-and-error research methods, a preference for familiar formulations, and a lack of easily accessible DES databases. This study proposes a framework driven by large language models (LLMs) for accurately and efficiently extracting data in the DES field, accelerating knowledge discovery. By coordinating LLMs and tools through predefined code paths, we extracted 34,027 data records and 9,215 unique DES formulations from 14,602 research articles, achieving an accuracy of over 90%, thereby creating a comprehensive domain knowledge base. An LLM-driven interactive agent has been deployed on an online platform, further facilitating access to this structured data and enabling researchers to overcome data limitations and accelerate the discovery of new DES formulations.</div></div>","PeriodicalId":66474,"journal":{"name":"Green Chemical Engineering","volume":"6 4","pages":"Pages 572-581"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unlocking deep eutectic solvent knowledge through a large language model-driven framework and an interactive AI agent\",\"authors\":\"Xiting Peng , Yi Shen Tew , Kai Zhao , Chi Wang , Ren'ai Li , Shanying Hu , Xiaonan Wang\",\"doi\":\"10.1016/j.gce.2025.05.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Artificial intelligence (AI) is playing an important role in advancing green chemical engineering, while the lack of data remains a primary challenge in many fields. Deep eutectic solvents (DESs) are a promising alternative to traditional organic solvents. However, the exploration of new DES formulations has long been constrained by trial-and-error research methods, a preference for familiar formulations, and a lack of easily accessible DES databases. This study proposes a framework driven by large language models (LLMs) for accurately and efficiently extracting data in the DES field, accelerating knowledge discovery. By coordinating LLMs and tools through predefined code paths, we extracted 34,027 data records and 9,215 unique DES formulations from 14,602 research articles, achieving an accuracy of over 90%, thereby creating a comprehensive domain knowledge base. An LLM-driven interactive agent has been deployed on an online platform, further facilitating access to this structured data and enabling researchers to overcome data limitations and accelerate the discovery of new DES formulations.</div></div>\",\"PeriodicalId\":66474,\"journal\":{\"name\":\"Green Chemical Engineering\",\"volume\":\"6 4\",\"pages\":\"Pages 572-581\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Green Chemical Engineering\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666952825000433\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Green Chemical Engineering","FirstCategoryId":"1089","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666952825000433","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Unlocking deep eutectic solvent knowledge through a large language model-driven framework and an interactive AI agent
Artificial intelligence (AI) is playing an important role in advancing green chemical engineering, while the lack of data remains a primary challenge in many fields. Deep eutectic solvents (DESs) are a promising alternative to traditional organic solvents. However, the exploration of new DES formulations has long been constrained by trial-and-error research methods, a preference for familiar formulations, and a lack of easily accessible DES databases. This study proposes a framework driven by large language models (LLMs) for accurately and efficiently extracting data in the DES field, accelerating knowledge discovery. By coordinating LLMs and tools through predefined code paths, we extracted 34,027 data records and 9,215 unique DES formulations from 14,602 research articles, achieving an accuracy of over 90%, thereby creating a comprehensive domain knowledge base. An LLM-driven interactive agent has been deployed on an online platform, further facilitating access to this structured data and enabling researchers to overcome data limitations and accelerate the discovery of new DES formulations.