大型语言模型辅助材料开发:(氧)氢氧化物析氧反应催化剂的预测分析案例

IF 7.3 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Chenyang Wei, Yutong Shi, Wenbo Mu*, Hongyuan Zhang, Rui Qin, Yijun Yin, Gangqiang Yu* and Tiancheng Mu*, 
{"title":"大型语言模型辅助材料开发:(氧)氢氧化物析氧反应催化剂的预测分析案例","authors":"Chenyang Wei,&nbsp;Yutong Shi,&nbsp;Wenbo Mu*,&nbsp;Hongyuan Zhang,&nbsp;Rui Qin,&nbsp;Yijun Yin,&nbsp;Gangqiang Yu* and Tiancheng Mu*,&nbsp;","doi":"10.1021/acssuschemeng.5c0079810.1021/acssuschemeng.5c00798","DOIUrl":null,"url":null,"abstract":"<p >This study explores the transformative role of artificial intelligence (AI) and machine learning (ML) in materials science, leveraging large language models (LLMs) such as OpenAI’s ChatGPT. Focusing on (oxy)hydroxides as oxygen evolution reaction (OER) catalysts, we demonstrate how LLMs streamline data extraction, significantly reducing reliance on traditional, time-intensive methods. Using few-shot training and strategic prompting, ChatGPT achieved an extraction accuracy of approximately 0.9. The curated data set was then used to predict OER performance via the PyCaret library to evaluate various ML algorithms and a high-accuracy XGBoost regression model with accuracies above 0.9 is subsequently established. Further analysis using SHAP and Python Symbolic Regression (PySR) identified key descriptors-electrochemical double-layer capacitance, transition metal composition, support material, and d-electron count-as critical factors, consistent with established electrochemical principles. Additionally, SHAP’s extreme values for Cu and Zn suggest unconventional catalytic roles, potentially linked to Cu<sub>2</sub>O-facilitated NiOOH formation and Zn-induced electronic modulation, demonstrating the power of data-driven analysis in uncovering hidden mechanisms. To enhance literature-based insights, Microsoft’s GraphRAG technology was employed for in-depth chemical information retrieval. Overall, this study introduces an innovative, end-to-end ML framework powered by ChatGPT, promoting broader AI adoption in scientific research and bridging computational intelligence with experimental sciences.</p>","PeriodicalId":25,"journal":{"name":"ACS Sustainable Chemistry & Engineering","volume":"13 14","pages":"5368–5380 5368–5380"},"PeriodicalIF":7.3000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Language Models Assisted Materials Development: Case of Predictive Analytics for Oxygen Evolution Reaction Catalysts of (Oxy)hydroxides\",\"authors\":\"Chenyang Wei,&nbsp;Yutong Shi,&nbsp;Wenbo Mu*,&nbsp;Hongyuan Zhang,&nbsp;Rui Qin,&nbsp;Yijun Yin,&nbsp;Gangqiang Yu* and Tiancheng Mu*,&nbsp;\",\"doi\":\"10.1021/acssuschemeng.5c0079810.1021/acssuschemeng.5c00798\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >This study explores the transformative role of artificial intelligence (AI) and machine learning (ML) in materials science, leveraging large language models (LLMs) such as OpenAI’s ChatGPT. Focusing on (oxy)hydroxides as oxygen evolution reaction (OER) catalysts, we demonstrate how LLMs streamline data extraction, significantly reducing reliance on traditional, time-intensive methods. Using few-shot training and strategic prompting, ChatGPT achieved an extraction accuracy of approximately 0.9. The curated data set was then used to predict OER performance via the PyCaret library to evaluate various ML algorithms and a high-accuracy XGBoost regression model with accuracies above 0.9 is subsequently established. Further analysis using SHAP and Python Symbolic Regression (PySR) identified key descriptors-electrochemical double-layer capacitance, transition metal composition, support material, and d-electron count-as critical factors, consistent with established electrochemical principles. Additionally, SHAP’s extreme values for Cu and Zn suggest unconventional catalytic roles, potentially linked to Cu<sub>2</sub>O-facilitated NiOOH formation and Zn-induced electronic modulation, demonstrating the power of data-driven analysis in uncovering hidden mechanisms. To enhance literature-based insights, Microsoft’s GraphRAG technology was employed for in-depth chemical information retrieval. Overall, this study introduces an innovative, end-to-end ML framework powered by ChatGPT, promoting broader AI adoption in scientific research and bridging computational intelligence with experimental sciences.</p>\",\"PeriodicalId\":25,\"journal\":{\"name\":\"ACS Sustainable Chemistry & Engineering\",\"volume\":\"13 14\",\"pages\":\"5368–5380 5368–5380\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Sustainable Chemistry & Engineering\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acssuschemeng.5c00798\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Sustainable Chemistry & Engineering","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssuschemeng.5c00798","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

本研究利用 OpenAI 的 ChatGPT 等大型语言模型 (LLM),探索人工智能 (AI) 和机器学习 (ML) 在材料科学中的变革性作用。以作为氧进化反应(OER)催化剂的(氧)氢氧化物为重点,我们展示了 LLM 如何简化数据提取,大大减少对传统的时间密集型方法的依赖。通过少量训练和策略性提示,ChatGPT 实现了约 0.9 的提取准确率。随后,通过 PyCaret 库,将策划好的数据集用于预测 OER 性能,以评估各种 ML 算法,随后建立了一个准确率高于 0.9 的高精度 XGBoost 回归模型。使用 SHAP 和 Python 符号回归 (PySR) 进行的进一步分析确定了关键描述符--电化学双层电容、过渡金属成分、支持材料和 d 电子计数--是关键因素,这与既定的电化学原理是一致的。此外,SHAP 中 Cu 和 Zn 的极值表明,它们具有非常规的催化作用,可能与 Cu2O 促进 NiOOH 形成和 Zn 诱导的电子调制有关,这证明了数据驱动分析在揭示隐藏机制方面的威力。为了增强基于文献的洞察力,我们采用了微软的 GraphRAG 技术来进行深入的化学信息检索。总之,这项研究引入了一个由 ChatGPT 支持的创新型端到端 ML 框架,促进了人工智能在科学研究中的广泛应用,并将计算智能与实验科学连接起来。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Large Language Models Assisted Materials Development: Case of Predictive Analytics for Oxygen Evolution Reaction Catalysts of (Oxy)hydroxides

Large Language Models Assisted Materials Development: Case of Predictive Analytics for Oxygen Evolution Reaction Catalysts of (Oxy)hydroxides

This study explores the transformative role of artificial intelligence (AI) and machine learning (ML) in materials science, leveraging large language models (LLMs) such as OpenAI’s ChatGPT. Focusing on (oxy)hydroxides as oxygen evolution reaction (OER) catalysts, we demonstrate how LLMs streamline data extraction, significantly reducing reliance on traditional, time-intensive methods. Using few-shot training and strategic prompting, ChatGPT achieved an extraction accuracy of approximately 0.9. The curated data set was then used to predict OER performance via the PyCaret library to evaluate various ML algorithms and a high-accuracy XGBoost regression model with accuracies above 0.9 is subsequently established. Further analysis using SHAP and Python Symbolic Regression (PySR) identified key descriptors-electrochemical double-layer capacitance, transition metal composition, support material, and d-electron count-as critical factors, consistent with established electrochemical principles. Additionally, SHAP’s extreme values for Cu and Zn suggest unconventional catalytic roles, potentially linked to Cu2O-facilitated NiOOH formation and Zn-induced electronic modulation, demonstrating the power of data-driven analysis in uncovering hidden mechanisms. To enhance literature-based insights, Microsoft’s GraphRAG technology was employed for in-depth chemical information retrieval. Overall, this study introduces an innovative, end-to-end ML framework powered by ChatGPT, promoting broader AI adoption in scientific research and bridging computational intelligence with experimental sciences.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ACS Sustainable Chemistry & Engineering
ACS Sustainable Chemistry & Engineering CHEMISTRY, MULTIDISCIPLINARY-ENGINEERING, CHEMICAL
CiteScore
13.80
自引率
4.80%
发文量
1470
审稿时长
1.7 months
期刊介绍: ACS Sustainable Chemistry & Engineering is a prestigious weekly peer-reviewed scientific journal published by the American Chemical Society. Dedicated to advancing the principles of green chemistry and green engineering, it covers a wide array of research topics including green chemistry, green engineering, biomass, alternative energy, and life cycle assessment. The journal welcomes submissions in various formats, including Letters, Articles, Features, and Perspectives (Reviews), that address the challenges of sustainability in the chemical enterprise and contribute to the advancement of sustainable practices. Join us in shaping the future of sustainable chemistry and engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信