{"title":"Empowering Chemistry Experts with Large Language Models for Literature Interpretation in Single-Atom Catalysis Toward Advanced Oxidation.","authors":"Jing-Hang Wu,Ran Shi,Xiao Zhou,Liang Zhang,Kong Chen,Han-Qing Yu,Yuen Wu","doi":"10.1002/anie.202520525","DOIUrl":null,"url":null,"abstract":"Large language models (LLMs) hold considerable promise for large-scale data extraction from scientific literatures for catalyst design and practical optimization. Yet, turning such outputs into reliable, formalized chemical knowledge would heavily rely on domain expertise rather than end-to-end automation. Herein, we present a human-in-the-loop workflow integrating LLM-facilitated structured data extraction with iterative, expert-guided curation and analysis. As a proof of concept, we take single-atom catalysts (SACs) for advanced oxidation processes (AOPs) as an example, enabling efficient data extraction, rigorous curation, and statistically driven interpretation. Thus, we uncover the key correlations among metal types, coordination environments, reaction substances, and catalytic performance, providing deeper mechanism insights into SAC-driven AOPs. In contrast to fully automated, end-to-end models, our approach relies on human-driven optimization at multiple stages, and underscores human insight as central to understand LLM outputs. By introducing human-driven prompt refinement, model comparison, and expert-led analysis, our method ensures that human cognition remains central to interpreting LLM outputs and converting structured data into reliable scientific knowledge. Our work addresses the limitations inherent in fully automated, end-to-end methodologies and effectively bridges the gap between structured outputs and catalytically meaningful insights.","PeriodicalId":125,"journal":{"name":"Angewandte Chemie International Edition","volume":"138 1","pages":"e202520525"},"PeriodicalIF":16.9000,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Angewandte Chemie International Edition","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1002/anie.202520525","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Large language models (LLMs) hold considerable promise for large-scale data extraction from scientific literatures for catalyst design and practical optimization. Yet, turning such outputs into reliable, formalized chemical knowledge would heavily rely on domain expertise rather than end-to-end automation. Herein, we present a human-in-the-loop workflow integrating LLM-facilitated structured data extraction with iterative, expert-guided curation and analysis. As a proof of concept, we take single-atom catalysts (SACs) for advanced oxidation processes (AOPs) as an example, enabling efficient data extraction, rigorous curation, and statistically driven interpretation. Thus, we uncover the key correlations among metal types, coordination environments, reaction substances, and catalytic performance, providing deeper mechanism insights into SAC-driven AOPs. In contrast to fully automated, end-to-end models, our approach relies on human-driven optimization at multiple stages, and underscores human insight as central to understand LLM outputs. By introducing human-driven prompt refinement, model comparison, and expert-led analysis, our method ensures that human cognition remains central to interpreting LLM outputs and converting structured data into reliable scientific knowledge. Our work addresses the limitations inherent in fully automated, end-to-end methodologies and effectively bridges the gap between structured outputs and catalytically meaningful insights.
期刊介绍:
Angewandte Chemie, a journal of the German Chemical Society (GDCh), maintains a leading position among scholarly journals in general chemistry with an impressive Impact Factor of 16.6 (2022 Journal Citation Reports, Clarivate, 2023). Published weekly in a reader-friendly format, it features new articles almost every day. Established in 1887, Angewandte Chemie is a prominent chemistry journal, offering a dynamic blend of Review-type articles, Highlights, Communications, and Research Articles on a weekly basis, making it unique in the field.