Quanliang Liu, Maciej P Polak, MD Al Amin Shuvo, Hrishikesh Shridhar Deodhar, Jeongsoo Han, Dane Morgan, Hyunseok Oh
{"title":"使用大型语言模型框架自动提取材料系统图","authors":"Quanliang Liu, Maciej P Polak, MD Al Amin Shuvo, Hrishikesh Shridhar Deodhar, Jeongsoo Han, Dane Morgan, Hyunseok Oh","doi":"10.1016/j.scriptamat.2025.116815","DOIUrl":null,"url":null,"abstract":"<div><div>A framework leveraging large language models (LLMs) is developed to systematically extract and organize Processing-Mechanism-Structure-Mechanism-Property (P-M-S-M-P) relationships from materials science and engineering literature, with a particular focus on metallurgy. Using multi-stage prompts, our method identifies key properties, microstructures, processing methods, and associated mechanisms, then integrates them to generate comprehensive materials system charts. Additionally, the framework refines the extracted system charts for visualization, enabling the creation of informative diagrams that capture essential insights from each paper. Evaluated across 70 papers spanning multiple alloy systems and research types, the approach achieves 94 % accuracy in mechanism extraction, 87 % in information source labeling, and 97 % in the human-machine readability index for processing, structure, and property entities. The prompts and codes are provided alongside guidelines for researchers unfamiliar with coding. This framework offers an effective methodology for knowledge extraction in materials science using the P-M-S-M-P framework.</div></div>","PeriodicalId":423,"journal":{"name":"Scripta Materialia","volume":"267 ","pages":"Article 116815"},"PeriodicalIF":5.3000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated extraction of materials system charts using a large language model framework\",\"authors\":\"Quanliang Liu, Maciej P Polak, MD Al Amin Shuvo, Hrishikesh Shridhar Deodhar, Jeongsoo Han, Dane Morgan, Hyunseok Oh\",\"doi\":\"10.1016/j.scriptamat.2025.116815\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A framework leveraging large language models (LLMs) is developed to systematically extract and organize Processing-Mechanism-Structure-Mechanism-Property (P-M-S-M-P) relationships from materials science and engineering literature, with a particular focus on metallurgy. Using multi-stage prompts, our method identifies key properties, microstructures, processing methods, and associated mechanisms, then integrates them to generate comprehensive materials system charts. Additionally, the framework refines the extracted system charts for visualization, enabling the creation of informative diagrams that capture essential insights from each paper. Evaluated across 70 papers spanning multiple alloy systems and research types, the approach achieves 94 % accuracy in mechanism extraction, 87 % in information source labeling, and 97 % in the human-machine readability index for processing, structure, and property entities. The prompts and codes are provided alongside guidelines for researchers unfamiliar with coding. This framework offers an effective methodology for knowledge extraction in materials science using the P-M-S-M-P framework.</div></div>\",\"PeriodicalId\":423,\"journal\":{\"name\":\"Scripta Materialia\",\"volume\":\"267 \",\"pages\":\"Article 116815\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scripta Materialia\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1359646225002787\",\"RegionNum\":2,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scripta Materialia","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1359646225002787","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
Automated extraction of materials system charts using a large language model framework
A framework leveraging large language models (LLMs) is developed to systematically extract and organize Processing-Mechanism-Structure-Mechanism-Property (P-M-S-M-P) relationships from materials science and engineering literature, with a particular focus on metallurgy. Using multi-stage prompts, our method identifies key properties, microstructures, processing methods, and associated mechanisms, then integrates them to generate comprehensive materials system charts. Additionally, the framework refines the extracted system charts for visualization, enabling the creation of informative diagrams that capture essential insights from each paper. Evaluated across 70 papers spanning multiple alloy systems and research types, the approach achieves 94 % accuracy in mechanism extraction, 87 % in information source labeling, and 97 % in the human-machine readability index for processing, structure, and property entities. The prompts and codes are provided alongside guidelines for researchers unfamiliar with coding. This framework offers an effective methodology for knowledge extraction in materials science using the P-M-S-M-P framework.
期刊介绍:
Scripta Materialia is a LETTERS journal of Acta Materialia, providing a forum for the rapid publication of short communications on the relationship between the structure and the properties of inorganic materials. The emphasis is on originality rather than incremental research. Short reports on the development of materials with novel or substantially improved properties are also welcomed. Emphasis is on either the functional or mechanical behavior of metals, ceramics and semiconductors at all length scales.