Steel design based on a large language model

IF 8.3 1区 材料科学 Q1 MATERIALS SCIENCE, MULTIDISCIPLINARY
Shaohan Tian, Xue Jiang, Weiren Wang, Zhihua Jing, Chi Zhang, Cheng Zhang, Turab Lookman, Yanjing Su
{"title":"Steel design based on a large language model","authors":"Shaohan Tian, Xue Jiang, Weiren Wang, Zhihua Jing, Chi Zhang, Cheng Zhang, Turab Lookman, Yanjing Su","doi":"10.1016/j.actamat.2024.120663","DOIUrl":null,"url":null,"abstract":"The success of artificial intelligence (AI) in materials research heavily relies on the integrity of structured data and the construction of precise descriptors. In this study, we present an end-to-end pipeline from materials text to properties for steels based on a large language model. The objective is to enable quantitative predictions of properties with high-accuracy and explore new steels. The pipeline includes a materials language encoder, named SteelBERT, and a multimodal deep learning framework that maps the composition and text sequence of complex fabrication processes to mechanical properties. We demonstrate high accuracy on mechanical properties, including yield strength (YS), ultimate tensile strength (UTS), and elongation (EL) by predicting determination coefficients (R<sup>2</sup>) reaching 78.17% (±3.40%), 82.56% (±1.96%), and 81.44% (±2.98%) respectively. Further, through an additional fine-tuning strategy for the design of specific steels with small datasets, we show how the performance can be refined. With only 64 experimental samples of 15Cr austenitic stainless steels, we obtain an optimized model with R<sup>2</sup> of 89.85% (±6.17%), 88.34% (±5.95%) and 87.24% (±5.15%) for YS, UTS and EL, that requires the user to input composition and text sequence for processing and which outputs mechanical properties. The model efficiently optimizes the text sequence for the fabrication process by suggesting a secondary round of cold rolling and tempering to yield an exceptional YS of 960 MPa, UTS of 1138 MPa, and EL of 32.5%, exceeding those of reported 15Cr austenitic stainless steels.","PeriodicalId":238,"journal":{"name":"Acta Materialia","volume":"30 1","pages":""},"PeriodicalIF":8.3000,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Materialia","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1016/j.actamat.2024.120663","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The success of artificial intelligence (AI) in materials research heavily relies on the integrity of structured data and the construction of precise descriptors. In this study, we present an end-to-end pipeline from materials text to properties for steels based on a large language model. The objective is to enable quantitative predictions of properties with high-accuracy and explore new steels. The pipeline includes a materials language encoder, named SteelBERT, and a multimodal deep learning framework that maps the composition and text sequence of complex fabrication processes to mechanical properties. We demonstrate high accuracy on mechanical properties, including yield strength (YS), ultimate tensile strength (UTS), and elongation (EL) by predicting determination coefficients (R2) reaching 78.17% (±3.40%), 82.56% (±1.96%), and 81.44% (±2.98%) respectively. Further, through an additional fine-tuning strategy for the design of specific steels with small datasets, we show how the performance can be refined. With only 64 experimental samples of 15Cr austenitic stainless steels, we obtain an optimized model with R2 of 89.85% (±6.17%), 88.34% (±5.95%) and 87.24% (±5.15%) for YS, UTS and EL, that requires the user to input composition and text sequence for processing and which outputs mechanical properties. The model efficiently optimizes the text sequence for the fabrication process by suggesting a secondary round of cold rolling and tempering to yield an exceptional YS of 960 MPa, UTS of 1138 MPa, and EL of 32.5%, exceeding those of reported 15Cr austenitic stainless steels.

Abstract Image

基于大型语言模型的钢结构设计
人工智能(AI)在材料研究领域的成功在很大程度上依赖于结构化数据的完整性和精确描述符的构建。在本研究中,我们基于大型语言模型,提出了从材料文本到钢材属性的端到端管道。其目的是实现高精度的属性定量预测,并探索新的钢材。该管道包括一个名为 SteelBERT 的材料语言编码器和一个多模态深度学习框架,后者可将复杂制造工艺的成分和文本序列映射到机械性能。我们通过预测确定系数 (R2) 分别达到 78.17% (±3.40%)、82.56% (±1.96%) 和 81.44% (±2.98%),证明了机械性能的高准确性,包括屈服强度 (YS)、极限拉伸强度 (UTS) 和伸长率 (EL)。此外,我们还通过附加的微调策略,利用小数据集设计特定钢材,展示了如何改进性能。仅使用 64 个 15Cr 奥氏体不锈钢实验样本,我们就获得了一个优化模型,其 YS、UTS 和 EL 的 R2 分别为 89.85% (±6.17%)、88.34% (±5.95%) 和 87.24% (±5.15%)。该模型通过建议进行第二轮冷轧和回火,有效优化了制造过程中的文字顺序,从而获得了优异的 YS 值(960 兆帕)、UTS 值(1138 兆帕)和 EL 值(32.5%),超过了已报道的 15Cr 奥氏体不锈钢。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Acta Materialia
Acta Materialia 工程技术-材料科学:综合
CiteScore
16.10
自引率
8.50%
发文量
801
审稿时长
53 days
期刊介绍: Acta Materialia serves as a platform for publishing full-length, original papers and commissioned overviews that contribute to a profound understanding of the correlation between the processing, structure, and properties of inorganic materials. The journal seeks papers with high impact potential or those that significantly propel the field forward. The scope includes the atomic and molecular arrangements, chemical and electronic structures, and microstructure of materials, focusing on their mechanical or functional behavior across all length scales, including nanostructures.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信