Interpretable machine learning modeling of alloy composition-process-property relationships based on industrial big data in hot strip rolling

IF 6.1 1区 工程技术 Q1 ENGINEERING, MANUFACTURING
Jingdong Li , Xiaochen Wang , Fengxia Li , Yamin Sun , Youzhao Sun , Quan Yang , Xiangchen Wang
{"title":"Interpretable machine learning modeling of alloy composition-process-property relationships based on industrial big data in hot strip rolling","authors":"Jingdong Li ,&nbsp;Xiaochen Wang ,&nbsp;Fengxia Li ,&nbsp;Yamin Sun ,&nbsp;Youzhao Sun ,&nbsp;Quan Yang ,&nbsp;Xiangchen Wang","doi":"10.1016/j.jmapro.2025.06.059","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate mapping of the composition, process, and property relationship is essential for online predicting and controlling mechanical properties in hot-rolled alloy steel. However, this remains a challenge due to persistent data silos in hot strip rolling (HSR) and the limited interpretability of “black box” machine learning (ML) models in capturing complex multivariable interactions. This study developed a four-layer industrial digital twin platform to integrate multisource heterogeneous data into a unified dataset, including composition, process parameters and properties. A dataset reconstruction strategy was introduced to address the challenges posed by large-scale, nonlinear, and noise-prone data. Based on the reconstructed inputs, interpretable ML models were established to characterize the underlying composition-process-property relationships accurately. The light gradient boosting machine (LGBM) model, optimized using particle swarm optimization, achieved superior performance with an R<sup>2</sup> of 0.80 and a mean absolute error of 10.02 MPa on the test set. Shapley additive explanations and partial dependence plot analyses further revealed the combined effects of alloying elements, rolling temperature, and deformation on mechanical behavior. The proposed framework was successfully implemented on a 1422 mm HSR production line, providing real-time guidance for alloy design and reducing reliance on manual sampling.</div></div>","PeriodicalId":16148,"journal":{"name":"Journal of Manufacturing Processes","volume":"150 ","pages":"Pages 299-316"},"PeriodicalIF":6.1000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Processes","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1526612525007170","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate mapping of the composition, process, and property relationship is essential for online predicting and controlling mechanical properties in hot-rolled alloy steel. However, this remains a challenge due to persistent data silos in hot strip rolling (HSR) and the limited interpretability of “black box” machine learning (ML) models in capturing complex multivariable interactions. This study developed a four-layer industrial digital twin platform to integrate multisource heterogeneous data into a unified dataset, including composition, process parameters and properties. A dataset reconstruction strategy was introduced to address the challenges posed by large-scale, nonlinear, and noise-prone data. Based on the reconstructed inputs, interpretable ML models were established to characterize the underlying composition-process-property relationships accurately. The light gradient boosting machine (LGBM) model, optimized using particle swarm optimization, achieved superior performance with an R2 of 0.80 and a mean absolute error of 10.02 MPa on the test set. Shapley additive explanations and partial dependence plot analyses further revealed the combined effects of alloying elements, rolling temperature, and deformation on mechanical behavior. The proposed framework was successfully implemented on a 1422 mm HSR production line, providing real-time guidance for alloy design and reducing reliance on manual sampling.
基于工业大数据的热轧合金成分-工艺-性能关系的可解释机器学习建模
准确绘制热轧合金钢的成分、工艺和性能关系,是在线预测和控制热轧合金钢力学性能的关键。然而,这仍然是一个挑战,因为热轧(HSR)中持续存在的数据孤岛,以及“黑匣子”机器学习(ML)模型在捕获复杂的多变量相互作用时的有限可解释性。本研究开发了一个四层工业数字孪生平台,将多源异构数据集成为一个统一的数据集,包括成分、工艺参数和属性。引入了一种数据集重建策略,以解决大规模、非线性和易受噪声影响的数据所带来的挑战。基于重构的输入,建立了可解释的机器学习模型,以准确地描述潜在的组合-过程-属性关系。采用粒子群优化方法优化的光梯度增强机(LGBM)模型在测试集上的R2为0.80,平均绝对误差为10.02 MPa,取得了较好的性能。Shapley加性解释和部分依赖图分析进一步揭示了合金元素、轧制温度和变形对力学行为的综合影响。该框架在1422 mm高铁生产线上成功实施,为合金设计提供了实时指导,减少了对人工采样的依赖。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Manufacturing Processes
Journal of Manufacturing Processes ENGINEERING, MANUFACTURING-
CiteScore
10.20
自引率
11.30%
发文量
833
审稿时长
50 days
期刊介绍: The aim of the Journal of Manufacturing Processes (JMP) is to exchange current and future directions of manufacturing processes research, development and implementation, and to publish archival scholarly literature with a view to advancing state-of-the-art manufacturing processes and encouraging innovation for developing new and efficient processes. The journal will also publish from other research communities for rapid communication of innovative new concepts. Special-topic issues on emerging technologies and invited papers will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信