{"title":"Prediction of Tandem Cold-Rolled Strip Flatness Based on the BiGRU-Attention-iTransformer Model","authors":"Ming-hua Liu, Ya-han Li, Da-yuan Wu, Zi-xuan Zhu","doi":"10.1007/s11837-025-07456-2","DOIUrl":null,"url":null,"abstract":"<div><p>Cold rolling strip production is a multi-stand continuous rolling process, so flatness prediction is a typical spatiotemporal series data prediction problem, which requires considering various complex factors affecting flatness and paying attention to the correlation of its spatiotemporal dimensions. Based on this, a cold rolled strip flatness prediction model is proposed, integrating a Bidirectional Gated Recurrent Unit (BiGRU), an Attention mechanism, and an Inverted Transformer (iTransformer). The model adopts a parallel structure, where one branch utilizes a BiGRU-Attention module designed to capture the spatiotemporal correlations in strip production data, with the BiGRU’s hidden layer dimension set to 128; the other branch employs an iTransformer module with a feature dimension of 256 and 8 attention heads to effectively extract key features and model the relationships between parameters using the self-attention mechanism. The features extracted from both branches are fused into a 128-dimensional vector, which is then passed through a fully connected layer for flatness prediction. The prediction results show that the error indicators MSE, RMSE and MAE of the proposed model are 0.937, 0.968 and 0.774 respectively, and the fitting performance indicator <span>\\(R^{2}\\)</span> is 0.974, which are better than the comparison models Random Forest (RF), Deep Neural Network (DNN), Long Short-Term Memory (LSTM), BiGRU, iTransformer, and BiGRU-iTransformer models. Feature ablation experiments show that for flatness, the importance of parameters is ranked as follows: forward tension (Tf), roll gap difference (Gd), work roll bending force (WR), exit thickness (Hb), rolling force (<span>\\(F\\)</span>), intermediate roll bending force (IR), strip yield strength (<span>\\(Y\\)</span>), entrance thickness (Hf), rolling speed (<span>\\(V\\)</span>), backward tension (Tb), strip width (<span>\\(B\\)</span>). The experimental results are consistent with expert knowledge and underlying mechanisms, verifying the effectiveness of the attention mechanism while enhancing interpretability and credibility.</p></div>","PeriodicalId":605,"journal":{"name":"JOM","volume":"77 8","pages":"6245 - 6259"},"PeriodicalIF":2.3000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JOM","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s11837-025-07456-2","RegionNum":4,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
Abstract
Cold-rolled strip production is a multi-stand continuous rolling process, so flatness prediction is a typical spatiotemporal series prediction problem: it must account for the many complex factors that affect flatness and for the correlations across the spatial and temporal dimensions. On this basis, a cold-rolled strip flatness prediction model is proposed that integrates a Bidirectional Gated Recurrent Unit (BiGRU), an attention mechanism, and an Inverted Transformer (iTransformer). The model adopts a parallel structure: one branch uses a BiGRU-Attention module, with the BiGRU's hidden layer dimension set to 128, to capture the spatiotemporal correlations in strip production data; the other branch uses an iTransformer module, with a feature dimension of 256 and 8 attention heads, to extract key features and model the relationships between parameters via self-attention. The features from both branches are fused into a 128-dimensional vector, which is passed through a fully connected layer for flatness prediction. The prediction results show that the error indicators MSE, RMSE, and MAE of the proposed model are 0.937, 0.968, and 0.774, respectively, and the fitting indicator \(R^{2}\) is 0.974, outperforming the comparison models: Random Forest (RF), Deep Neural Network (DNN), Long Short-Term Memory (LSTM), BiGRU, iTransformer, and BiGRU-iTransformer. Feature ablation experiments show that, for flatness, the parameters rank in importance as follows: forward tension (Tf), roll gap difference (Gd), work roll bending force (WR), exit thickness (Hb), rolling force (\(F\)), intermediate roll bending force (IR), strip yield strength (\(Y\)), entrance thickness (Hf), rolling speed (\(V\)), backward tension (Tb), and strip width (\(B\)). The experimental results are consistent with expert knowledge and the underlying rolling mechanisms, verifying the effectiveness of the attention mechanism while enhancing the model's interpretability and credibility.
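
To make the parallel architecture concrete, below is a minimal PyTorch sketch of the two branches as the abstract describes them. The layer names, the pooling choices, the two-layer encoder depth, and the input shape (11 process parameters over a window of time steps) are illustrative assumptions, not the authors' released code.

```python
# Hypothetical sketch of the parallel BiGRU-Attention / iTransformer model.
# Shapes and depths beyond those stated in the abstract are assumptions.
import torch
import torch.nn as nn

class FlatnessModel(nn.Module):
    def __init__(self, n_features=11, seq_len=32,
                 gru_hidden=128, itr_dim=256, n_heads=8, fused_dim=128):
        super().__init__()
        # Branch 1: BiGRU (hidden size 128 per direction) with additive
        # attention pooling over the time steps.
        self.bigru = nn.GRU(n_features, gru_hidden, batch_first=True,
                            bidirectional=True)
        self.attn = nn.Linear(2 * gru_hidden, 1)
        # Branch 2: inverted-Transformer-style encoder. Each of the
        # n_features variates is embedded from its whole length-seq_len
        # series, and self-attention runs across variates (the "inverted"
        # axis), with feature dimension 256 and 8 heads.
        self.variate_embed = nn.Linear(seq_len, itr_dim)
        enc_layer = nn.TransformerEncoderLayer(d_model=itr_dim, nhead=n_heads,
                                               batch_first=True)
        self.itransformer = nn.TransformerEncoder(enc_layer, num_layers=2)
        # Fuse both branches into a 128-d vector, then predict flatness.
        self.fuse = nn.Linear(2 * gru_hidden + itr_dim, fused_dim)
        self.head = nn.Linear(fused_dim, 1)

    def forward(self, x):                       # x: (batch, seq_len, n_features)
        h, _ = self.bigru(x)                    # (batch, seq_len, 256)
        w = torch.softmax(self.attn(h), dim=1)  # attention weights over time
        branch1 = (w * h).sum(dim=1)            # (batch, 256)
        v = self.variate_embed(x.transpose(1, 2))   # (batch, n_features, 256)
        branch2 = self.itransformer(v).mean(dim=1)  # (batch, 256)
        fused = torch.relu(self.fuse(torch.cat([branch1, branch2], dim=-1)))
        return self.head(fused).squeeze(-1)     # predicted flatness value

model = FlatnessModel()
dummy = torch.randn(4, 32, 11)  # 4 samples, 32 time steps, 11 parameters
print(model(dummy).shape)       # torch.Size([4])
```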
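The reported error indicators are standard regression metrics. A minimal NumPy sketch of how they relate, assuming `y_true` and `y_pred` are 1-D arrays of measured and predicted flatness values:

```python
import numpy as np

def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE, and R^2 for a set of predictions."""
    err = y_true - y_pred
    mse = np.mean(err ** 2)
    rmse = np.sqrt(mse)                # RMSE is the square root of MSE
    mae = np.mean(np.abs(err))
    r2 = 1.0 - mse / np.var(y_true)    # R^2 = 1 - SS_res / SS_tot
    return mse, rmse, mae, r2
```

Note that the reported values are internally consistent: the RMSE of 0.968 is the square root of the MSE of 0.937.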
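A feature ablation study of the kind described can be sketched as a leave-one-feature-out loop: withhold each parameter in turn, retrain, and rank parameters by how much the test error degrades. Here `train_and_eval` is a hypothetical callback, not part of the paper, that retrains the model on the kept features and returns its test MSE.

```python
# Hypothetical leave-one-feature-out ablation; feature symbols follow the
# abstract, and train_and_eval is a placeholder supplied by the caller.
FEATURES = ["Tf", "Gd", "WR", "Hb", "F", "IR", "Y", "Hf", "V", "Tb", "B"]

def rank_feature_importance(train_and_eval, baseline_mse):
    """Rank features by the MSE increase observed when each is withheld."""
    deltas = {}
    for name in FEATURES:
        kept = [f for f in FEATURES if f != name]
        deltas[name] = train_and_eval(kept) - baseline_mse
    # A larger degradation when removed means a more important feature.
    return sorted(deltas, key=deltas.get, reverse=True)
```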
About the Journal:
JOM is a technical journal devoted to exploring the many aspects of materials science and engineering. JOM reports scholarly work that explores state-of-the-art processing, fabrication, design, and application of metals, ceramics, plastics, composites, and other materials. In pursuing this goal, JOM strives to balance the interests of the laboratory and the marketplace by reporting academic, industrial, and government-sponsored work from around the world.