{"title":"工业软测量建模中基于变压器的组合多头自编码器","authors":"Yanhong Li, Shiwei Gao, Wenfeng Zhao","doi":"10.1016/j.engappai.2025.111681","DOIUrl":null,"url":null,"abstract":"<div><div>Soft-sensor modeling is common in industrial production, but high data dimensionality, a lack of labeled features, and inadequate methods complicate extracting nonlinear feature representations. This paper proposes a Transformer-based auto-encoder with a combined multi-head-attention approach (TAE-CMHA) for soft-sensor modeling, which offers advantages for nonlinear feature representation. It introduces a combined multi-head-attention mechanism (CMHA) that improves feature-extraction accuracy and robustness. The Transformer's global feature extraction capabilities are leveraged in the auto-encoder for better nonlinear feature extraction. Additionally, label information optimizes the auto-encoder's reconstruction loss function which improves feature acquisition for predicting target outputs. Compared to supervised methods, the unsupervised auto-encoder uses abundant unlabeled industrial data to improve generalizability. Experiments were conducted on the industrial steam flow and debutanizer column datasets. The results show that the mean squared error (MSE) of the proposed method reaches a minimum of 0.00297 and the coefficient of determination (R<sup>2</sup>) is 0.881 in debutanizer column datasets, which shows the advantages of the model.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111681"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transformer-based auto-encoder with combined multi-head-attention for industrial soft-sensor modeling\",\"authors\":\"Yanhong Li, Shiwei Gao, Wenfeng Zhao\",\"doi\":\"10.1016/j.engappai.2025.111681\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Soft-sensor modeling is common in industrial production, but high data dimensionality, a lack of labeled features, and inadequate methods complicate extracting nonlinear feature representations. This paper proposes a Transformer-based auto-encoder with a combined multi-head-attention approach (TAE-CMHA) for soft-sensor modeling, which offers advantages for nonlinear feature representation. It introduces a combined multi-head-attention mechanism (CMHA) that improves feature-extraction accuracy and robustness. The Transformer's global feature extraction capabilities are leveraged in the auto-encoder for better nonlinear feature extraction. Additionally, label information optimizes the auto-encoder's reconstruction loss function which improves feature acquisition for predicting target outputs. Compared to supervised methods, the unsupervised auto-encoder uses abundant unlabeled industrial data to improve generalizability. Experiments were conducted on the industrial steam flow and debutanizer column datasets. The results show that the mean squared error (MSE) of the proposed method reaches a minimum of 0.00297 and the coefficient of determination (R<sup>2</sup>) is 0.881 in debutanizer column datasets, which shows the advantages of the model.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"159 \",\"pages\":\"Article 111681\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625016835\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625016835","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Transformer-based auto-encoder with combined multi-head-attention for industrial soft-sensor modeling
Soft-sensor modeling is common in industrial production, but high data dimensionality, a lack of labeled features, and inadequate methods complicate extracting nonlinear feature representations. This paper proposes a Transformer-based auto-encoder with a combined multi-head-attention approach (TAE-CMHA) for soft-sensor modeling, which offers advantages for nonlinear feature representation. It introduces a combined multi-head-attention mechanism (CMHA) that improves feature-extraction accuracy and robustness. The Transformer's global feature extraction capabilities are leveraged in the auto-encoder for better nonlinear feature extraction. Additionally, label information optimizes the auto-encoder's reconstruction loss function which improves feature acquisition for predicting target outputs. Compared to supervised methods, the unsupervised auto-encoder uses abundant unlabeled industrial data to improve generalizability. Experiments were conducted on the industrial steam flow and debutanizer column datasets. The results show that the mean squared error (MSE) of the proposed method reaches a minimum of 0.00297 and the coefficient of determination (R2) is 0.881 in debutanizer column datasets, which shows the advantages of the model.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.