Mingjun Liu , Jianqin Liu , Wei Guo , Hongxu Liu , Xiao Guo
{"title":"Multi-patch attention Transformer for multivariate long-term time series forecasting of TBM excavation parameters","authors":"Mingjun Liu , Jianqin Liu , Wei Guo , Hongxu Liu , Xiao Guo","doi":"10.1016/j.undsp.2025.02.007","DOIUrl":null,"url":null,"abstract":"<div><div>To address the research gap in multivariable long-term time series forecasting in the field of tunnel boring machine (TBM) and provide long-term insights for decision-making in TBM construction, this paper studies a novel Transformer-based forecasting model. Leveraging a multi-patch attention mechanism, the newly developed multi-patch attention Transformer (MPAT) model is designed to predict long-term trends of multiple TBM operation parameters. The innovation lies in finding the most relevant time delay series of the input series through autocorrelation calculation, and designing a multi-patch attention mechanism to replace the traditional attention mechanism of Transformer, so that the model can capture local and global information of the series and improve the accuracy of long-term prediction of high-frequency and weakly periodic TBM data. Experimental results have shown that MPAT model has a significant effect on capturing TBM data in terms of temporal dependencies. In a case study, we applied MPAT to the Rongjiang Guanbu Water Diversion Project in Guangdong Province and predicted four excavation parameters. The experimental results show that MPAT exhibits accurate predictive ability when the input length is 36 and the outputs are 12, 24, 48, and 72, respectively. In comparison with some state-of-the-art models, MPAT outperforms MSE by 19.1%, 23.6%, 36.4%, and 48.3%, respectively. We also discussed the impact of input length and the number of patches on performance, and found that each prediction length has the best input length corresponding to it, and longer inputs don’t represent more accurate predictions. The determination of the number of patches should also depend on the input length, as too many or too few patches can affect the capture of local information in the sequence.</div></div>","PeriodicalId":48505,"journal":{"name":"Underground Space","volume":"23 ","pages":"Pages 285-306"},"PeriodicalIF":8.3000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Underground Space","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2467967425000431","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
To address the research gap in multivariable long-term time series forecasting in the field of tunnel boring machine (TBM) and provide long-term insights for decision-making in TBM construction, this paper studies a novel Transformer-based forecasting model. Leveraging a multi-patch attention mechanism, the newly developed multi-patch attention Transformer (MPAT) model is designed to predict long-term trends of multiple TBM operation parameters. The innovation lies in finding the most relevant time delay series of the input series through autocorrelation calculation, and designing a multi-patch attention mechanism to replace the traditional attention mechanism of Transformer, so that the model can capture local and global information of the series and improve the accuracy of long-term prediction of high-frequency and weakly periodic TBM data. Experimental results have shown that MPAT model has a significant effect on capturing TBM data in terms of temporal dependencies. In a case study, we applied MPAT to the Rongjiang Guanbu Water Diversion Project in Guangdong Province and predicted four excavation parameters. The experimental results show that MPAT exhibits accurate predictive ability when the input length is 36 and the outputs are 12, 24, 48, and 72, respectively. In comparison with some state-of-the-art models, MPAT outperforms MSE by 19.1%, 23.6%, 36.4%, and 48.3%, respectively. We also discussed the impact of input length and the number of patches on performance, and found that each prediction length has the best input length corresponding to it, and longer inputs don’t represent more accurate predictions. The determination of the number of patches should also depend on the input length, as too many or too few patches can affect the capture of local information in the sequence.
期刊介绍:
Underground Space is an open access international journal without article processing charges (APC) committed to serving as a scientific forum for researchers and practitioners in the field of underground engineering. The journal welcomes manuscripts that deal with original theories, methods, technologies, and important applications throughout the life-cycle of underground projects, including planning, design, operation and maintenance, disaster prevention, and demolition. The journal is particularly interested in manuscripts related to the latest development of smart underground engineering from the perspectives of resilience, resources saving, environmental friendliness, humanity, and artificial intelligence. The manuscripts are expected to have significant innovation and potential impact in the field of underground engineering, and should have clear association with or application in underground projects.