Model-based offline reinforcement learning framework for optimizing tunnel boring machine operation

IF 8.2 1区 工程技术 Q1 ENGINEERING, CIVIL
Yupeng Cao , Wei Luo , Yadong Xue , Weiren Lin , Feng Zhang
{"title":"Model-based offline reinforcement learning framework for optimizing tunnel boring machine operation","authors":"Yupeng Cao ,&nbsp;Wei Luo ,&nbsp;Yadong Xue ,&nbsp;Weiren Lin ,&nbsp;Feng Zhang","doi":"10.1016/j.undsp.2024.01.008","DOIUrl":null,"url":null,"abstract":"<div><p>Research on automation and intelligent operation of tunnel boring machine (TBM) is receiving more and more attention, benefiting from the increasing construction data. However, most studies on TBM operations optimization were trained by the labels of human drivers’ decisions, which were subjective and stochastic. As a result, the control parameters suggested by these models could hardly surpass the performance of a human driver, even the possibility of subjective incorrect decisions. Considering that the geomechanical feedback to TBM under drivers’ actions is objective, in this paper, a transformer-based model called the geological response for tunnel boring machine (GRTBM), is proposed to learn the relationship between operation-adjust and TBM monitoring changes. Additionally, with the model-based offline reinforcement learning, this paper provided a novel approach to optimizing the TBM excavation operations. The decision processes, recorded in the Yin-song TBM project for a waterway tunnel in Jilin Province of China, were used for the validation of the model. By adopting an implicit perception of geological conditions in the GRTBM model, the suggested method achieved the desired state within a single action, greatly outperformed the practical adjustments where 500 s were taken, revealing the fact that the proposed model has the potential to surpass the capability of human beings.</p></div>","PeriodicalId":48505,"journal":{"name":"Underground Space","volume":"19 ","pages":"Pages 47-71"},"PeriodicalIF":8.2000,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2467967424000540/pdfft?md5=12e60e379c29962c638a0fecd8e75a42&pid=1-s2.0-S2467967424000540-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Underground Space","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2467967424000540","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

Research on automation and intelligent operation of tunnel boring machine (TBM) is receiving more and more attention, benefiting from the increasing construction data. However, most studies on TBM operations optimization were trained by the labels of human drivers’ decisions, which were subjective and stochastic. As a result, the control parameters suggested by these models could hardly surpass the performance of a human driver, even the possibility of subjective incorrect decisions. Considering that the geomechanical feedback to TBM under drivers’ actions is objective, in this paper, a transformer-based model called the geological response for tunnel boring machine (GRTBM), is proposed to learn the relationship between operation-adjust and TBM monitoring changes. Additionally, with the model-based offline reinforcement learning, this paper provided a novel approach to optimizing the TBM excavation operations. The decision processes, recorded in the Yin-song TBM project for a waterway tunnel in Jilin Province of China, were used for the validation of the model. By adopting an implicit perception of geological conditions in the GRTBM model, the suggested method achieved the desired state within a single action, greatly outperformed the practical adjustments where 500 s were taken, revealing the fact that the proposed model has the potential to surpass the capability of human beings.

优化隧道掘进机运行的基于模型的离线强化学习框架
得益于不断增加的施工数据,隧道掘进机(TBM)的自动化和智能化操作研究正受到越来越多的关注。然而,大多数有关隧道掘进机操作优化的研究都是根据人类驾驶员的决策标签进行训练的,而这些决策标签具有主观性和随机性。因此,这些模型建议的控制参数很难超越人类驾驶员的表现,甚至可能出现主观错误决策。考虑到驾驶员行为对隧道掘进机的地质力学反馈是客观的,本文提出了基于变压器的隧道掘进机地质响应(GRTBM)模型,以学习操作调整与隧道掘进机监测变化之间的关系。此外,通过基于模型的离线强化学习,本文提供了一种优化隧道掘进机挖掘作业的新方法。模型的验证采用了中国吉林省银松水道隧道 TBM 项目中记录的决策过程。通过在 GRTBM 模型中采用对地质条件的隐式感知,所建议的方法在一次操作中就达到了预期状态,大大优于实际调整所需的 500 秒,揭示了所建议的模型具有超越人类能力的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Underground Space
Underground Space ENGINEERING, CIVIL-
CiteScore
10.20
自引率
14.10%
发文量
71
审稿时长
63 days
期刊介绍: Underground Space is an open access international journal without article processing charges (APC) committed to serving as a scientific forum for researchers and practitioners in the field of underground engineering. The journal welcomes manuscripts that deal with original theories, methods, technologies, and important applications throughout the life-cycle of underground projects, including planning, design, operation and maintenance, disaster prevention, and demolition. The journal is particularly interested in manuscripts related to the latest development of smart underground engineering from the perspectives of resilience, resources saving, environmental friendliness, humanity, and artificial intelligence. The manuscripts are expected to have significant innovation and potential impact in the field of underground engineering, and should have clear association with or application in underground projects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信