Yuxiang Meng , Qian Fang , Guoli Zheng , Gan Wang , Pengfei Li , Shuang Chen
{"title":"基于迁移学习的盾构机圆盘刀具磨损预测","authors":"Yuxiang Meng , Qian Fang , Guoli Zheng , Gan Wang , Pengfei Li , Shuang Chen","doi":"10.1016/j.tust.2025.106633","DOIUrl":null,"url":null,"abstract":"<div><div>Disc cutters serve as the primary device in shield machines for rock breaking during tunnel construction. Assessing the wear state of disc cutters is crucial for making timely replacement decisions. Several researchers have successfully predicted disc cutter wear with acceptable accuracy using Machine Learning (ML). However, ML models are often project-specific. The model trained on one project cannot be applied to the other project if the two projects have significant deviations, resulting in a waste of resources and effort. To address this issue, we propose a domain-adversarial-based transfer learning method to improve the generalization performance of ML models. In particular, we integrate the Domain-Adversarial Neural Network (DANN) with the Transformer. The proposed model makes domain discrimination and regression prediction for input parameters. The domain-adversarial mechanism makes the extracted features from input parameters share many commonalities and confuses data from different domains, which can improve the generalization performance of the model. The hyperparameter <em>λ</em> of the proposed model is used to balance the importance of domain discrimination and regression prediction. We validate the effectiveness of the proposed method in the second subsea tunnel in Qingdao, China. The south and service tunnels of the project are in similar strata conditions but have a significant difference in tunnel diameter. They share many commonalities in the wear characteristics of disc cutters. We set the service tunnel as the source domain and the south tunnel as the target domain. The model is trained and tested in the service tunnel, learning wear characteristics under different strata. Then, the pre-trained model is transferred to the south tunnel and fine-tuned with limited data to adapt to wear characteristics under different shields. Finally, the fine-tuned model is used to predict the wear values of the target data in the south tunnel. The domain-adversarial-based Transformer model outperforms FCN, LSTM and Transformer without the domain-adversarial mechanism and requires even limited target project data, unlike traditional models, which require extra training data. The proposed method can be applied in the early stage of related projects when data are scarce.</div></div>","PeriodicalId":49414,"journal":{"name":"Tunnelling and Underground Space Technology","volume":"162 ","pages":"Article 106633"},"PeriodicalIF":6.7000,"publicationDate":"2025-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of disc cutter wear of shield machines based on transfer learning\",\"authors\":\"Yuxiang Meng , Qian Fang , Guoli Zheng , Gan Wang , Pengfei Li , Shuang Chen\",\"doi\":\"10.1016/j.tust.2025.106633\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Disc cutters serve as the primary device in shield machines for rock breaking during tunnel construction. Assessing the wear state of disc cutters is crucial for making timely replacement decisions. Several researchers have successfully predicted disc cutter wear with acceptable accuracy using Machine Learning (ML). However, ML models are often project-specific. The model trained on one project cannot be applied to the other project if the two projects have significant deviations, resulting in a waste of resources and effort. To address this issue, we propose a domain-adversarial-based transfer learning method to improve the generalization performance of ML models. In particular, we integrate the Domain-Adversarial Neural Network (DANN) with the Transformer. The proposed model makes domain discrimination and regression prediction for input parameters. The domain-adversarial mechanism makes the extracted features from input parameters share many commonalities and confuses data from different domains, which can improve the generalization performance of the model. The hyperparameter <em>λ</em> of the proposed model is used to balance the importance of domain discrimination and regression prediction. We validate the effectiveness of the proposed method in the second subsea tunnel in Qingdao, China. The south and service tunnels of the project are in similar strata conditions but have a significant difference in tunnel diameter. They share many commonalities in the wear characteristics of disc cutters. We set the service tunnel as the source domain and the south tunnel as the target domain. The model is trained and tested in the service tunnel, learning wear characteristics under different strata. Then, the pre-trained model is transferred to the south tunnel and fine-tuned with limited data to adapt to wear characteristics under different shields. Finally, the fine-tuned model is used to predict the wear values of the target data in the south tunnel. The domain-adversarial-based Transformer model outperforms FCN, LSTM and Transformer without the domain-adversarial mechanism and requires even limited target project data, unlike traditional models, which require extra training data. The proposed method can be applied in the early stage of related projects when data are scarce.</div></div>\",\"PeriodicalId\":49414,\"journal\":{\"name\":\"Tunnelling and Underground Space Technology\",\"volume\":\"162 \",\"pages\":\"Article 106633\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-04-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tunnelling and Underground Space Technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0886779825002718\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tunnelling and Underground Space Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0886779825002718","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Prediction of disc cutter wear of shield machines based on transfer learning
Disc cutters serve as the primary device in shield machines for rock breaking during tunnel construction. Assessing the wear state of disc cutters is crucial for making timely replacement decisions. Several researchers have successfully predicted disc cutter wear with acceptable accuracy using Machine Learning (ML). However, ML models are often project-specific. The model trained on one project cannot be applied to the other project if the two projects have significant deviations, resulting in a waste of resources and effort. To address this issue, we propose a domain-adversarial-based transfer learning method to improve the generalization performance of ML models. In particular, we integrate the Domain-Adversarial Neural Network (DANN) with the Transformer. The proposed model makes domain discrimination and regression prediction for input parameters. The domain-adversarial mechanism makes the extracted features from input parameters share many commonalities and confuses data from different domains, which can improve the generalization performance of the model. The hyperparameter λ of the proposed model is used to balance the importance of domain discrimination and regression prediction. We validate the effectiveness of the proposed method in the second subsea tunnel in Qingdao, China. The south and service tunnels of the project are in similar strata conditions but have a significant difference in tunnel diameter. They share many commonalities in the wear characteristics of disc cutters. We set the service tunnel as the source domain and the south tunnel as the target domain. The model is trained and tested in the service tunnel, learning wear characteristics under different strata. Then, the pre-trained model is transferred to the south tunnel and fine-tuned with limited data to adapt to wear characteristics under different shields. Finally, the fine-tuned model is used to predict the wear values of the target data in the south tunnel. The domain-adversarial-based Transformer model outperforms FCN, LSTM and Transformer without the domain-adversarial mechanism and requires even limited target project data, unlike traditional models, which require extra training data. The proposed method can be applied in the early stage of related projects when data are scarce.
期刊介绍:
Tunnelling and Underground Space Technology is an international journal which publishes authoritative articles encompassing the development of innovative uses of underground space and the results of high quality research into improved, more cost-effective techniques for the planning, geo-investigation, design, construction, operation and maintenance of underground and earth-sheltered structures. The journal provides an effective vehicle for the improved worldwide exchange of information on developments in underground technology - and the experience gained from its use - and is strongly committed to publishing papers on the interdisciplinary aspects of creating, planning, and regulating underground space.