Zhixing Deng , Linrong Xu , Qian Su , Yuanxingzi He , Yongwei Li
{"title":"基于经验主义约束神经网络和 SHapley Additive exPlanations 分析的高速铁路路基累积变形预测新方法","authors":"Zhixing Deng , Linrong Xu , Qian Su , Yuanxingzi He , Yongwei Li","doi":"10.1016/j.trgeo.2024.101438","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding the long-term deformation of high-speed railway subgrade is essential for solving deformation issues and managing operations. Machine learning methods are commonly used to predict subgrade cumulative deformation (SCD). However, traditional machine learning models for SCD prediction have poor generalization to new data and lack visualization. Hence, this study proposes a novel method using an empiricism-constrained neural network (ECNN) and SHapley Additive exPlanations (SHAP) analysis for predicting SCD of high-speed railways. Firstly, the SCD prediction dataset is constructed and divided into training and test sets. Then, neural network models are developed using the training set, and the optimal model is determined based on the comprehensive scoring results on the test set. The optimal model couples empirical information into the neural network with loss function modification, to create the ECNN model. Finally, the interpretability of the ECNN model is analyzed using the SHAP method. The results indicate that the Bi-directional Gated Recurrent Unit (Bi-GRU) model is the optimal model with the highest <em>CSI</em> value of 23. The ECNN model outperforms the Bi-GRU in generalization to new data, especially in long-term SCD prediction with limited training data. Contribution analysis shows that the top two features influencing the prediction are <em>S</em><sub>t-1</sub> (54.4%) and <em>S</em><sub>t-2</sub> (30.4%), consistent with the findings of the ablation analysis. The research results can provide a new reference for predicting the SCD of high-speed railways.</div></div>","PeriodicalId":56013,"journal":{"name":"Transportation Geotechnics","volume":"49 ","pages":"Article 101438"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel method for subgrade cumulative deformation prediction of high-speed railways based on empiricism-constrained neural network and SHapley Additive exPlanations analysis\",\"authors\":\"Zhixing Deng , Linrong Xu , Qian Su , Yuanxingzi He , Yongwei Li\",\"doi\":\"10.1016/j.trgeo.2024.101438\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Understanding the long-term deformation of high-speed railway subgrade is essential for solving deformation issues and managing operations. Machine learning methods are commonly used to predict subgrade cumulative deformation (SCD). However, traditional machine learning models for SCD prediction have poor generalization to new data and lack visualization. Hence, this study proposes a novel method using an empiricism-constrained neural network (ECNN) and SHapley Additive exPlanations (SHAP) analysis for predicting SCD of high-speed railways. Firstly, the SCD prediction dataset is constructed and divided into training and test sets. Then, neural network models are developed using the training set, and the optimal model is determined based on the comprehensive scoring results on the test set. The optimal model couples empirical information into the neural network with loss function modification, to create the ECNN model. Finally, the interpretability of the ECNN model is analyzed using the SHAP method. The results indicate that the Bi-directional Gated Recurrent Unit (Bi-GRU) model is the optimal model with the highest <em>CSI</em> value of 23. The ECNN model outperforms the Bi-GRU in generalization to new data, especially in long-term SCD prediction with limited training data. Contribution analysis shows that the top two features influencing the prediction are <em>S</em><sub>t-1</sub> (54.4%) and <em>S</em><sub>t-2</sub> (30.4%), consistent with the findings of the ablation analysis. The research results can provide a new reference for predicting the SCD of high-speed railways.</div></div>\",\"PeriodicalId\":56013,\"journal\":{\"name\":\"Transportation Geotechnics\",\"volume\":\"49 \",\"pages\":\"Article 101438\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Geotechnics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2214391224002599\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Geotechnics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214391224002599","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
A novel method for subgrade cumulative deformation prediction of high-speed railways based on empiricism-constrained neural network and SHapley Additive exPlanations analysis
Understanding the long-term deformation of high-speed railway subgrade is essential for solving deformation issues and managing operations. Machine learning methods are commonly used to predict subgrade cumulative deformation (SCD). However, traditional machine learning models for SCD prediction have poor generalization to new data and lack visualization. Hence, this study proposes a novel method using an empiricism-constrained neural network (ECNN) and SHapley Additive exPlanations (SHAP) analysis for predicting SCD of high-speed railways. Firstly, the SCD prediction dataset is constructed and divided into training and test sets. Then, neural network models are developed using the training set, and the optimal model is determined based on the comprehensive scoring results on the test set. The optimal model couples empirical information into the neural network with loss function modification, to create the ECNN model. Finally, the interpretability of the ECNN model is analyzed using the SHAP method. The results indicate that the Bi-directional Gated Recurrent Unit (Bi-GRU) model is the optimal model with the highest CSI value of 23. The ECNN model outperforms the Bi-GRU in generalization to new data, especially in long-term SCD prediction with limited training data. Contribution analysis shows that the top two features influencing the prediction are St-1 (54.4%) and St-2 (30.4%), consistent with the findings of the ablation analysis. The research results can provide a new reference for predicting the SCD of high-speed railways.
期刊介绍:
Transportation Geotechnics is a journal dedicated to publishing high-quality, theoretical, and applied papers that cover all facets of geotechnics for transportation infrastructure such as roads, highways, railways, underground railways, airfields, and waterways. The journal places a special emphasis on case studies that present original work relevant to the sustainable construction of transportation infrastructure. The scope of topics it addresses includes the geotechnical properties of geomaterials for sustainable and rational design and construction, the behavior of compacted and stabilized geomaterials, the use of geosynthetics and reinforcement in constructed layers and interlayers, ground improvement and slope stability for transportation infrastructures, compaction technology and management, maintenance technology, the impact of climate, embankments for highways and high-speed trains, transition zones, dredging, underwater geotechnics for infrastructure purposes, and the modeling of multi-layered structures and supporting ground under dynamic and repeated loads.