Deep learning models for efficient geotechnical predictions: reducing training effort and data requirements with transfer learning

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics Pub Date : 2025-09-12 DOI:10.1016/j.aei.2025.103852

Haoding Xu , Xuzhen He , Shaoheng Dai , Caihui Zhu , Feng Shan , Qin Zhao , Faning Dang , Daichao Sheng

{"title":"Deep learning models for efficient geotechnical predictions: reducing training effort and data requirements with transfer learning","authors":"Haoding Xu , Xuzhen He , Shaoheng Dai , Caihui Zhu , Feng Shan , Qin Zhao , Faning Dang , Daichao Sheng","doi":"10.1016/j.aei.2025.103852","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate predictions of geotechnical failure loads, such as bearing capacity and slope stability, are important for safe infrastructure design and risk management. Recently, the use of ready-to-use deep-learning models as surrogate models to improve computational efficiency of risk analysis considering spatial variability has been proposed. These deep-learning models were trained with big datasets that covers all possible material properties and boundary conditions for a particular kind of problem, so it is ready to make predictions without further training. However, training such a large model requires a big dataset and substantial computational efforts. This study introduces a novel framework by employing existing deep neural networks and transfer learning techniques to effectively address these issues. Specifically, the pre-trained MobileNetV2 for image classification is used as a base. It is found that parametric ReLU (PReLU) and locally connected (LC) layers are important for our tasks. The PReLU activation can mitigate neuron deactivation by allowing the model to learn negative input slopes, while LC layers enabled the extraction of localized features – critical for accurately representing spatial variability in soil properties. Compared to a handcrafted locally connected network (MAPE ≈ 2%), the proposed deep neural network achieves similar predictive accuracy (MAPE ≈ 3%) but reduced training times (only 10% time required on the same computer). Notably, training with only 50 to 60% of the data maintained stable performance, and even with as little as 8.5% of the original dataset, satisfactory accuracy was achieved. Furthermore, this transfer learning approach generalized seamlessly to various problems without significant modifications–both bearing capacity and slope stability problems are tested. The results highlight the accuracy, efficiency, and generalization of the proposed framework, indicating the potential to simplify geotechnical engineering analyses and accelerate decision-making in practice.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103852"},"PeriodicalIF":9.9000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625007451","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate predictions of geotechnical failure loads, such as bearing capacity and slope stability, are important for safe infrastructure design and risk management. Recently, the use of ready-to-use deep-learning models as surrogate models to improve computational efficiency of risk analysis considering spatial variability has been proposed. These deep-learning models were trained with big datasets that covers all possible material properties and boundary conditions for a particular kind of problem, so it is ready to make predictions without further training. However, training such a large model requires a big dataset and substantial computational efforts. This study introduces a novel framework by employing existing deep neural networks and transfer learning techniques to effectively address these issues. Specifically, the pre-trained MobileNetV2 for image classification is used as a base. It is found that parametric ReLU (PReLU) and locally connected (LC) layers are important for our tasks. The PReLU activation can mitigate neuron deactivation by allowing the model to learn negative input slopes, while LC layers enabled the extraction of localized features – critical for accurately representing spatial variability in soil properties. Compared to a handcrafted locally connected network (MAPE ≈ 2%), the proposed deep neural network achieves similar predictive accuracy (MAPE ≈ 3%) but reduced training times (only 10% time required on the same computer). Notably, training with only 50 to 60% of the data maintained stable performance, and even with as little as 8.5% of the original dataset, satisfactory accuracy was achieved. Furthermore, this transfer learning approach generalized seamlessly to various problems without significant modifications–both bearing capacity and slope stability problems are tested. The results highlight the accuracy, efficiency, and generalization of the proposed framework, indicating the potential to simplify geotechnical engineering analyses and accelerate decision-making in practice.

查看原文本刊更多论文

用于有效岩土预测的深度学习模型：通过迁移学习减少训练工作量和数据需求

岩土破坏荷载的准确预测，如承载力和边坡稳定性，对安全基础设施设计和风险管理非常重要。近年来，人们提出使用现成的深度学习模型作为替代模型，以提高考虑空间变异性的风险分析的计算效率。这些深度学习模型是用大数据集训练的，这些数据集涵盖了特定类型问题的所有可能的材料属性和边界条件，因此无需进一步训练就可以做出预测。然而，训练如此庞大的模型需要庞大的数据集和大量的计算工作。本研究引入了一个新的框架，利用现有的深度神经网络和迁移学习技术来有效地解决这些问题。具体来说，使用预先训练好的用于图像分类的MobileNetV2作为基础。发现参数化ReLU （PReLU）和局部连接层（LC）对我们的任务很重要。PReLU激活可以通过允许模型学习负输入斜率来减轻神经元的失活，而LC层可以提取局部特征，这对于准确表示土壤属性的空间变异性至关重要。与手工制作的局部连接网络（MAPE≈2%）相比，所提出的深度神经网络实现了相似的预测精度（MAPE≈3%），但减少了训练时间（在同一台计算机上只需要10%的时间）。值得注意的是，仅使用50%至60%的数据进行训练就能保持稳定的性能，即使只使用原始数据集的8.5%，也能获得令人满意的准确性。此外，这种迁移学习方法无缝地推广到各种问题，而无需进行重大修改-承载力和边坡稳定性问题都进行了测试。结果突出了所提出框架的准确性、效率和通用性，表明了在实践中简化岩土工程分析和加速决策的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advanced Engineering Informatics 工程技术-工程：综合

CiteScore

12.40

自引率

18.20%

发文量

292

审稿时长

45 days

期刊介绍： Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.