High‐Fidelity Data Augmentation for Few‐Shot Learning in Jet Grout Injection Applications

IF 3.4 2区 工程技术 Q2 ENGINEERING, GEOLOGICAL
Pierre Guy Atangana Njock, Zhen‐Yu Yin, Ning Zhang
{"title":"High‐Fidelity Data Augmentation for Few‐Shot Learning in Jet Grout Injection Applications","authors":"Pierre Guy Atangana Njock, Zhen‐Yu Yin, Ning Zhang","doi":"10.1002/nag.3862","DOIUrl":null,"url":null,"abstract":"Contemporary geoengineering challenges grapple with the plateauing of both existing algorithms and their depth of insights, a phenomenon exacerbated by the scarcity of high‐fidelity data. Although existing solutions such as Monte‐Carlo method can generate abundant data, they are not sufficiently robust for ensuring the high fidelity of data. This study proposes a novel data augmentation framework that combines statistical and machine learning methods to generate high‐fidelity synthetic data, which closely align with field data in terms of the statistical and empirical attributes. The innovations of the proposed approach lie in the integration of Copulas theory for data generation, a developed geo‐regression anomaly detection (GRAD) for adjusting data attributes, and an evolutionary polynomial regression for data consistency enforcement. The multilayer perceptron (MLP) and a wide‐and‐deep (WaD) networks are applied to assess the effectiveness of high‐fidelity data augmentation using jet grouting data. The outcomes reveal the robustness of the synthetic data generation framework, achieving satisfactory fidelity in both empirical and statistical attributes. The proposed data augmentation improved the <jats:italic>R<jats:sup>2</jats:sup></jats:italic> and MAE achieved by MLP and WaD up to 28.37% under data fractions ranging from 0.2 to 1. MLP and WaD yielded comparable results in terms of accuracy and generalization ability across various augmented fractions. This indicates that the accuracy of synthetic data plays a pivotal role, suggesting improving data quality can be highly effective in boosting performance, regardless of the model complexity. This study contributes valuable insights to addressing the challenges of scare high‐fidelity data in geoengineering.","PeriodicalId":13786,"journal":{"name":"International Journal for Numerical and Analytical Methods in Geomechanics","volume":"123 1","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal for Numerical and Analytical Methods in Geomechanics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1002/nag.3862","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, GEOLOGICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Contemporary geoengineering challenges grapple with the plateauing of both existing algorithms and their depth of insights, a phenomenon exacerbated by the scarcity of high‐fidelity data. Although existing solutions such as Monte‐Carlo method can generate abundant data, they are not sufficiently robust for ensuring the high fidelity of data. This study proposes a novel data augmentation framework that combines statistical and machine learning methods to generate high‐fidelity synthetic data, which closely align with field data in terms of the statistical and empirical attributes. The innovations of the proposed approach lie in the integration of Copulas theory for data generation, a developed geo‐regression anomaly detection (GRAD) for adjusting data attributes, and an evolutionary polynomial regression for data consistency enforcement. The multilayer perceptron (MLP) and a wide‐and‐deep (WaD) networks are applied to assess the effectiveness of high‐fidelity data augmentation using jet grouting data. The outcomes reveal the robustness of the synthetic data generation framework, achieving satisfactory fidelity in both empirical and statistical attributes. The proposed data augmentation improved the R2 and MAE achieved by MLP and WaD up to 28.37% under data fractions ranging from 0.2 to 1. MLP and WaD yielded comparable results in terms of accuracy and generalization ability across various augmented fractions. This indicates that the accuracy of synthetic data plays a pivotal role, suggesting improving data quality can be highly effective in boosting performance, regardless of the model complexity. This study contributes valuable insights to addressing the challenges of scare high‐fidelity data in geoengineering.
在喷射灌浆注射应用中进行高保真数据扩增以实现少量学习
当代地球工程学面临的挑战是现有算法及其洞察深度都已趋于稳定,而高保真数据的稀缺加剧了这一现象。虽然蒙特卡洛法等现有解决方案可以生成大量数据,但它们在确保数据的高保真方面不够稳健。本研究提出了一种新颖的数据增强框架,该框架结合了统计和机器学习方法来生成高保真合成数据,这些数据在统计和经验属性方面与实地数据非常接近。该方法的创新之处在于整合了用于生成数据的 Copulas 理论、用于调整数据属性的地理回归异常检测(GRAD)以及用于数据一致性执行的进化多项式回归。多层感知器(MLP)和宽深度(WaD)网络被用于评估使用喷射灌浆数据进行高保真数据增强的有效性。结果显示了合成数据生成框架的稳健性,在经验和统计属性方面都达到了令人满意的保真度。在数据分数为 0.2 到 1 的情况下,所提出的数据增强方法将 MLP 和 WaD 的 R2 和 MAE 提高了 28.37%。MLP 和 WaD 在各种增强分数的准确性和泛化能力方面取得了不相上下的结果。这表明,合成数据的准确性起着至关重要的作用,无论模型的复杂程度如何,提高数据质量都能非常有效地提升性能。这项研究为解决地质工程中高保真数据的恐慌难题提供了宝贵的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.40
自引率
12.50%
发文量
160
审稿时长
9 months
期刊介绍: The journal welcomes manuscripts that substantially contribute to the understanding of the complex mechanical behaviour of geomaterials (soils, rocks, concrete, ice, snow, and powders), through innovative experimental techniques, and/or through the development of novel numerical or hybrid experimental/numerical modelling concepts in geomechanics. Topics of interest include instabilities and localization, interface and surface phenomena, fracture and failure, multi-physics and other time-dependent phenomena, micromechanics and multi-scale methods, and inverse analysis and stochastic methods. Papers related to energy and environmental issues are particularly welcome. The illustration of the proposed methods and techniques to engineering problems is encouraged. However, manuscripts dealing with applications of existing methods, or proposing incremental improvements to existing methods – in particular marginal extensions of existing analytical solutions or numerical methods – will not be considered for review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信