Pierre Guy Atangana Njock, Zhen‐Yu Yin, Ning Zhang
{"title":"在喷射灌浆注射应用中进行高保真数据扩增以实现少量学习","authors":"Pierre Guy Atangana Njock, Zhen‐Yu Yin, Ning Zhang","doi":"10.1002/nag.3862","DOIUrl":null,"url":null,"abstract":"Contemporary geoengineering challenges grapple with the plateauing of both existing algorithms and their depth of insights, a phenomenon exacerbated by the scarcity of high‐fidelity data. Although existing solutions such as Monte‐Carlo method can generate abundant data, they are not sufficiently robust for ensuring the high fidelity of data. This study proposes a novel data augmentation framework that combines statistical and machine learning methods to generate high‐fidelity synthetic data, which closely align with field data in terms of the statistical and empirical attributes. The innovations of the proposed approach lie in the integration of Copulas theory for data generation, a developed geo‐regression anomaly detection (GRAD) for adjusting data attributes, and an evolutionary polynomial regression for data consistency enforcement. The multilayer perceptron (MLP) and a wide‐and‐deep (WaD) networks are applied to assess the effectiveness of high‐fidelity data augmentation using jet grouting data. The outcomes reveal the robustness of the synthetic data generation framework, achieving satisfactory fidelity in both empirical and statistical attributes. The proposed data augmentation improved the <jats:italic>R<jats:sup>2</jats:sup></jats:italic> and MAE achieved by MLP and WaD up to 28.37% under data fractions ranging from 0.2 to 1. MLP and WaD yielded comparable results in terms of accuracy and generalization ability across various augmented fractions. This indicates that the accuracy of synthetic data plays a pivotal role, suggesting improving data quality can be highly effective in boosting performance, regardless of the model complexity. This study contributes valuable insights to addressing the challenges of scare high‐fidelity data in geoengineering.","PeriodicalId":13786,"journal":{"name":"International Journal for Numerical and Analytical Methods in Geomechanics","volume":"123 1","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High‐Fidelity Data Augmentation for Few‐Shot Learning in Jet Grout Injection Applications\",\"authors\":\"Pierre Guy Atangana Njock, Zhen‐Yu Yin, Ning Zhang\",\"doi\":\"10.1002/nag.3862\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Contemporary geoengineering challenges grapple with the plateauing of both existing algorithms and their depth of insights, a phenomenon exacerbated by the scarcity of high‐fidelity data. Although existing solutions such as Monte‐Carlo method can generate abundant data, they are not sufficiently robust for ensuring the high fidelity of data. This study proposes a novel data augmentation framework that combines statistical and machine learning methods to generate high‐fidelity synthetic data, which closely align with field data in terms of the statistical and empirical attributes. The innovations of the proposed approach lie in the integration of Copulas theory for data generation, a developed geo‐regression anomaly detection (GRAD) for adjusting data attributes, and an evolutionary polynomial regression for data consistency enforcement. The multilayer perceptron (MLP) and a wide‐and‐deep (WaD) networks are applied to assess the effectiveness of high‐fidelity data augmentation using jet grouting data. The outcomes reveal the robustness of the synthetic data generation framework, achieving satisfactory fidelity in both empirical and statistical attributes. The proposed data augmentation improved the <jats:italic>R<jats:sup>2</jats:sup></jats:italic> and MAE achieved by MLP and WaD up to 28.37% under data fractions ranging from 0.2 to 1. MLP and WaD yielded comparable results in terms of accuracy and generalization ability across various augmented fractions. This indicates that the accuracy of synthetic data plays a pivotal role, suggesting improving data quality can be highly effective in boosting performance, regardless of the model complexity. This study contributes valuable insights to addressing the challenges of scare high‐fidelity data in geoengineering.\",\"PeriodicalId\":13786,\"journal\":{\"name\":\"International Journal for Numerical and Analytical Methods in Geomechanics\",\"volume\":\"123 1\",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal for Numerical and Analytical Methods in Geomechanics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1002/nag.3862\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, GEOLOGICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal for Numerical and Analytical Methods in Geomechanics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1002/nag.3862","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, GEOLOGICAL","Score":null,"Total":0}
High‐Fidelity Data Augmentation for Few‐Shot Learning in Jet Grout Injection Applications
Contemporary geoengineering challenges grapple with the plateauing of both existing algorithms and their depth of insights, a phenomenon exacerbated by the scarcity of high‐fidelity data. Although existing solutions such as Monte‐Carlo method can generate abundant data, they are not sufficiently robust for ensuring the high fidelity of data. This study proposes a novel data augmentation framework that combines statistical and machine learning methods to generate high‐fidelity synthetic data, which closely align with field data in terms of the statistical and empirical attributes. The innovations of the proposed approach lie in the integration of Copulas theory for data generation, a developed geo‐regression anomaly detection (GRAD) for adjusting data attributes, and an evolutionary polynomial regression for data consistency enforcement. The multilayer perceptron (MLP) and a wide‐and‐deep (WaD) networks are applied to assess the effectiveness of high‐fidelity data augmentation using jet grouting data. The outcomes reveal the robustness of the synthetic data generation framework, achieving satisfactory fidelity in both empirical and statistical attributes. The proposed data augmentation improved the R2 and MAE achieved by MLP and WaD up to 28.37% under data fractions ranging from 0.2 to 1. MLP and WaD yielded comparable results in terms of accuracy and generalization ability across various augmented fractions. This indicates that the accuracy of synthetic data plays a pivotal role, suggesting improving data quality can be highly effective in boosting performance, regardless of the model complexity. This study contributes valuable insights to addressing the challenges of scare high‐fidelity data in geoengineering.
期刊介绍:
The journal welcomes manuscripts that substantially contribute to the understanding of the complex mechanical behaviour of geomaterials (soils, rocks, concrete, ice, snow, and powders), through innovative experimental techniques, and/or through the development of novel numerical or hybrid experimental/numerical modelling concepts in geomechanics. Topics of interest include instabilities and localization, interface and surface phenomena, fracture and failure, multi-physics and other time-dependent phenomena, micromechanics and multi-scale methods, and inverse analysis and stochastic methods. Papers related to energy and environmental issues are particularly welcome. The illustration of the proposed methods and techniques to engineering problems is encouraged. However, manuscripts dealing with applications of existing methods, or proposing incremental improvements to existing methods – in particular marginal extensions of existing analytical solutions or numerical methods – will not be considered for review.