{"title":"Evolution of learning curve and white box machine learning models for estimating in-situ stresses based on velocity-stress relationship","authors":"Ayyaz Mustafa , Guanyi Lu , Andrew P. Bunger","doi":"10.1016/j.fuel.2025.136306","DOIUrl":null,"url":null,"abstract":"<div><div>Reliable estimation of in-situ stresses has recently been demonstrated using an integrated deep learning/machine learning (DL/ML)-based workflow requiring training and validation using laboratory-based true triaxial ultrasonic velocity (TUV) experimental data representing ultrasonic velocities at various stress combinations. However, it remains to clarify how many TUV experiments must be performed in order to effectively and efficiently train the model. This paper presents analysis of the learning curve of the model for different dataset sizes, thereby defining the smallest effective training dataset to develop reliable prediction models. The ML/DL models were developed using TUV data obtained using five different subsurface core samples from the Utah FORGE well 16B(78)–32. Velocities were measured for 93 stress combinations per sample. Initially, prediction performances of ML/DL models were compared using 20, 40, 60, 80 and 100 percent of the total dataset. Learning curve analysis demonstrated the improvement in prediction performance up to 80% of the dataset, indicating that a model with similar predictive capacity could have been developed with 20% fewer data points collected in the laboratory. Introducing a learning curve analysis early in a project can therefore lead to significant cost savings when applying a DL/ML approach to in-situ stress estimate based on velocity-stress relationships.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":"404 ","pages":"Article 136306"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236125020319","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Reliable estimation of in-situ stresses has recently been demonstrated using an integrated deep learning/machine learning (DL/ML)-based workflow requiring training and validation using laboratory-based true triaxial ultrasonic velocity (TUV) experimental data representing ultrasonic velocities at various stress combinations. However, it remains to clarify how many TUV experiments must be performed in order to effectively and efficiently train the model. This paper presents analysis of the learning curve of the model for different dataset sizes, thereby defining the smallest effective training dataset to develop reliable prediction models. The ML/DL models were developed using TUV data obtained using five different subsurface core samples from the Utah FORGE well 16B(78)–32. Velocities were measured for 93 stress combinations per sample. Initially, prediction performances of ML/DL models were compared using 20, 40, 60, 80 and 100 percent of the total dataset. Learning curve analysis demonstrated the improvement in prediction performance up to 80% of the dataset, indicating that a model with similar predictive capacity could have been developed with 20% fewer data points collected in the laboratory. Introducing a learning curve analysis early in a project can therefore lead to significant cost savings when applying a DL/ML approach to in-situ stress estimate based on velocity-stress relationships.
期刊介绍:
The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.