Shadfar Davoodi, Mohammad Mehrad, David A. Wood, Mohammed Al-Shargabi, Grachik Eremyan, Tamara Shulgina
{"title":"A novel data-driven model for real-time prediction of static Young's modulus applying mud-logging data","authors":"Shadfar Davoodi, Mohammad Mehrad, David A. Wood, Mohammed Al-Shargabi, Grachik Eremyan, Tamara Shulgina","doi":"10.1007/s12145-024-01474-5","DOIUrl":null,"url":null,"abstract":"<p>Effective drilling planning relies on understanding the rock mechanical properties, typically estimated from petrophysical data. Real-time estimation of these properties, especially static Young's modulus (<span>\\({E}_{sta}\\)</span>), is crucial for geomechanical modeling, wellbore stability, and cost-effective decision-making. In this study, predictive models of <span>\\({E}_{sta}\\)</span> were developed using mudlogging data from two vertically drilled wells (A and B) in the same field. <span>\\({E}_{sta}\\)</span> was estimated from petrophysical data across the studied depth range in both wells using a field-specific equation. Outlier data were identified and removed by evaluating the cross plot of mechanical specific energy and drilling rate for Well A. The data from Well A were then randomly divided into training and testing sets. The algorithms, multi-layer perceptron neural networks, random forests, Gaussian process regression (GPR), and support vector regression, were adjusted and applied to the training data. The resulting models were evaluated on the test data. The GPR model demonstrated the lowest RMSE values in both the training (0.0075 GPa) and testing (0.4577 GPa) phases, indicating superior performance. To further assess the models, the overfitting index and scoring techniques were employed, revealing that the GPR model exhibited the lowest overfitting value and outperformed the other models. Consequently, the GPR model was selected as the best-performing model and was analyzed using Shapley additive explanation to evaluate the influence of each input feature on the output. This analysis indicated that depth had the greatest effect, while rotation speed had the least impact on the model's output. The application of the GPR model to predict <span>\\({E}_{sta}\\)</span> in Well B demonstrated its high generalization capability. Therefore, it can be confidently stated that with additional data, this model could be effectively applied to similar depth ranges in other wells within the field. The study introduces innovations by applying GPR to predict <span>\\({E}_{sta}\\)</span> from mudlogging data, addressing outlier impact on predictions, and developing a real-time <span>\\({E}_{sta}\\)</span> prediction model for drilling.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":"161 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-024-01474-5","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Effective drilling planning relies on understanding the rock mechanical properties, typically estimated from petrophysical data. Real-time estimation of these properties, especially static Young's modulus (\({E}_{sta}\)), is crucial for geomechanical modeling, wellbore stability, and cost-effective decision-making. In this study, predictive models of \({E}_{sta}\) were developed using mudlogging data from two vertically drilled wells (A and B) in the same field. \({E}_{sta}\) was estimated from petrophysical data across the studied depth range in both wells using a field-specific equation. Outlier data were identified and removed by evaluating the cross plot of mechanical specific energy and drilling rate for Well A. The data from Well A were then randomly divided into training and testing sets. The algorithms, multi-layer perceptron neural networks, random forests, Gaussian process regression (GPR), and support vector regression, were adjusted and applied to the training data. The resulting models were evaluated on the test data. The GPR model demonstrated the lowest RMSE values in both the training (0.0075 GPa) and testing (0.4577 GPa) phases, indicating superior performance. To further assess the models, the overfitting index and scoring techniques were employed, revealing that the GPR model exhibited the lowest overfitting value and outperformed the other models. Consequently, the GPR model was selected as the best-performing model and was analyzed using Shapley additive explanation to evaluate the influence of each input feature on the output. This analysis indicated that depth had the greatest effect, while rotation speed had the least impact on the model's output. The application of the GPR model to predict \({E}_{sta}\) in Well B demonstrated its high generalization capability. Therefore, it can be confidently stated that with additional data, this model could be effectively applied to similar depth ranges in other wells within the field. The study introduces innovations by applying GPR to predict \({E}_{sta}\) from mudlogging data, addressing outlier impact on predictions, and developing a real-time \({E}_{sta}\) prediction model for drilling.
期刊介绍:
The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.