{"title":"A novel approach to modeling breakdown pressure dynamics using machine learning","authors":"Subhan Aliyev, Talal Al Shafloot, Murtada Saleh Aljawad, Abdulazeez Abdulraheem, Salaheldin Elkatatny","doi":"10.1016/j.petlm.2025.07.008","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents a novel machine learning (ML)-based approach for predicting breakdown pressure (BP) in hydraulic fracturing using experimental data. Unlike traditional analytical models that rely on simplified assumptions, ML models can capture complex nonlinear relationships between BP and its influencing factors. However, a key limitation in BP prediction stems from dataset constraints, particularly the scale differences between experimental setups and real-world formations. To mitigate these limitations, this research utilizes a unique dataset of 144 BP data points, incorporating various rock mechanical properties, injection parameters, and fluid properties. Additionally, a separate analysis of pressurization rate, based on 32 additional experimental data points, was conducted to better understand its effect on fracture initiation—an aspect often overlooked in ML-based studies. The dataset includes critical parameters such as injection rate, confining pressure, tensile strength, Young's modulus, permeability, unconfined compressive strength, Poisson's ratio, porosity, wellbore radius, and fracture geometry ratio. Five ML models—LightGBM, CatBoost, XGBoost, Kolmogorov-Arnold Network (KAN), and TabNet—were trained and evaluated. TabNet achieved the highest predictive performance (<em>R</em><sup>2</sup> = 0.94) due to its attention-based feature selection and deep-learning-based representation learning. Model performance was assessed using mean absolute error (MAE) and mean squared error (MSE) to ensure robustness. To further enhance model interpretability, SHapley Additive exPlanations (SHAP) and TabNet's attention mechanism were used to explicitly assess feature importance, providing insights into the relative influence of different parameters on BP predictions. Additionally, advanced feature-handling techniques were employed to address categorical variables automatically, ensuring minimal preprocessing bias. The findings demonstrate the scalability of ML models for BP prediction using experimental data, reducing reliance on costly and time-consuming laboratory testing. By incorporating advanced interpretability techniques, systematic pressurization rate analysis, and robust ML architectures, this research provides a more accurate, data-driven approach for optimizing hydraulic fracturing designs.</div></div>","PeriodicalId":37433,"journal":{"name":"Petroleum","volume":"11 4","pages":"Pages 516-532"},"PeriodicalIF":3.5000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Petroleum","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405656125000586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents a novel machine learning (ML)-based approach for predicting breakdown pressure (BP) in hydraulic fracturing using experimental data. Unlike traditional analytical models that rely on simplified assumptions, ML models can capture complex nonlinear relationships between BP and its influencing factors. However, a key limitation in BP prediction stems from dataset constraints, particularly the scale differences between experimental setups and real-world formations. To mitigate these limitations, this research utilizes a unique dataset of 144 BP data points, incorporating various rock mechanical properties, injection parameters, and fluid properties. Additionally, a separate analysis of pressurization rate, based on 32 additional experimental data points, was conducted to better understand its effect on fracture initiation—an aspect often overlooked in ML-based studies. The dataset includes critical parameters such as injection rate, confining pressure, tensile strength, Young's modulus, permeability, unconfined compressive strength, Poisson's ratio, porosity, wellbore radius, and fracture geometry ratio. Five ML models—LightGBM, CatBoost, XGBoost, Kolmogorov-Arnold Network (KAN), and TabNet—were trained and evaluated. TabNet achieved the highest predictive performance (R2 = 0.94) due to its attention-based feature selection and deep-learning-based representation learning. Model performance was assessed using mean absolute error (MAE) and mean squared error (MSE) to ensure robustness. To further enhance model interpretability, SHapley Additive exPlanations (SHAP) and TabNet's attention mechanism were used to explicitly assess feature importance, providing insights into the relative influence of different parameters on BP predictions. Additionally, advanced feature-handling techniques were employed to address categorical variables automatically, ensuring minimal preprocessing bias. The findings demonstrate the scalability of ML models for BP prediction using experimental data, reducing reliance on costly and time-consuming laboratory testing. By incorporating advanced interpretability techniques, systematic pressurization rate analysis, and robust ML architectures, this research provides a more accurate, data-driven approach for optimizing hydraulic fracturing designs.
期刊介绍:
Examples of appropriate topical areas that will be considered include the following: 1.comprehensive research on oil and gas reservoir (reservoir geology): -geological basis of oil and gas reservoirs -reservoir geochemistry -reservoir formation mechanism -reservoir identification methods and techniques 2.kinetics of oil and gas basins and analyses of potential oil and gas resources: -fine description factors of hydrocarbon accumulation -mechanism analysis on recovery and dynamic accumulation process -relationship between accumulation factors and the accumulation process -analysis of oil and gas potential resource 3.theories and methods for complex reservoir geophysical prospecting: -geophysical basis of deep geologic structures and background of hydrocarbon occurrence -geophysical prediction of deep and complex reservoirs -physical test analyses and numerical simulations of reservoir rocks -anisotropic medium seismic imaging theory and new technology for multiwave seismic exploration -o theories and methods for reservoir fluid geophysical identification and prediction 4.theories, methods, technology, and design for complex reservoir development: -reservoir percolation theory and application technology -field development theories and methods -theory and technology for enhancing recovery efficiency 5.working liquid for oil and gas wells and reservoir protection technology: -working chemicals and mechanics for oil and gas wells -reservoir protection technology 6.new techniques and technologies for oil and gas drilling and production: -under-balanced drilling/gas drilling -special-track well drilling -cementing and completion of oil and gas wells -engineering safety applications for oil and gas wells -new technology of fracture acidizing