Machine learning assisted estimation of total solids content of drilling fluids

IF 4.2

Artificial Intelligence in Geosciences Pub Date : 2025-07-05 DOI:10.1016/j.aiig.2025.100138

B.T. Gunel , Y.D. Pak , A.Ö. Herekeli , S. Gül , B. Kulga , E. Artun

{"title":"Machine learning assisted estimation of total solids content of drilling fluids","authors":"B.T. Gunel , Y.D. Pak , A.Ö. Herekeli , S. Gül , B. Kulga , E. Artun","doi":"10.1016/j.aiig.2025.100138","DOIUrl":null,"url":null,"abstract":"<div><div>Characterization and optimization of physical and chemical properties of drilling fluids are critical for the efficiency and success of drilling operations. In particular, maintaining the optimal levels of solids content is essential for achieving the most effective fluid performance. Proper management of solids content also reduces the risk of tool failures. Traditional solids content analysis methods, such as retort analysis, require substantial human intervention and time, which can lead to inaccuracies, time-management issues, and increased operational risks. In contrast to human-intensive methods, machine learning may offer a viable alternative for solids content estimation due to its pattern-recognition capability. In this study, a large set of laboratory reports of drilling-fluid analyses from 130 oil wells around the world were compiled to construct a comprehensive data set. The relationships among various rheological parameters were analyzed using statistical methods and machine learning algorithms. Several machine learning algorithms of diverse classes, namely linear (linear regression, ridge regression, and ElasticNet regression), kernel-based (support vector machine) and ensemble tree-based (gradient boosting, XGBoost, and random forests) algorithms, were trained and tuned to estimate solids content from other readily available drilling fluid properties. Input variables were kept consistent across all models for interpretation and comparison purposes. In the final stage, different evaluation metrics were employed to evaluate and compare the performance of different classes of machine learning models. Among all algorithms tested, random forests algorithm was found to be the best predictive model resulting in consistently high accuracy. Further optimization of the random forests model resulted in a mean absolute percentage error (MAPE) of 3.9% and 9.6% and R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 and 0.93 for the training and testing sets, respectively. Analysis of residuals, their histograms and Q-Q normality plots showed Gaussian distributions with residuals that are scattered around a mean of zero within error ranges of <span><math><mo>±</mo></math></span>1% and <span><math><mo>±</mo></math></span>4%, for training and testing, respectively. The selected model was further validated by applying the rheological measurements from mud samples taken from an offshore well from the Gulf of Mexico. The model was able to estimate total solids content in those four mud samples with an average absolute error of 1.08% of total solids content. The model was then used to develop a web-based graphical-user-interface (GUI) application, which can be practically used at the rig site by engineers to optimize drilling fluid programs. The proposed model can complement automation workflows that are designed to measure fundamental rheological properties in real time during drilling operations. While a standard retort test can take approximately 2 h at the rig site, such kind of real-time estimations can help the rig personnel to timely optimize drilling fluids, with a potential of saving 2920 man-hours in a given year for a single drilling rig.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100138"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544125000346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Characterization and optimization of physical and chemical properties of drilling fluids are critical for the efficiency and success of drilling operations. In particular, maintaining the optimal levels of solids content is essential for achieving the most effective fluid performance. Proper management of solids content also reduces the risk of tool failures. Traditional solids content analysis methods, such as retort analysis, require substantial human intervention and time, which can lead to inaccuracies, time-management issues, and increased operational risks. In contrast to human-intensive methods, machine learning may offer a viable alternative for solids content estimation due to its pattern-recognition capability. In this study, a large set of laboratory reports of drilling-fluid analyses from 130 oil wells around the world were compiled to construct a comprehensive data set. The relationships among various rheological parameters were analyzed using statistical methods and machine learning algorithms. Several machine learning algorithms of diverse classes, namely linear (linear regression, ridge regression, and ElasticNet regression), kernel-based (support vector machine) and ensemble tree-based (gradient boosting, XGBoost, and random forests) algorithms, were trained and tuned to estimate solids content from other readily available drilling fluid properties. Input variables were kept consistent across all models for interpretation and comparison purposes. In the final stage, different evaluation metrics were employed to evaluate and compare the performance of different classes of machine learning models. Among all algorithms tested, random forests algorithm was found to be the best predictive model resulting in consistently high accuracy. Further optimization of the random forests model resulted in a mean absolute percentage error (MAPE) of 3.9% and 9.6% and R

^{2}

of 0.99 and 0.93 for the training and testing sets, respectively. Analysis of residuals, their histograms and Q-Q normality plots showed Gaussian distributions with residuals that are scattered around a mean of zero within error ranges of

\pm

1% and

\pm

4%, for training and testing, respectively. The selected model was further validated by applying the rheological measurements from mud samples taken from an offshore well from the Gulf of Mexico. The model was able to estimate total solids content in those four mud samples with an average absolute error of 1.08% of total solids content. The model was then used to develop a web-based graphical-user-interface (GUI) application, which can be practically used at the rig site by engineers to optimize drilling fluid programs. The proposed model can complement automation workflows that are designed to measure fundamental rheological properties in real time during drilling operations. While a standard retort test can take approximately 2 h at the rig site, such kind of real-time estimations can help the rig personnel to timely optimize drilling fluids, with a potential of saving 2920 man-hours in a given year for a single drilling rig.

查看原文本刊更多论文

机器学习辅助估计钻井液的总固体含量

钻井液物理化学性质的表征和优化对钻井作业的效率和成功至关重要。特别是，保持最佳固体含量水平对于实现最有效的流体性能至关重要。适当的固体含量管理也降低了工具故障的风险。传统的固体含量分析方法，如蒸馏器分析，需要大量的人工干预和时间，这可能导致不准确，时间管理问题，并增加操作风险。与人工密集型方法相比，机器学习由于其模式识别能力，可以为固体含量估计提供可行的替代方案。在这项研究中，收集了大量来自世界各地130口油井的钻井液分析实验室报告，构建了一个全面的数据集。利用统计方法和机器学习算法分析了各流变参数之间的关系。几种不同类型的机器学习算法，即线性（线性回归、脊回归和ElasticNet回归）、基于核（支持向量机）和基于集成树（梯度增强、XGBoost和随机森林）算法，经过训练和调整，可以从其他可用的钻井液性质中估计固体含量。为了解释和比较的目的，所有模型的输入变量保持一致。在最后阶段，采用不同的评估指标来评估和比较不同类别的机器学习模型的性能。在所有被测试的算法中，随机森林算法是最好的预测模型，具有较高的准确率。进一步优化随机森林模型，训练集和测试集的平均绝对百分比误差（MAPE）分别为3.9%和9.6%，R2分别为0.99和0.93。残差分析，其直方图和Q-Q正态图显示高斯分布，残差分散在平均值零附近，误差范围分别为±1%和±4%，用于训练和测试。通过对墨西哥湾海上油井的泥浆样品进行流变测量，进一步验证了所选模型的有效性。该模型能够估计出这4种泥浆样品中的总固体含量，平均绝对误差为总固体含量的1.08%。然后，该模型被用于开发基于web的图形用户界面（GUI）应用程序，工程师可以在钻井现场实际使用该应用程序来优化钻井液方案。所提出的模型可以补充自动化工作流程，旨在实时测量钻井作业中的基本流变性能。虽然标准的油罐测试在钻井现场大约需要2小时，但这种实时评估可以帮助钻井人员及时优化钻井液，单台钻机每年可节省2920个工时。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence in Geosciences

CiteScore

4.20

自引率

0.00%

发文量