Cuttings Accumulations Prediction in Deviated & Horizontal Wells with Dimensionless Data-Driven Models

M. Khaled, M. Khan, A. Barooah, Mohammad Azizur Rahman, A. R. Hasan
{"title":"Cuttings Accumulations Prediction in Deviated & Horizontal Wells with Dimensionless Data-Driven Models","authors":"M. Khaled, M. Khan, A. Barooah, Mohammad Azizur Rahman, A. R. Hasan","doi":"10.2523/iptc-24001-ms","DOIUrl":null,"url":null,"abstract":"\n Effective cuttings removal in deviated and horizontal wells is essential for improving drilling efficiency and preventing non-productive time (NPT) caused by hole-cleaning issues. While various numerical models have been developed to simulate cuttings accumulation in wellbores, only a subset of these models can be employed for real-time operations due to their complexity and lengthy computational requirements. This paper compares the performance of various data-driven (machine learning) models in monitoring cuttings bed accumulation in real-time during drilling operations.\n The construction of these data-driven models relies on the analysis of hundreds of bed height measurements obtained from ten flow loops. These models incorporate unique dimensionless parameters and are trained on a diverse dataset encompassing a wide range of drilling conditions. These conditions include variables such as the rate of penetration (ROP), drilling flow rate, drillstring rotation, hole eccentricity, wellbore hydraulic diameter and inclination, drilling fluid rheological parameters, and cuttings (solid) density and size. Five different data-driven models are evaluated: linear regressor (LR), deep neural networks (DNN), support vector regressor (SVR), random forests (RF), and extreme gradient boosting regressor (XGBoost) algorithms. Additionally, the performance of the developed models is assessed against previously unseen datasets to ensure fair evaluation. Comparisons are also made with the Duan correlation (a mechanistic model) to evaluate the accuracy and limitations of the data-driven models.\n A total of ten dimensionless parameters are devised to estimate bed height accumulation using the Buckingham-Π theorem and Pearson correlation. The results indicate that both the RF and XGBoost models exhibit accurate estimations of bed thickness, achieving root mean square error (RMSE) and mean absolute percentage error (MAPE) values around 0.07 and 13%, respectively. Furthermore, these two models demonstrate strong generalization capabilities and precision in estimating bed thickness, with a MAPE below 20% when validated against unseen datasets and compared to the Duan model. In contrast, the DNN model is observed to be less accurate than the RF and XGBoost models, though a majority of its predicted points still fall within the ±20% tolerance envelope. On the other hand, both the SVR and LR models exhibit poor accuracy in capturing the underlying relationship between input parameters and the target variable, as evidenced by their scattered residual values. Utilizing the Shapley additive explanations (SHAP) approach and RF feature analysis, the study identifies the Froude number as having high feature importance while negatively impacting bed height predictions. Conversely, the inlet feed concentration and annular eccentricity significantly positively contribute to bed height prediction.\n In conclusion, the data-driven (machine learning) models developed in this study offer a reliable means of real-time prediction for cuttings bed thickness during drilling operations. By eliminating the need for complex numerical models with extended computational times, these models empower proactive decision-making, thus enhancing drilling efficiency and minimizing NPT resulting from inadequate hole cleaning.","PeriodicalId":519056,"journal":{"name":"Day 1 Mon, February 12, 2024","volume":"231 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 1 Mon, February 12, 2024","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2523/iptc-24001-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Effective cuttings removal in deviated and horizontal wells is essential for improving drilling efficiency and preventing non-productive time (NPT) caused by hole-cleaning issues. While various numerical models have been developed to simulate cuttings accumulation in wellbores, only a subset of these models can be employed for real-time operations due to their complexity and lengthy computational requirements. This paper compares the performance of various data-driven (machine learning) models in monitoring cuttings bed accumulation in real-time during drilling operations. The construction of these data-driven models relies on the analysis of hundreds of bed height measurements obtained from ten flow loops. These models incorporate unique dimensionless parameters and are trained on a diverse dataset encompassing a wide range of drilling conditions. These conditions include variables such as the rate of penetration (ROP), drilling flow rate, drillstring rotation, hole eccentricity, wellbore hydraulic diameter and inclination, drilling fluid rheological parameters, and cuttings (solid) density and size. Five different data-driven models are evaluated: linear regressor (LR), deep neural networks (DNN), support vector regressor (SVR), random forests (RF), and extreme gradient boosting regressor (XGBoost) algorithms. Additionally, the performance of the developed models is assessed against previously unseen datasets to ensure fair evaluation. Comparisons are also made with the Duan correlation (a mechanistic model) to evaluate the accuracy and limitations of the data-driven models. A total of ten dimensionless parameters are devised to estimate bed height accumulation using the Buckingham-Π theorem and Pearson correlation. The results indicate that both the RF and XGBoost models exhibit accurate estimations of bed thickness, achieving root mean square error (RMSE) and mean absolute percentage error (MAPE) values around 0.07 and 13%, respectively. Furthermore, these two models demonstrate strong generalization capabilities and precision in estimating bed thickness, with a MAPE below 20% when validated against unseen datasets and compared to the Duan model. In contrast, the DNN model is observed to be less accurate than the RF and XGBoost models, though a majority of its predicted points still fall within the ±20% tolerance envelope. On the other hand, both the SVR and LR models exhibit poor accuracy in capturing the underlying relationship between input parameters and the target variable, as evidenced by their scattered residual values. Utilizing the Shapley additive explanations (SHAP) approach and RF feature analysis, the study identifies the Froude number as having high feature importance while negatively impacting bed height predictions. Conversely, the inlet feed concentration and annular eccentricity significantly positively contribute to bed height prediction. In conclusion, the data-driven (machine learning) models developed in this study offer a reliable means of real-time prediction for cuttings bed thickness during drilling operations. By eliminating the need for complex numerical models with extended computational times, these models empower proactive decision-making, thus enhancing drilling efficiency and minimizing NPT resulting from inadequate hole cleaning.
利用无量纲数据驱动模型预测偏差井和水平井中的岩屑堆积情况
在斜井和水平井中有效清除沉积物对于提高钻井效率和防止因清孔问题造成的非生产时间(NPT)至关重要。虽然已经开发了各种数值模型来模拟井筒中的沉积物,但由于其复杂性和冗长的计算要求,只有其中一部分模型可用于实时操作。本文比较了各种数据驱动(机器学习)模型在钻井作业期间实时监测岩屑床堆积的性能。这些数据驱动模型的构建依赖于对从十个流动回路中获得的数百个床面高度测量值的分析。这些模型包含独特的无量纲参数,并在包含各种钻井条件的不同数据集上进行训练。这些条件包括钻进速度(ROP)、钻井流速、钻杆旋转、钻孔偏心率、井筒水力直径和倾斜度、钻井液流变参数以及钻屑(固体)密度和尺寸等变量。评估了五种不同的数据驱动模型:线性回归(LR)、深度神经网络(DNN)、支持向量回归(SVR)、随机森林(RF)和极端梯度提升回归(XGBoost)算法。此外,为了确保评估的公平性,还针对以前未见过的数据集对所开发模型的性能进行了评估。还与 Duan 相关性(一种机理模型)进行了比较,以评估数据驱动模型的准确性和局限性。共设计了 10 个无量纲参数,利用白金汉-Π 定理和皮尔逊相关性估算床面高度累积。结果表明,RF 和 XGBoost 模型都能准确估算床厚,其均方根误差 (RMSE) 和平均绝对百分比误差 (MAPE) 值分别约为 0.07% 和 13%。此外,这两个模型在估算床厚方面表现出很强的泛化能力和精度,在对未见数据集进行验证并与 Duan 模型进行比较时,MAPE 值低于 20%。相比之下,DNN 模型的精确度要低于 RF 和 XGBoost 模型,尽管其预测的大部分点仍在±20% 的容差范围内。另一方面,SVR 和 LR 模型在捕捉输入参数与目标变量之间的潜在关系方面表现出了较差的准确性,这一点可以从它们分散的残差值看出。利用夏普利加法解释(SHAP)方法和射频特征分析,研究发现弗劳德数具有较高的特征重要性,同时对床层高度预测有负面影响。相反,入口进料浓度和环形偏心率则对床面高度预测有显著的积极作用。总之,本研究开发的数据驱动(机器学习)模型为钻井作业期间的岩屑床厚度实时预测提供了可靠的方法。这些模型不需要复杂的数值模型和较长的计算时间,能够帮助用户做出前瞻性决策,从而提高钻井效率,并最大限度地减少因孔清理不彻底而导致的NPT。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信