Use Metaheuristics to Improve the Quality of Drilling Real-Time Data for Advance Artificial Intelligent and Machine Learning Modeling. Case Study: Cleanse Hook-Load Real-Time Data

S. Gharbi, Moataz A. Ahmed, S. Elkatatny
{"title":"Use Metaheuristics to Improve the Quality of Drilling Real-Time Data for Advance Artificial Intelligent and Machine Learning Modeling. Case Study: Cleanse Hook-Load Real-Time Data","authors":"S. Gharbi, Moataz A. Ahmed, S. Elkatatny","doi":"10.2118/192810-MS","DOIUrl":null,"url":null,"abstract":"\n The drilling engineers are overmild with huge amount of data-points, argue the need to develop Artificial Intelligent (AI) and Machine Learning (ML) models to crunch these huge amount of data generating decision-like information. There are a lot of challenges developing such approach, varying from computational power, lack of subject matter experts, and develop the optimum algorithm. But the main bottleneck is the quality of the data. Regardless of how advance AI/ML model, if the data is bad, the model will generate bad result; garbage-in garbage-out. The scope of this paper is to use metaheuristics models to improve the data quality. The process start by extracting Hook-Load drilling real-time data. And explore the raw data quality using visualization and statistical methods. Then apply several Metaheuristics models to generate functional approximation equation that identify/ follow the trend of the good-quality data. This will be by employ multiple scenarios with different degree of randomness that lead to the highest matching which generate the high quality level. The process will cover different technique including Greedy, Hill-Climbing, Random Search, and Simulated Annealing. During this process hundreds of thousands of scenarios will be conducted to simulate the Hook-Load data, to identify the optimum functional approximation equation that match the best data quality. Which can then safely integrated into the advance Artificial Intelligent and Machine Learning models. Running such process require an expensive computational cost, since it includes huge amount of real-time data need to be process under complex advance models. Moreover it require a deep understanding of the internal process of each models to ensure finest manipulating them to get the optimum data quality result. Running these scenarios, lead successfully to functional approximation that spill the data behavior, with Mean Absolute Error (MAE) equal to 10.5. It is worth height that functional approximation is very expensive in term of time and complexity, but it generate the highest quality result, leading to better AI/ML model. Moreover it is the most dynamic approach allowing it to be applied in other drilling real-time parameters as well. Utilizing Metaheuristics approach to improve the data quality is new to the upstream domain in general, with almost no application in drilling in specific. The novelty is to introduce this advance technique into the drilling real-time data domain, it will sharply improve the data quality leading to higher Artificial Intelligent and Machine Learning prediction/ analytical models. It worth mentioning that such approach will run all those simulation/ scenarios and adjust itself automatically with almost no manual interference. Leading to self-data-driven data-quality model.","PeriodicalId":11208,"journal":{"name":"Day 2 Tue, November 13, 2018","volume":"558 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Tue, November 13, 2018","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/192810-MS","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The drilling engineers are overmild with huge amount of data-points, argue the need to develop Artificial Intelligent (AI) and Machine Learning (ML) models to crunch these huge amount of data generating decision-like information. There are a lot of challenges developing such approach, varying from computational power, lack of subject matter experts, and develop the optimum algorithm. But the main bottleneck is the quality of the data. Regardless of how advance AI/ML model, if the data is bad, the model will generate bad result; garbage-in garbage-out. The scope of this paper is to use metaheuristics models to improve the data quality. The process start by extracting Hook-Load drilling real-time data. And explore the raw data quality using visualization and statistical methods. Then apply several Metaheuristics models to generate functional approximation equation that identify/ follow the trend of the good-quality data. This will be by employ multiple scenarios with different degree of randomness that lead to the highest matching which generate the high quality level. The process will cover different technique including Greedy, Hill-Climbing, Random Search, and Simulated Annealing. During this process hundreds of thousands of scenarios will be conducted to simulate the Hook-Load data, to identify the optimum functional approximation equation that match the best data quality. Which can then safely integrated into the advance Artificial Intelligent and Machine Learning models. Running such process require an expensive computational cost, since it includes huge amount of real-time data need to be process under complex advance models. Moreover it require a deep understanding of the internal process of each models to ensure finest manipulating them to get the optimum data quality result. Running these scenarios, lead successfully to functional approximation that spill the data behavior, with Mean Absolute Error (MAE) equal to 10.5. It is worth height that functional approximation is very expensive in term of time and complexity, but it generate the highest quality result, leading to better AI/ML model. Moreover it is the most dynamic approach allowing it to be applied in other drilling real-time parameters as well. Utilizing Metaheuristics approach to improve the data quality is new to the upstream domain in general, with almost no application in drilling in specific. The novelty is to introduce this advance technique into the drilling real-time data domain, it will sharply improve the data quality leading to higher Artificial Intelligent and Machine Learning prediction/ analytical models. It worth mentioning that such approach will run all those simulation/ scenarios and adjust itself automatically with almost no manual interference. Leading to self-data-driven data-quality model.
利用元启发式方法提高钻井实时数据质量,促进人工智能和机器学习建模。案例研究:清理钩子负载实时数据
钻井工程师对大量数据点过于温和,认为有必要开发人工智能(AI)和机器学习(ML)模型来处理这些大量数据,生成类似决策的信息。开发这样的方法有很多挑战,从计算能力、缺乏主题专家到开发最佳算法。但主要的瓶颈是数据的质量。不管AI/ML模型有多先进,如果数据不好,模型就会产生不好的结果;垃圾输出。本文的范围是使用元启发式模型来提高数据质量。首先提取Hook-Load钻井实时数据。并利用可视化和统计学方法探讨原始数据的质量。然后应用几个元启发式模型生成识别/跟踪高质量数据趋势的函数逼近方程。这将通过使用具有不同随机性程度的多个场景来实现最高匹配,从而生成高质量的关卡。该过程将涵盖不同的技术,包括贪婪,爬坡,随机搜索和模拟退火。在此过程中,将进行数十万个场景来模拟Hook-Load数据,以确定与最佳数据质量匹配的最佳函数近似方程。然后可以安全地集成到先进的人工智能和机器学习模型中。运行这样的进程需要昂贵的计算成本,因为它包含大量的实时数据,需要在复杂的先进模型下进行处理。此外,它需要深入了解每个模型的内部过程,以确保最好地操作它们以获得最佳的数据质量结果。运行这些场景,可以成功地得到泄漏数据行为的函数近似,平均绝对误差(MAE)等于10.5。值得注意的是,函数近似在时间和复杂性方面是非常昂贵的,但它会产生最高质量的结果,从而产生更好的AI/ML模型。此外,它是最动态的方法,可以应用于其他钻井实时参数。一般来说,利用元启发式方法来提高数据质量对上游领域来说是一种新的方法,在钻井领域几乎没有具体的应用。新颖之处在于,将这种先进技术引入钻井实时数据领域,将大大提高数据质量,从而实现更高的人工智能和机器学习预测/分析模型。值得一提的是,这种方法将运行所有这些模拟/场景,并自动调整自己,几乎没有人为干扰。导致自数据驱动的数据质量模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信