Event Recognition on Time Series Frac Data Using Machine Learning

A. Ramirez, J. Iriarte
{"title":"Event Recognition on Time Series Frac Data Using Machine Learning","authors":"A. Ramirez, J. Iriarte","doi":"10.2118/195317-MS","DOIUrl":null,"url":null,"abstract":"\n Hydraulic fracturing pumping data is recorded and mapped in the field at one-second intervals. The designation of the stage start and end time is very important because these boundaries govern summary calculations, such as pressure, rate, and concentrations. Manual selection of staging flags is often very time consuming and prone to inaccuracies due to the lack of uniform selection and interpretation methods across the industry. The purpose of this study is to demonstrate the automation process to identify accurate and consistent stage start and end times in a high-frequency treating plot using machine learning algorithms.\n This study is based on the analysis of metered high-frequency treatment data coupled with supervised machine learning algorithms. The pumping dataset includes treating pressure, slurry rate, and clean volume for 179 stages, for a total of 1,530,445 rows of data per variable. Sixty-six percent of the data were used to train the model, eight percent were used to validate the model, and the remaining twenty-six percent were used to test it. Subject matter expertise, taking into account user-defined start/end time flags, was used to train the algorithm.\n Pumping data behaves very differently than traditional time-series data such as weather, stock prices, or population growth. The features examined are not affected by time but by physical events, so the correlation or dependency between features can affect accurate pattern recognition. To allow the algorithm to run leaner, the dataset was pre-processed using loss functions, smoothing techniques, and the rate of change of the main data channels. To understand how data may impair the predictions and to evaluate different model performances, we tested two classification algorithms: logistic regression and support vector machine.\n Classification techniques were used to generate an accurate suggestion of where the pumping of a hydraulic fracturing stage starts and ends in a high-frequency treating plot. Results show that flag predictions have a training and validation accuracy of approximately 90 percent using logistic regression algorithms. The predicted flags were within 10 seconds of the manual selected flag. A limitation of this method is that it requires periodic retraining with new field data to improve the prediction robustness and to maintain high accuracy.\n Accurate start and end time selections make it not only viable to process large volumes of fracture treatment data but also reduce the time spent reviewing field data for quality control. Petroleum engineers need to continue their focus on optimizing their systems with the greatest possible efficiency. Leveraging common analytical methods combined with the large, structured datasets that are readily available provide impressive results without extensive programming knowledge.","PeriodicalId":425264,"journal":{"name":"Day 2 Wed, April 24, 2019","volume":"217 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Wed, April 24, 2019","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/195317-MS","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Hydraulic fracturing pumping data is recorded and mapped in the field at one-second intervals. The designation of the stage start and end time is very important because these boundaries govern summary calculations, such as pressure, rate, and concentrations. Manual selection of staging flags is often very time consuming and prone to inaccuracies due to the lack of uniform selection and interpretation methods across the industry. The purpose of this study is to demonstrate the automation process to identify accurate and consistent stage start and end times in a high-frequency treating plot using machine learning algorithms. This study is based on the analysis of metered high-frequency treatment data coupled with supervised machine learning algorithms. The pumping dataset includes treating pressure, slurry rate, and clean volume for 179 stages, for a total of 1,530,445 rows of data per variable. Sixty-six percent of the data were used to train the model, eight percent were used to validate the model, and the remaining twenty-six percent were used to test it. Subject matter expertise, taking into account user-defined start/end time flags, was used to train the algorithm. Pumping data behaves very differently than traditional time-series data such as weather, stock prices, or population growth. The features examined are not affected by time but by physical events, so the correlation or dependency between features can affect accurate pattern recognition. To allow the algorithm to run leaner, the dataset was pre-processed using loss functions, smoothing techniques, and the rate of change of the main data channels. To understand how data may impair the predictions and to evaluate different model performances, we tested two classification algorithms: logistic regression and support vector machine. Classification techniques were used to generate an accurate suggestion of where the pumping of a hydraulic fracturing stage starts and ends in a high-frequency treating plot. Results show that flag predictions have a training and validation accuracy of approximately 90 percent using logistic regression algorithms. The predicted flags were within 10 seconds of the manual selected flag. A limitation of this method is that it requires periodic retraining with new field data to improve the prediction robustness and to maintain high accuracy. Accurate start and end time selections make it not only viable to process large volumes of fracture treatment data but also reduce the time spent reviewing field data for quality control. Petroleum engineers need to continue their focus on optimizing their systems with the greatest possible efficiency. Leveraging common analytical methods combined with the large, structured datasets that are readily available provide impressive results without extensive programming knowledge.
基于机器学习的时间序列Frac数据事件识别
水力压裂泵送数据每隔一秒在现场进行记录和绘制。阶段开始和结束时间的指定非常重要,因为这些边界决定了总体计算,如压力、速率和浓度。由于整个行业缺乏统一的选择和解释方法,手动选择阶段标志通常非常耗时,而且容易出现不准确的情况。本研究的目的是演示使用机器学习算法在高频处理地块中识别准确和一致的阶段开始和结束时间的自动化过程。本研究基于对计量高频治疗数据的分析,并结合监督机器学习算法。泵送数据集包括179级的处理压力、泥浆速率和清洁体积,每个变量总共有1,530,445行数据。66%的数据用于训练模型,8%用于验证模型,剩下的26%用于测试模型。考虑到用户定义的开始/结束时间标志,使用主题专业知识来训练算法。泵送数据的行为与传统的时间序列数据(如天气、股票价格或人口增长)非常不同。所检测的特征不受时间的影响,而是受物理事件的影响,因此特征之间的相关性或依赖性会影响准确的模式识别。为了使算法运行更精简,使用损失函数、平滑技术和主要数据通道的变化率对数据集进行预处理。为了了解数据如何影响预测并评估不同的模型性能,我们测试了两种分类算法:逻辑回归和支持向量机。在高频处理区中,使用分类技术可以准确地确定水力压裂阶段的泵送开始和结束位置。结果表明,使用逻辑回归算法,标志预测的训练和验证精度约为90%。预测的旗帜在手动选择旗帜的10秒内。该方法的一个局限性是需要对新的现场数据进行周期性的再训练,以提高预测的鲁棒性并保持较高的准确性。准确的开始和结束时间选择不仅可以处理大量的压裂数据,还可以减少审查现场数据以进行质量控制的时间。石油工程师需要继续专注于以最大可能的效率优化他们的系统。利用常见的分析方法与现成的大型结构化数据集相结合,无需丰富的编程知识即可提供令人印象深刻的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信