Higher-Order Derivatives of Production Rate and Convolutional Neural Network for Production Forecasts

Syed Tabish Haider, T. Patzek
{"title":"Higher-Order Derivatives of Production Rate and Convolutional Neural Network for Production Forecasts","authors":"Syed Tabish Haider, T. Patzek","doi":"10.2523/iptc-22486-ms","DOIUrl":null,"url":null,"abstract":"\n In recent years, many machine-learning models have been developed to predict future production of oil in gas in \"shales\". Long-short term memory (LSTM), the most widely used model, relies on the long-term production history for a reasonably accurate production forecast. All analytical and machine learning models, including LSTM, fail miserably in the absence of long production history. Our goal is to present a novel method of production forecasting using only 24 months of production data. The first and secondorder derivatives of the distance traveled give speed and acceleration to describe the trajectory and dynamics of a moving vehicle. Similarly, higher-order derivatives of hydrocarbon/water production rate vs. time uncover hidden patterns and fluctuations in a well that act as differential markers of its future recovery factor (RF). In this paper, we couple production data and their higher-order derivatives with other known parameters for a well, i.e., well length and initial production. The time-series data are passed into a Convolutional Neural Network (CNN) with two hidden layers of 16 nodes each, and one output layer. The model is trained to predict recovery factor (RF) in the 10th year of production. We analyze the first 24 months of production data for the Barnett (1500), Marcellus (800), Haynesville (800), and Eagle Ford (1000) shale wells. All wells have a minimum pressure interference time of 34 months. The production rate vs. time and its first, second, and third-order derivatives are coupled with the well length and initial production rate, and the data are normalized with their respective maxima. For the Barnett wells, the CNN model predicts recovery factors in their 10th year of production with an average accuracy of 90%. For the Marcellus, Haynesville, and Eagle Ford wells, the prediction accuracy in the 8th year of production is 89%, 92%, and 91%, respectively. Further, we divide the wells into three groups (A, B, C) depending on the range of their recovery factor (A:RF=0-0.3, B:RF=0.3-0.6, and C:RF=0.6-0.9). We show that the clusters of wells grouped by their RFs strongly correlate with the distribution of the higher-order de rivatives of production from these wells. Thus, we posit that the detailed production history and its derivatives are the most important variables that define distributions of maximum recoverable hydrocarbon from a source rock. Our novel method uses only 24 months of production data to predict future recovery factor with an outstanding average accuracy of 90%. We show that the higher-order derivatives of high-resolution production data available from the operators could be an excellent tool for well screening and predicting future production with reasonable accuracy.","PeriodicalId":11027,"journal":{"name":"Day 3 Wed, February 23, 2022","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 3 Wed, February 23, 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2523/iptc-22486-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, many machine-learning models have been developed to predict future production of oil in gas in "shales". Long-short term memory (LSTM), the most widely used model, relies on the long-term production history for a reasonably accurate production forecast. All analytical and machine learning models, including LSTM, fail miserably in the absence of long production history. Our goal is to present a novel method of production forecasting using only 24 months of production data. The first and secondorder derivatives of the distance traveled give speed and acceleration to describe the trajectory and dynamics of a moving vehicle. Similarly, higher-order derivatives of hydrocarbon/water production rate vs. time uncover hidden patterns and fluctuations in a well that act as differential markers of its future recovery factor (RF). In this paper, we couple production data and their higher-order derivatives with other known parameters for a well, i.e., well length and initial production. The time-series data are passed into a Convolutional Neural Network (CNN) with two hidden layers of 16 nodes each, and one output layer. The model is trained to predict recovery factor (RF) in the 10th year of production. We analyze the first 24 months of production data for the Barnett (1500), Marcellus (800), Haynesville (800), and Eagle Ford (1000) shale wells. All wells have a minimum pressure interference time of 34 months. The production rate vs. time and its first, second, and third-order derivatives are coupled with the well length and initial production rate, and the data are normalized with their respective maxima. For the Barnett wells, the CNN model predicts recovery factors in their 10th year of production with an average accuracy of 90%. For the Marcellus, Haynesville, and Eagle Ford wells, the prediction accuracy in the 8th year of production is 89%, 92%, and 91%, respectively. Further, we divide the wells into three groups (A, B, C) depending on the range of their recovery factor (A:RF=0-0.3, B:RF=0.3-0.6, and C:RF=0.6-0.9). We show that the clusters of wells grouped by their RFs strongly correlate with the distribution of the higher-order de rivatives of production from these wells. Thus, we posit that the detailed production history and its derivatives are the most important variables that define distributions of maximum recoverable hydrocarbon from a source rock. Our novel method uses only 24 months of production data to predict future recovery factor with an outstanding average accuracy of 90%. We show that the higher-order derivatives of high-resolution production data available from the operators could be an excellent tool for well screening and predicting future production with reasonable accuracy.
产量预测的高阶导数与卷积神经网络
近年来,人们开发了许多机器学习模型来预测“页岩”中天然气的未来产量。长短期记忆(LSTM)是应用最广泛的模型,它依赖于长期的生产历史来进行较为准确的产量预测。所有的分析和机器学习模型,包括LSTM,在缺乏长期生产历史的情况下都失败得很惨。我们的目标是提出一种仅使用24个月生产数据进行生产预测的新方法。行驶距离的一阶和二阶导数给出了速度和加速度,用以描述移动车辆的轨迹和动力学。同样,油气/水产量随时间的高阶导数揭示了井中隐藏的模式和波动,作为其未来采收率(RF)的差异标志。在本文中,我们将生产数据及其高阶导数与井的其他已知参数(即井长和初始产量)相结合。时间序列数据被传递到卷积神经网络(CNN)中,该网络有两个隐藏层,每个隐藏层有16个节点,还有一个输出层。该模型经过训练,可以预测第10年的采收率(RF)。我们分析了Barnett(1500口)、Marcellus(800口)、Haynesville(800口)和Eagle Ford(1000口)页岩井前24个月的产量数据。所有井的最小压力干扰时间为34个月。产量与时间的关系及其一、二、三阶导数与井长和初始产量相结合,并使用各自的最大值对数据进行归一化处理。对于Barnett井,CNN模型预测其生产第10年的采收率,平均准确率为90%。对于Marcellus、Haynesville和Eagle Ford井,在生产第8年的预测精度分别为89%、92%和91%。此外,我们根据采收率(A:RF=0-0.3, B:RF=0.3-0.6, C:RF=0.6-0.9)的范围将井分为三组(A, B, C)。研究表明,按RFs分组的井簇与这些井的高阶产量分布密切相关。因此,我们认为详细的生产历史及其衍生物是确定烃源岩最大可采烃分布的最重要变量。我们的新方法仅使用24个月的生产数据就可以预测未来的采收率,平均精度达到90%。研究表明,运营商提供的高分辨率生产数据的高阶导数可以成为筛井和预测未来产量的绝佳工具,并且具有合理的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信