Riadh Al Dwood , Qingbang Meng , AL-Wesabi Ibrahim , Wahib Ali Yahya , Ahmed .G. Alareqi , Ghmdan AL-Khulaidi
{"title":"用于预测石油和天然气生产率的新型混合 ANN-GB-LR 模型","authors":"Riadh Al Dwood , Qingbang Meng , AL-Wesabi Ibrahim , Wahib Ali Yahya , Ahmed .G. Alareqi , Ghmdan AL-Khulaidi","doi":"10.1016/j.flowmeasinst.2024.102690","DOIUrl":null,"url":null,"abstract":"<div><p>The Oil and Gas Production Rate (OGPR) is one of the most significant processes that play an essential role in the oil industry. Predicting OGPR is critical for effective reservoir management and enhancing oil recovery. Traditional methods (TMs) and numerical simulations (NS) often struggle to process and analyze nonlinear, complex, and massive datasets. To avoid these challenges, artificial intelligence (AI) techniques and machine learning (ML) models have been proposed as an alternative solution due to their high efficiency and rapidity in handling complex data. In this study, a new hybrid model is developed by combining the strengths of Artificial Neural Networks (ANN) and Gradient Boosting (GB), using Linear Regression (LR) as a meta-model by stacking technique. It captures nonlinear relationships effectively and manages outliers, enhancing prediction accuracy. The novelty of this study lies in the hybrid ANN-GB-LR model's ability to integrate various machine learning techniques into a robust framework, leveraging the high learning capacity of ANN, the robust handling of outliers by GB, and the straightforward interpretability of LR. This creative combination handles the limitations of individual models and enhances the general predictive performance. The model was trained and tested using actual field data from the Halewah field in Yemen. Evaluation metrics, including Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and R-squared (R<sup>2</sup>), were utilized to evaluate and compare the hybrid model with other ML models: Random Forest (RF), XGBoost (XGB), LR, Light Gradient Boosting Machine (LGBM), GB, and K-nearest neighbors (KNN). The hybrid ANN-GB-LR model achieved superior results, with an R<sup>2</sup> of 0.998, an RMSE of 11.06 for oil flow rate predictions, and an R<sup>2</sup> of 0.98 and an RMSE of 172.15 for gas flow rate predictions. These results significantly surpass the other models, demonstrating the hybrid model's outstanding ability to capture complex data and provide accurate predictions. The ANN-GB-LR model surpasses Traditional Methods in predicting OGPRs. It shows a strong and reliable tool for optimizing reservoir management. This study establishes a new standard for predictive modeling in the oil industry, providing a framework for future research to apply hybrid models in handling complex datasets.</p></div>","PeriodicalId":50440,"journal":{"name":"Flow Measurement and Instrumentation","volume":"100 ","pages":"Article 102690"},"PeriodicalIF":2.3000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel hybrid ANN-GB-LR model for predicting oil and gas production rate\",\"authors\":\"Riadh Al Dwood , Qingbang Meng , AL-Wesabi Ibrahim , Wahib Ali Yahya , Ahmed .G. Alareqi , Ghmdan AL-Khulaidi\",\"doi\":\"10.1016/j.flowmeasinst.2024.102690\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The Oil and Gas Production Rate (OGPR) is one of the most significant processes that play an essential role in the oil industry. Predicting OGPR is critical for effective reservoir management and enhancing oil recovery. Traditional methods (TMs) and numerical simulations (NS) often struggle to process and analyze nonlinear, complex, and massive datasets. To avoid these challenges, artificial intelligence (AI) techniques and machine learning (ML) models have been proposed as an alternative solution due to their high efficiency and rapidity in handling complex data. In this study, a new hybrid model is developed by combining the strengths of Artificial Neural Networks (ANN) and Gradient Boosting (GB), using Linear Regression (LR) as a meta-model by stacking technique. It captures nonlinear relationships effectively and manages outliers, enhancing prediction accuracy. The novelty of this study lies in the hybrid ANN-GB-LR model's ability to integrate various machine learning techniques into a robust framework, leveraging the high learning capacity of ANN, the robust handling of outliers by GB, and the straightforward interpretability of LR. This creative combination handles the limitations of individual models and enhances the general predictive performance. The model was trained and tested using actual field data from the Halewah field in Yemen. Evaluation metrics, including Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and R-squared (R<sup>2</sup>), were utilized to evaluate and compare the hybrid model with other ML models: Random Forest (RF), XGBoost (XGB), LR, Light Gradient Boosting Machine (LGBM), GB, and K-nearest neighbors (KNN). The hybrid ANN-GB-LR model achieved superior results, with an R<sup>2</sup> of 0.998, an RMSE of 11.06 for oil flow rate predictions, and an R<sup>2</sup> of 0.98 and an RMSE of 172.15 for gas flow rate predictions. These results significantly surpass the other models, demonstrating the hybrid model's outstanding ability to capture complex data and provide accurate predictions. The ANN-GB-LR model surpasses Traditional Methods in predicting OGPRs. It shows a strong and reliable tool for optimizing reservoir management. This study establishes a new standard for predictive modeling in the oil industry, providing a framework for future research to apply hybrid models in handling complex datasets.</p></div>\",\"PeriodicalId\":50440,\"journal\":{\"name\":\"Flow Measurement and Instrumentation\",\"volume\":\"100 \",\"pages\":\"Article 102690\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Flow Measurement and Instrumentation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0955598624001705\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MECHANICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow Measurement and Instrumentation","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0955598624001705","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
引用次数: 0
摘要
油气生产率(OGPR)是在石油工业中发挥重要作用的最重要过程之一。预测 OGPR 对于有效管理油藏和提高石油采收率至关重要。传统方法(TMs)和数值模拟(NS)往往难以处理和分析非线性、复杂和海量的数据集。为了避免这些挑战,人工智能(AI)技术和机器学习(ML)模型因其处理复杂数据的高效性和快速性而被提出作为替代解决方案。本研究结合了人工神经网络(ANN)和梯度提升(GB)的优势,通过堆叠技术使用线性回归(LR)作为元模型,开发了一种新的混合模型。它能有效捕捉非线性关系并管理异常值,从而提高预测精度。这项研究的新颖之处在于混合 ANN-GB-LR 模型能够将各种机器学习技术整合到一个稳健的框架中,充分利用了 ANN 的高学习能力、GB 对异常值的稳健处理以及 LR 的直接可解释性。这种创造性的组合处理了单个模型的局限性,提高了总体预测性能。该模型使用也门 Halewah 油田的实际现场数据进行了训练和测试。评估指标包括均方根误差(RMSE)、平均绝对误差(MAE)和 R 平方(R2),用于评估和比较混合模型与其他 ML 模型:随机森林 (RF)、XGBoost (XGB)、LR、光梯度提升机 (LGBM)、GB 和 K-nearest neighbors (KNN)。混合 ANN-GB-LR 模型取得了优异的结果,其 R2 为 0.998,石油流量预测的 RMSE 为 11.06,气体流量预测的 R2 为 0.98,RMSE 为 172.15。这些结果大大超过了其他模型,证明了混合模型捕捉复杂数据和提供精确预测的出色能力。ANN-GB-LR 模型在预测 OGPR 方面超越了传统方法。它为优化油藏管理提供了一个强大而可靠的工具。这项研究为石油行业的预测建模建立了一个新标准,为未来应用混合模型处理复杂数据集的研究提供了一个框架。
A novel hybrid ANN-GB-LR model for predicting oil and gas production rate
The Oil and Gas Production Rate (OGPR) is one of the most significant processes that play an essential role in the oil industry. Predicting OGPR is critical for effective reservoir management and enhancing oil recovery. Traditional methods (TMs) and numerical simulations (NS) often struggle to process and analyze nonlinear, complex, and massive datasets. To avoid these challenges, artificial intelligence (AI) techniques and machine learning (ML) models have been proposed as an alternative solution due to their high efficiency and rapidity in handling complex data. In this study, a new hybrid model is developed by combining the strengths of Artificial Neural Networks (ANN) and Gradient Boosting (GB), using Linear Regression (LR) as a meta-model by stacking technique. It captures nonlinear relationships effectively and manages outliers, enhancing prediction accuracy. The novelty of this study lies in the hybrid ANN-GB-LR model's ability to integrate various machine learning techniques into a robust framework, leveraging the high learning capacity of ANN, the robust handling of outliers by GB, and the straightforward interpretability of LR. This creative combination handles the limitations of individual models and enhances the general predictive performance. The model was trained and tested using actual field data from the Halewah field in Yemen. Evaluation metrics, including Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and R-squared (R2), were utilized to evaluate and compare the hybrid model with other ML models: Random Forest (RF), XGBoost (XGB), LR, Light Gradient Boosting Machine (LGBM), GB, and K-nearest neighbors (KNN). The hybrid ANN-GB-LR model achieved superior results, with an R2 of 0.998, an RMSE of 11.06 for oil flow rate predictions, and an R2 of 0.98 and an RMSE of 172.15 for gas flow rate predictions. These results significantly surpass the other models, demonstrating the hybrid model's outstanding ability to capture complex data and provide accurate predictions. The ANN-GB-LR model surpasses Traditional Methods in predicting OGPRs. It shows a strong and reliable tool for optimizing reservoir management. This study establishes a new standard for predictive modeling in the oil industry, providing a framework for future research to apply hybrid models in handling complex datasets.
期刊介绍:
Flow Measurement and Instrumentation is dedicated to disseminating the latest research results on all aspects of flow measurement, in both closed conduits and open channels. The design of flow measurement systems involves a wide variety of multidisciplinary activities including modelling the flow sensor, the fluid flow and the sensor/fluid interactions through the use of computation techniques; the development of advanced transducer systems and their associated signal processing and the laboratory and field assessment of the overall system under ideal and disturbed conditions.
FMI is the essential forum for critical information exchange, and contributions are particularly encouraged in the following areas of interest:
Modelling: the application of mathematical and computational modelling to the interaction of fluid dynamics with flowmeters, including flowmeter behaviour, improved flowmeter design and installation problems. Application of CAD/CAE techniques to flowmeter modelling are eligible.
Design and development: the detailed design of the flowmeter head and/or signal processing aspects of novel flowmeters. Emphasis is given to papers identifying new sensor configurations, multisensor flow measurement systems, non-intrusive flow metering techniques and the application of microelectronic techniques in smart or intelligent systems.
Calibration techniques: including descriptions of new or existing calibration facilities and techniques, calibration data from different flowmeter types, and calibration intercomparison data from different laboratories.
Installation effect data: dealing with the effects of non-ideal flow conditions on flowmeters. Papers combining a theoretical understanding of flowmeter behaviour with experimental work are particularly welcome.