Novel enhanced oil recovery screening methodologies by implementing improved stacking ensemble learning algorithms

IF 4.6 0 ENERGY & FUELS

Geoenergy Science and Engineering Pub Date : 2025-07-21 DOI:10.1016/j.geoen.2025.214104

Min Zhang , Na Zhang , Dawei Ren , Liujun Chen , Ya Yao , Shengshuai Su , Han Wang , Yongxin Hu

{"title":"Novel enhanced oil recovery screening methodologies by implementing improved stacking ensemble learning algorithms","authors":"Min Zhang , Na Zhang , Dawei Ren , Liujun Chen , Ya Yao , Shengshuai Su , Han Wang , Yongxin Hu","doi":"10.1016/j.geoen.2025.214104","DOIUrl":null,"url":null,"abstract":"<div><div>Precise and dependable screening for Enhanced Oil Recovery (EOR) is essential for the optimal planning and design of EOR projects. The adoption of machine learning-based models for EOR screening presents a promising solution to the challenges inherent in the process. Nonetheless, issues such as data imbalance and overfitting pose significant hurdles to enhancing the predictive accuracy of these models. The goal of this research is to develop innovative EOR screening models utilizing an improved Stacking ensemble learning approach to address these challenges, thereby offering reservoir engineers more precise guidance for the swift and effective selection of the most appropriate EOR methods. In this study, traditional models such as Random Forests (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Networks (ANN), Decision Tree (DT), Support Vector Machine (SVM) and Logistic Regression (LR) were employed to establish classical EOR screening frameworks. Then based on the improved Stacking ensemble learning, these models were integrated to form advanced EOR ensemble screening models. To comprehensively assess model performance in the context of data imbalance, in addition to conventional evaluation indicators, the Kappa coefficient and the Matthews Correlation Coefficient (MCC) were introduced as novel evaluation metrics. The findings indicate that the accuracy, Kappa coefficient and MCC value of the newly developed models can reach up to 96.87 %, 0.955 and 0.955, which are much higher than other EOR screening models. The new models can provide more accurate and faster EOR screening decision support for reservoir engineers.</div></div>","PeriodicalId":100578,"journal":{"name":"Geoenergy Science and Engineering","volume":"255 ","pages":"Article 214104"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoenergy Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949891025004622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

Abstract

Precise and dependable screening for Enhanced Oil Recovery (EOR) is essential for the optimal planning and design of EOR projects. The adoption of machine learning-based models for EOR screening presents a promising solution to the challenges inherent in the process. Nonetheless, issues such as data imbalance and overfitting pose significant hurdles to enhancing the predictive accuracy of these models. The goal of this research is to develop innovative EOR screening models utilizing an improved Stacking ensemble learning approach to address these challenges, thereby offering reservoir engineers more precise guidance for the swift and effective selection of the most appropriate EOR methods. In this study, traditional models such as Random Forests (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Networks (ANN), Decision Tree (DT), Support Vector Machine (SVM) and Logistic Regression (LR) were employed to establish classical EOR screening frameworks. Then based on the improved Stacking ensemble learning, these models were integrated to form advanced EOR ensemble screening models. To comprehensively assess model performance in the context of data imbalance, in addition to conventional evaluation indicators, the Kappa coefficient and the Matthews Correlation Coefficient (MCC) were introduced as novel evaluation metrics. The findings indicate that the accuracy, Kappa coefficient and MCC value of the newly developed models can reach up to 96.87 %, 0.955 and 0.955, which are much higher than other EOR screening models. The new models can provide more accurate and faster EOR screening decision support for reservoir engineers.

查看原文本刊更多论文

采用改进的叠加集成学习算法的新型提高采收率筛选方法

提高采收率（EOR）的精确、可靠筛选对于提高采收率项目的优化规划和设计至关重要。采用基于机器学习的模型进行EOR筛选，为解决该过程中固有的挑战提供了一个有希望的解决方案。然而，数据不平衡和过拟合等问题对提高这些模型的预测准确性构成了重大障碍。本研究的目标是开发创新的EOR筛选模型，利用改进的叠加集成学习方法来解决这些挑战，从而为油藏工程师提供更精确的指导，以便快速有效地选择最合适的EOR方法。本研究采用随机森林（RF）、极端梯度增强（XGBoost）、人工神经网络（ANN）、决策树（DT）、支持向量机（SVM）和逻辑回归（LR）等传统模型建立经典的提高采收率筛选框架。然后基于改进的叠加集成学习，将这些模型进行集成，形成先进的提高采收率集成筛选模型。为了全面评价数据失衡背景下的模型性能，在常规评价指标基础上，引入Kappa系数和Matthews相关系数（MCC）作为新的评价指标。结果表明，新模型的准确率、Kappa系数和MCC值分别可达96.87%、0.955和0.955，远高于其他提高采收率筛选模型。新模型可以为油藏工程师提供更准确、更快速的EOR筛选决策支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Geoenergy Science and Engineering

CiteScore

1.00

自引率

0.00%

发文量