Novel enhanced oil recovery screening methodologies by implementing improved stacking ensemble learning algorithms

IF 4.6 0 ENERGY & FUELS
Min Zhang , Na Zhang , Dawei Ren , Liujun Chen , Ya Yao , Shengshuai Su , Han Wang , Yongxin Hu
{"title":"Novel enhanced oil recovery screening methodologies by implementing improved stacking ensemble learning algorithms","authors":"Min Zhang ,&nbsp;Na Zhang ,&nbsp;Dawei Ren ,&nbsp;Liujun Chen ,&nbsp;Ya Yao ,&nbsp;Shengshuai Su ,&nbsp;Han Wang ,&nbsp;Yongxin Hu","doi":"10.1016/j.geoen.2025.214104","DOIUrl":null,"url":null,"abstract":"<div><div>Precise and dependable screening for Enhanced Oil Recovery (EOR) is essential for the optimal planning and design of EOR projects. The adoption of machine learning-based models for EOR screening presents a promising solution to the challenges inherent in the process. Nonetheless, issues such as data imbalance and overfitting pose significant hurdles to enhancing the predictive accuracy of these models. The goal of this research is to develop innovative EOR screening models utilizing an improved Stacking ensemble learning approach to address these challenges, thereby offering reservoir engineers more precise guidance for the swift and effective selection of the most appropriate EOR methods. In this study, traditional models such as Random Forests (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Networks (ANN), Decision Tree (DT), Support Vector Machine (SVM) and Logistic Regression (LR) were employed to establish classical EOR screening frameworks. Then based on the improved Stacking ensemble learning, these models were integrated to form advanced EOR ensemble screening models. To comprehensively assess model performance in the context of data imbalance, in addition to conventional evaluation indicators, the Kappa coefficient and the Matthews Correlation Coefficient (MCC) were introduced as novel evaluation metrics. The findings indicate that the accuracy, Kappa coefficient and MCC value of the newly developed models can reach up to 96.87 %, 0.955 and 0.955, which are much higher than other EOR screening models. The new models can provide more accurate and faster EOR screening decision support for reservoir engineers.</div></div>","PeriodicalId":100578,"journal":{"name":"Geoenergy Science and Engineering","volume":"255 ","pages":"Article 214104"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoenergy Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949891025004622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

Precise and dependable screening for Enhanced Oil Recovery (EOR) is essential for the optimal planning and design of EOR projects. The adoption of machine learning-based models for EOR screening presents a promising solution to the challenges inherent in the process. Nonetheless, issues such as data imbalance and overfitting pose significant hurdles to enhancing the predictive accuracy of these models. The goal of this research is to develop innovative EOR screening models utilizing an improved Stacking ensemble learning approach to address these challenges, thereby offering reservoir engineers more precise guidance for the swift and effective selection of the most appropriate EOR methods. In this study, traditional models such as Random Forests (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Networks (ANN), Decision Tree (DT), Support Vector Machine (SVM) and Logistic Regression (LR) were employed to establish classical EOR screening frameworks. Then based on the improved Stacking ensemble learning, these models were integrated to form advanced EOR ensemble screening models. To comprehensively assess model performance in the context of data imbalance, in addition to conventional evaluation indicators, the Kappa coefficient and the Matthews Correlation Coefficient (MCC) were introduced as novel evaluation metrics. The findings indicate that the accuracy, Kappa coefficient and MCC value of the newly developed models can reach up to 96.87 %, 0.955 and 0.955, which are much higher than other EOR screening models. The new models can provide more accurate and faster EOR screening decision support for reservoir engineers.
采用改进的叠加集成学习算法的新型提高采收率筛选方法
提高采收率(EOR)的精确、可靠筛选对于提高采收率项目的优化规划和设计至关重要。采用基于机器学习的模型进行EOR筛选,为解决该过程中固有的挑战提供了一个有希望的解决方案。然而,数据不平衡和过拟合等问题对提高这些模型的预测准确性构成了重大障碍。本研究的目标是开发创新的EOR筛选模型,利用改进的叠加集成学习方法来解决这些挑战,从而为油藏工程师提供更精确的指导,以便快速有效地选择最合适的EOR方法。本研究采用随机森林(RF)、极端梯度增强(XGBoost)、人工神经网络(ANN)、决策树(DT)、支持向量机(SVM)和逻辑回归(LR)等传统模型建立经典的提高采收率筛选框架。然后基于改进的叠加集成学习,将这些模型进行集成,形成先进的提高采收率集成筛选模型。为了全面评价数据失衡背景下的模型性能,在常规评价指标基础上,引入Kappa系数和Matthews相关系数(MCC)作为新的评价指标。结果表明,新模型的准确率、Kappa系数和MCC值分别可达96.87%、0.955和0.955,远高于其他提高采收率筛选模型。新模型可以为油藏工程师提供更准确、更快速的EOR筛选决策支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.00
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信