Min Zhang , Na Zhang , Dawei Ren , Liujun Chen , Ya Yao , Shengshuai Su , Han Wang , Yongxin Hu
{"title":"Novel enhanced oil recovery screening methodologies by implementing improved stacking ensemble learning algorithms","authors":"Min Zhang , Na Zhang , Dawei Ren , Liujun Chen , Ya Yao , Shengshuai Su , Han Wang , Yongxin Hu","doi":"10.1016/j.geoen.2025.214104","DOIUrl":null,"url":null,"abstract":"<div><div>Precise and dependable screening for Enhanced Oil Recovery (EOR) is essential for the optimal planning and design of EOR projects. The adoption of machine learning-based models for EOR screening presents a promising solution to the challenges inherent in the process. Nonetheless, issues such as data imbalance and overfitting pose significant hurdles to enhancing the predictive accuracy of these models. The goal of this research is to develop innovative EOR screening models utilizing an improved Stacking ensemble learning approach to address these challenges, thereby offering reservoir engineers more precise guidance for the swift and effective selection of the most appropriate EOR methods. In this study, traditional models such as Random Forests (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Networks (ANN), Decision Tree (DT), Support Vector Machine (SVM) and Logistic Regression (LR) were employed to establish classical EOR screening frameworks. Then based on the improved Stacking ensemble learning, these models were integrated to form advanced EOR ensemble screening models. To comprehensively assess model performance in the context of data imbalance, in addition to conventional evaluation indicators, the Kappa coefficient and the Matthews Correlation Coefficient (MCC) were introduced as novel evaluation metrics. The findings indicate that the accuracy, Kappa coefficient and MCC value of the newly developed models can reach up to 96.87 %, 0.955 and 0.955, which are much higher than other EOR screening models. The new models can provide more accurate and faster EOR screening decision support for reservoir engineers.</div></div>","PeriodicalId":100578,"journal":{"name":"Geoenergy Science and Engineering","volume":"255 ","pages":"Article 214104"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoenergy Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949891025004622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Precise and dependable screening for Enhanced Oil Recovery (EOR) is essential for the optimal planning and design of EOR projects. The adoption of machine learning-based models for EOR screening presents a promising solution to the challenges inherent in the process. Nonetheless, issues such as data imbalance and overfitting pose significant hurdles to enhancing the predictive accuracy of these models. The goal of this research is to develop innovative EOR screening models utilizing an improved Stacking ensemble learning approach to address these challenges, thereby offering reservoir engineers more precise guidance for the swift and effective selection of the most appropriate EOR methods. In this study, traditional models such as Random Forests (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Networks (ANN), Decision Tree (DT), Support Vector Machine (SVM) and Logistic Regression (LR) were employed to establish classical EOR screening frameworks. Then based on the improved Stacking ensemble learning, these models were integrated to form advanced EOR ensemble screening models. To comprehensively assess model performance in the context of data imbalance, in addition to conventional evaluation indicators, the Kappa coefficient and the Matthews Correlation Coefficient (MCC) were introduced as novel evaluation metrics. The findings indicate that the accuracy, Kappa coefficient and MCC value of the newly developed models can reach up to 96.87 %, 0.955 and 0.955, which are much higher than other EOR screening models. The new models can provide more accurate and faster EOR screening decision support for reservoir engineers.