Felipe da Costa Pereira , Pedro Henrique Cardoso Paulo , Márcio da Silveira Carvalho , Helon Vicente Hultmann Ayala
{"title":"Virtual flow metering using stacking ensemble techniques and nonlinear system identification","authors":"Felipe da Costa Pereira , Pedro Henrique Cardoso Paulo , Márcio da Silveira Carvalho , Helon Vicente Hultmann Ayala","doi":"10.1016/j.fuel.2025.134893","DOIUrl":null,"url":null,"abstract":"<div><div>In petroleum production systems, data-driven virtual flow meters (VFMs) offer a viable option to physical hardware or first-principle models for predicting multiphase flow rates. Although the existing literature examines various techniques, it overlooks multi-model strategies and system identification methods. This study addresses these gaps in research by employing stacked generalization models and black-box system identification to improve production rate predictions. Stacked ensembles that combine prefitted estimators surpassed all other advanced machine learning algorithms evaluated, outperforming the best non-linear models by 5 % to 37 % and the top ensemble approaches by 1.3 % to 4.5 % in terms of root mean squared error (RMSE). System identification emerged as an effective method for evaluating optimal lags and temporal dependencies, while stacking ensembles improved well-rate prediction by merging different base learners, which can perform better for different instances of data. Additionally, free simulation runs predict long-term historical production based only on initial conditions. The strategy proposed in this study increases the reliability of VFM models by stacking various base learners and optimizing the model order, presenting a novel approach for the design of data-driven VFM estimators.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":"394 ","pages":"Article 134893"},"PeriodicalIF":6.7000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236125006179","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
In petroleum production systems, data-driven virtual flow meters (VFMs) offer a viable option to physical hardware or first-principle models for predicting multiphase flow rates. Although the existing literature examines various techniques, it overlooks multi-model strategies and system identification methods. This study addresses these gaps in research by employing stacked generalization models and black-box system identification to improve production rate predictions. Stacked ensembles that combine prefitted estimators surpassed all other advanced machine learning algorithms evaluated, outperforming the best non-linear models by 5 % to 37 % and the top ensemble approaches by 1.3 % to 4.5 % in terms of root mean squared error (RMSE). System identification emerged as an effective method for evaluating optimal lags and temporal dependencies, while stacking ensembles improved well-rate prediction by merging different base learners, which can perform better for different instances of data. Additionally, free simulation runs predict long-term historical production based only on initial conditions. The strategy proposed in this study increases the reliability of VFM models by stacking various base learners and optimizing the model order, presenting a novel approach for the design of data-driven VFM estimators.
期刊介绍:
The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.