{"title":"Predictive Analytics and Statistical Learning for Waterflooding Operations in Reservoir Simulations","authors":"X. Liao, M. Tyagi","doi":"10.1109/ICMLA.2019.00249","DOIUrl":null,"url":null,"abstract":"Recent improvements in technology and computational power have increased interest in the application of data driven modeling (DDM) in petroleum industry. Recovery process evaluation using numerical reservoir simulators are always time consuming and computational intensive with many assumptions and uncertainty involved and inefficient for fast decision making. Thus, DDM have been adopted as an alternative tool to predict production performance under waterflooding which is one of the most important techniques for improving oil recovery. A synthetic waterflooding dataset including production profile, operational parameters, reservoir properties and well locations is constructed using the numerical reservoir simulator. Exploratory data analysis provides several insights into the non-intuitive factors in building the reservoir model. K-means clustering analysis is performed to identify internal groupings among producers. Artificial neural network (ANN) and support vector regression (SVR) are used to decipher the nonlinear relationships between input attributes and waterflooding production. The trained models are subsequently used to predict cumulative oil and watercut on the unseen samples. Clustering analysis reveal that distance to the free water level has a dominant effect and the clustering assignment is controlled by the interplay among input attributes characterizing reservoir properties and relative well locations. Good agreements between predicted outputs from models and simulation targets present the satisfactory generalization performance and predictive capabilities of ANN and SVR methods. ANN model with one output provides the most accurate prediction result on the test data. SVR models provide similar but slightly worse forecast than ANN models. Proposed methodologies in this work can be utilized as a surrogate or complementary model to analyze and predict recovery process in other reservoirs fast and efficiently.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2019.00249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Recent improvements in technology and computational power have increased interest in the application of data driven modeling (DDM) in petroleum industry. Recovery process evaluation using numerical reservoir simulators are always time consuming and computational intensive with many assumptions and uncertainty involved and inefficient for fast decision making. Thus, DDM have been adopted as an alternative tool to predict production performance under waterflooding which is one of the most important techniques for improving oil recovery. A synthetic waterflooding dataset including production profile, operational parameters, reservoir properties and well locations is constructed using the numerical reservoir simulator. Exploratory data analysis provides several insights into the non-intuitive factors in building the reservoir model. K-means clustering analysis is performed to identify internal groupings among producers. Artificial neural network (ANN) and support vector regression (SVR) are used to decipher the nonlinear relationships between input attributes and waterflooding production. The trained models are subsequently used to predict cumulative oil and watercut on the unseen samples. Clustering analysis reveal that distance to the free water level has a dominant effect and the clustering assignment is controlled by the interplay among input attributes characterizing reservoir properties and relative well locations. Good agreements between predicted outputs from models and simulation targets present the satisfactory generalization performance and predictive capabilities of ANN and SVR methods. ANN model with one output provides the most accurate prediction result on the test data. SVR models provide similar but slightly worse forecast than ANN models. Proposed methodologies in this work can be utilized as a surrogate or complementary model to analyze and predict recovery process in other reservoirs fast and efficiently.