A Comparison of Supervised Machine Learning Algorithms to Predict Porosity in Carbonate Reservoirs From Well Logs: A Case Study of Quissamã Formation Carbonate in the Campos Basin, Brazil
Gisela M. S. Almeida, Carlos H. S. Barbosa, Maira C. O. L. Santo, Luiz Landau
{"title":"A Comparison of Supervised Machine Learning Algorithms to Predict Porosity in Carbonate Reservoirs From Well Logs: A Case Study of Quissamã Formation Carbonate in the Campos Basin, Brazil","authors":"Gisela M. S. Almeida, Carlos H. S. Barbosa, Maira C. O. L. Santo, Luiz Landau","doi":"10.1111/jpg.70036","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Porosity estimation is a critical step in characterizing carbonate reservoirs, particularly given their complex pore systems and heterogeneity. This study proposes a data-driven approach to predict porosity in the Quissamã Formation, a carbonate unit in Brazil's Campos Basin, using supervised machine learning techniques. Routine core analysis data served as the target variable, while conventional well logs were used as input features. Eight machine learning algorithms were evaluated: Ridge Regression (RR), Support Vector Regression (SVR), Decision Tree (DT), Random Forest (RF), Extremely Randomized Trees (ET), K-Nearest Neighbors (KNN), and two Multilayer Perceptrons (MLP-1 and MLP-2). The models were optimized through hyperparameter tuning and validated using cross-validation. A total of 328 samples were used, of which 210 (from 3 wells) were allocated for training and validation and 118 (from 5 wells) reserved for blind-well testing, ensuring an independent evaluation of model generalization. Their performance was compared against four traditional porosity estimation methods based on individual and combined well logs. Overall, the machine learning models achieved higher accuracy, with RR demonstrating the most consistent results across wells. This work highlights the potential influence of data representativeness and tuning strategy on prediction performance. Although focused on the Quissamã Formation, the methodology is scalable and adaptable to other reservoirs. Moreover, machine learning models offer practical advantages, requiring no prior petrophysical knowledge and enabling real-time application during well logging, thereby facilitating timely and informed reservoir evaluations.</p>\n </div>","PeriodicalId":16748,"journal":{"name":"Journal of Petroleum Geology","volume":"49 2","pages":"448-465"},"PeriodicalIF":1.7000,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Petroleum Geology","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jpg.70036","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/24 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Porosity estimation is a critical step in characterizing carbonate reservoirs, particularly given their complex pore systems and heterogeneity. This study proposes a data-driven approach to predict porosity in the Quissamã Formation, a carbonate unit in Brazil's Campos Basin, using supervised machine learning techniques. Routine core analysis data served as the target variable, while conventional well logs were used as input features. Eight machine learning algorithms were evaluated: Ridge Regression (RR), Support Vector Regression (SVR), Decision Tree (DT), Random Forest (RF), Extremely Randomized Trees (ET), K-Nearest Neighbors (KNN), and two Multilayer Perceptrons (MLP-1 and MLP-2). The models were optimized through hyperparameter tuning and validated using cross-validation. A total of 328 samples were used, of which 210 (from 3 wells) were allocated for training and validation and 118 (from 5 wells) reserved for blind-well testing, ensuring an independent evaluation of model generalization. Their performance was compared against four traditional porosity estimation methods based on individual and combined well logs. Overall, the machine learning models achieved higher accuracy, with RR demonstrating the most consistent results across wells. This work highlights the potential influence of data representativeness and tuning strategy on prediction performance. Although focused on the Quissamã Formation, the methodology is scalable and adaptable to other reservoirs. Moreover, machine learning models offer practical advantages, requiring no prior petrophysical knowledge and enabling real-time application during well logging, thereby facilitating timely and informed reservoir evaluations.
期刊介绍:
Journal of Petroleum Geology is a quarterly journal devoted to the geology of oil and natural gas. Editorial preference is given to original papers on oilfield regions of the world outside North America and on topics of general application in petroleum exploration and development operations, including geochemical and geophysical studies, basin modelling and reservoir evaluation.