Facies classification using k-means clustering algorithm in Mara Field, Niger Delta, Nigeria

IF 1.827 Q2 Earth and Planetary Sciences
Esther Kerubo, Moruffdeen Adedapo Adabanija, Olatunbosun Adedayo Alao
{"title":"Facies classification using k-means clustering algorithm in Mara Field, Niger Delta, Nigeria","authors":"Esther Kerubo,&nbsp;Moruffdeen Adedapo Adabanija,&nbsp;Olatunbosun Adedayo Alao","doi":"10.1007/s12517-025-12311-4","DOIUrl":null,"url":null,"abstract":"<div><p>An integrated <i>k</i>-means clustering of well log data from Mara field, Niger Delta, Nigeria, has been carried out. This is with a view to segmenting well data into different facies based on their physical and geological properties. The relationship between the cluster labels and the facies types was studied using cross-plots, histograms, and statistical analysis. The results obtained from cluster prediction were compared with conventional methods of well log interpretation. Three well datasets from Mara field (Mara-1, Mara-2, and Mara-3) containing gamma ray, neutron porosity, density, and deep resistivity logs were used. The well data was subjected to data preprocessing, exploratory data analysis, outlier detection and removal, feature selection, and scaling to make the data more suitable for machine learning (ML) methods. Due to missing data in density and neutron porosity logs that might have occurred as a result of various reasons, including tool failures, depth misalignments, and manual removal of bad data, Mara-3 well was dropped for clustering as the issue could significantly impact petrophysical analyses and machine learning model performance. The <i>k</i>-means clustering algorithm was implemented using the Scikit-learn library. The elbow method and silhouette score were then applied to cluster the datasets as well as evaluate the number of clusters. The elbow method approximated the cluster level to be at 3, while with further evaluation, the silhouette score gave the optimum level of clustering with its highest value at cluster level of 2. A cluster level of 2 was selected to be the best with the highest score of 0.552, denoting that the data points are very compact within the cluster to which they belong. Based on the clustering results, different facies (shale and sandstone) were recognized successfully. The reservoir unit of sandstone and shale intercalations was delineated from the two wells and a dynamic depositional environment. Comparison of the identified facies units with conventional method of interpretation showed that the <i>k</i>-means algorithm was able to cluster the data and correlate them with depth.</p></div>","PeriodicalId":476,"journal":{"name":"Arabian Journal of Geosciences","volume":"18 9","pages":""},"PeriodicalIF":1.8270,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal of Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s12517-025-12311-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

An integrated k-means clustering of well log data from Mara field, Niger Delta, Nigeria, has been carried out. This is with a view to segmenting well data into different facies based on their physical and geological properties. The relationship between the cluster labels and the facies types was studied using cross-plots, histograms, and statistical analysis. The results obtained from cluster prediction were compared with conventional methods of well log interpretation. Three well datasets from Mara field (Mara-1, Mara-2, and Mara-3) containing gamma ray, neutron porosity, density, and deep resistivity logs were used. The well data was subjected to data preprocessing, exploratory data analysis, outlier detection and removal, feature selection, and scaling to make the data more suitable for machine learning (ML) methods. Due to missing data in density and neutron porosity logs that might have occurred as a result of various reasons, including tool failures, depth misalignments, and manual removal of bad data, Mara-3 well was dropped for clustering as the issue could significantly impact petrophysical analyses and machine learning model performance. The k-means clustering algorithm was implemented using the Scikit-learn library. The elbow method and silhouette score were then applied to cluster the datasets as well as evaluate the number of clusters. The elbow method approximated the cluster level to be at 3, while with further evaluation, the silhouette score gave the optimum level of clustering with its highest value at cluster level of 2. A cluster level of 2 was selected to be the best with the highest score of 0.552, denoting that the data points are very compact within the cluster to which they belong. Based on the clustering results, different facies (shale and sandstone) were recognized successfully. The reservoir unit of sandstone and shale intercalations was delineated from the two wells and a dynamic depositional environment. Comparison of the identified facies units with conventional method of interpretation showed that the k-means algorithm was able to cluster the data and correlate them with depth.

Abstract Image

Abstract Image

基于k-means聚类算法的尼日利亚尼日尔三角洲Mara油田相分类
对尼日利亚尼日尔三角洲Mara油田的测井数据进行了综合k-means聚类。这是为了根据井的物理和地质性质将井数据划分为不同的相。通过交叉图、直方图和统计分析,研究了聚类标签与相类型之间的关系。将聚类预测结果与常规测井解释方法进行了比较。使用了Mara油田的3口井数据集(Mara-1、Mara-2和Mara-3),包括伽马射线、中子孔隙度、密度和深部电阻率测井。井数据经过数据预处理、探索性数据分析、异常值检测和去除、特征选择和缩放,使数据更适合机器学习(ML)方法。由于各种原因导致密度和中子孔隙度测井数据丢失,包括工具故障、深度对准错误和人工删除错误数据,因此放弃了Mara-3井进行聚类,因为该问题可能会严重影响岩石物理分析和机器学习模型的性能。k-means聚类算法使用Scikit-learn库实现。然后应用肘法和轮廓评分对数据集进行聚类以及评估聚类的数量。肘部法将聚类水平近似为3,而进一步评价,剪影评分给出了最佳聚类水平,聚类水平最高为2。选择聚类水平为2为最佳,得分最高,为0.552,说明数据点在所属的聚类内非常紧凑。根据聚类结果,成功识别了不同的相(页岩和砂岩)。根据两口井和动态沉积环境圈定了砂岩和页岩夹层的储层单元。将识别的相单元与常规解释方法进行比较,表明k-means算法能够将数据聚类并与深度关联。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Arabian Journal of Geosciences
Arabian Journal of Geosciences GEOSCIENCES, MULTIDISCIPLINARY-
自引率
0.00%
发文量
1587
审稿时长
6.7 months
期刊介绍: The Arabian Journal of Geosciences is the official journal of the Saudi Society for Geosciences and publishes peer-reviewed original and review articles on the entire range of Earth Science themes, focused on, but not limited to, those that have regional significance to the Middle East and the Euro-Mediterranean Zone. Key topics therefore include; geology, hydrogeology, earth system science, petroleum sciences, geophysics, seismology and crustal structures, tectonics, sedimentology, palaeontology, metamorphic and igneous petrology, natural hazards, environmental sciences and sustainable development, geoarchaeology, geomorphology, paleo-environment studies, oceanography, atmospheric sciences, GIS and remote sensing, geodesy, mineralogy, volcanology, geochemistry and metallogenesis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信