Decoding the Stratigraphic Heterogeneity of Bengal Basin, India Using Supervised Machine Learning-A Case Study

Day 3 Wed, February 14, 2024 Pub Date : 2024-02-12 DOI:10.2523/iptc-23372-ms

Arijit Sahu, Manomita Kundu

{"title":"Decoding the Stratigraphic Heterogeneity of Bengal Basin, India Using Supervised Machine Learning-A Case Study","authors":"Arijit Sahu, Manomita Kundu","doi":"10.2523/iptc-23372-ms","DOIUrl":null,"url":null,"abstract":"\n \n \n In oil and natural gas exploration, Machine Learning (ML) has gained noteworthy prominence for its ability to decode complex subsurface geology. ML commonly applies advanced statistical algorithms to build robust predictive regression and classification models. On-land Bengal Basin tasted exploratory success recently but displays great ordeal of stratigraphic heterogeneity. This paper discusses application of support vector machine (SVM), random forest (RF) and self-organizing map (SOM) ML algorithms with supervisions towards comprehensive modeling of the complex Miocene facies from Bengal on-land area for the maiden time, difficult otherwise by subjective conventional approach.\n \n \n \n Advanced geochemical logs (such as Elemental Capture Spectroscopy, ECS) and core data usually are very useful to quantify the facies downhole. However their availability also demands increased operation time and cost. Whenever available such data can be treated seamlessly with ML to test and build quantitative facies classification model from limited resources to over a region. Facies classification by ML not only makes the most of the available data but also eliminates the undesired subjectivity in addressing the subsurface heterogeneity with higher confidence. Gamma Ray, Neutron-Density, Resistivity, Sonic derived P-wave & S-wave velocities and suitably engineered log derivatives are mathematically modeled from the study area to classify facies using SVM, RF and SOM ML algorithms with ECS and core data calibration.\n \n \n \n Miocene sediments of the study area shows presence of six distinct facies viz. Claystone, Silty Claystone, Clayey Sand, Sandy Clay, Sand and Clean Sand. Facies data are trained by ML algorithms with multifold cross validation and returns credible accuracy for SVM, RF and SOM. The statistics driven facies model has been extrapolated for area where ECS or core data are not available but common logs are and yields geologically acceptable outputs. During exploration and field development stages such ML driven quantitative facies model improves the understanding of the subsurface from reservoir and non-reservoir point of view. ML Facies modeling captures the transition from shelf to fluvial depositional environment in the study area. Association frequency of different facies helps to visualize the changes from low/transitional to higher energy regime on a fine scale within Miocene.\n \n \n \n This paper discusses appropriate workflow, SVM kernel selection and hyper-parameter optimizations for SVM, RF and SOM that dictate the quality of facies model for Bengal basin. Heterogeneous stratigraphic play of Bengal on-land area demands accurate and quantitative subsurface lithological understanding for deploying fine exploration and development strategies, which can be addressed by this study. Nonetheless RF/SVM appear to be better facies classifier than SOM for Miocene sediments of Bengal from overall classification accuracy especially for less populous facies and calculation time.\n","PeriodicalId":518539,"journal":{"name":"Day 3 Wed, February 14, 2024","volume":"30 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 3 Wed, February 14, 2024","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2523/iptc-23372-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In oil and natural gas exploration, Machine Learning (ML) has gained noteworthy prominence for its ability to decode complex subsurface geology. ML commonly applies advanced statistical algorithms to build robust predictive regression and classification models. On-land Bengal Basin tasted exploratory success recently but displays great ordeal of stratigraphic heterogeneity. This paper discusses application of support vector machine (SVM), random forest (RF) and self-organizing map (SOM) ML algorithms with supervisions towards comprehensive modeling of the complex Miocene facies from Bengal on-land area for the maiden time, difficult otherwise by subjective conventional approach. Advanced geochemical logs (such as Elemental Capture Spectroscopy, ECS) and core data usually are very useful to quantify the facies downhole. However their availability also demands increased operation time and cost. Whenever available such data can be treated seamlessly with ML to test and build quantitative facies classification model from limited resources to over a region. Facies classification by ML not only makes the most of the available data but also eliminates the undesired subjectivity in addressing the subsurface heterogeneity with higher confidence. Gamma Ray, Neutron-Density, Resistivity, Sonic derived P-wave & S-wave velocities and suitably engineered log derivatives are mathematically modeled from the study area to classify facies using SVM, RF and SOM ML algorithms with ECS and core data calibration. Miocene sediments of the study area shows presence of six distinct facies viz. Claystone, Silty Claystone, Clayey Sand, Sandy Clay, Sand and Clean Sand. Facies data are trained by ML algorithms with multifold cross validation and returns credible accuracy for SVM, RF and SOM. The statistics driven facies model has been extrapolated for area where ECS or core data are not available but common logs are and yields geologically acceptable outputs. During exploration and field development stages such ML driven quantitative facies model improves the understanding of the subsurface from reservoir and non-reservoir point of view. ML Facies modeling captures the transition from shelf to fluvial depositional environment in the study area. Association frequency of different facies helps to visualize the changes from low/transitional to higher energy regime on a fine scale within Miocene. This paper discusses appropriate workflow, SVM kernel selection and hyper-parameter optimizations for SVM, RF and SOM that dictate the quality of facies model for Bengal basin. Heterogeneous stratigraphic play of Bengal on-land area demands accurate and quantitative subsurface lithological understanding for deploying fine exploration and development strategies, which can be addressed by this study. Nonetheless RF/SVM appear to be better facies classifier than SOM for Miocene sediments of Bengal from overall classification accuracy especially for less populous facies and calculation time.

查看原文本刊更多论文

利用监督机器学习解码印度孟加拉盆地的地层异质性--案例研究

在石油和天然气勘探领域，机器学习（ML）因其解码复杂地下地质的能力而备受瞩目。机器学习通常应用先进的统计算法来建立强大的预测回归和分类模型。陆相孟加拉盆地最近在勘探方面取得了成功，但地层的异质性也是一大难题。本文讨论了支持向量机（SVM）、随机森林（RF）和自组织图（SOM）ML 算法在监督下的应用，首次对孟加拉陆地地区复杂的中新世地层进行了全面建模，否则主观的传统方法很难实现。先进的地球化学测井（如元素捕获光谱法，ECS）和岩心数据通常对量化井下岩相非常有用。不过，使用这些数据也需要增加操作时间和成本。只要有这些数据，就可以用 ML 无缝处理，在有限的资源范围内测试和建立定量的岩相分类模型。利用 ML 进行岩相分类不仅能充分利用可用数据，还能消除在以更高置信度处理地下异质性时的主观性。利用 SVM、RF 和 SOM ML 算法以及 ECS 和岩心数据校准，对研究区域的伽马射线、中子密度、电阻率、声波推导的 P 波和 S 波速度以及适当的工程测井导数进行数学建模，从而对岩层进行分类。研究区域的中新世沉积物显示存在六个不同的岩相，即粘土岩、淤泥质粘土岩、粘土质砂、砂质粘土、砂和洁净砂。面层数据通过多倍交叉验证的 ML 算法进行训练，并返回 SVM、RF 和 SOM 的可信精度。对于没有 ECS 或岩心数据，但有普通测井记录的地区，可推断出统计驱动的岩相模型，并产生地质上可接受的输出结果。在勘探和油田开发阶段，这种 ML 驱动的定量剖面模型可以从储层和非储层的角度提高对地下的认识。ML 层面模型捕捉了研究区域从陆架到河流沉积环境的过渡。不同岩相的关联频率有助于以精细的尺度直观地显示中新世从低能/过渡能到高能机制的变化。本文讨论了 SVM、RF 和 SOM 的适当工作流程、SVM 核选择和超参数优化，这些因素决定了孟加拉盆地地层模型的质量。孟加拉陆地地区地层分布不均，需要准确、定量地了解地下岩性，以制定精细的勘探和开发战略，而本研究可以解决这一问题。尽管如此，对于孟加拉中新世沉积物而言，从总体分类准确性（尤其是对于数量较少的岩相）和计算时间来看，RF/SVM 似乎比 SOM 更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Day 3 Wed, February 14, 2024

自引率

0.00%

发文量