{"title":"基于机器学习的页岩储层有机碳综合估算方法——以加拿大阿尔伯塔省Duvernay地层为例","authors":"Gaurav Sharma, Derek Hayes","doi":"10.2118/208916-ms","DOIUrl":null,"url":null,"abstract":"\n Shale gas reservoirs have become prominent contributors to the world's hydrocarbon resources and production. They exhibit multiple storage mechanisms, two of which are linked to the free and adsorbed gas phase. Since the adsorbed gas may be stored as a denser phase than the free gas, the contribution of the adsorbed phase can be significant. The adsorbed volume is related to the total organic carbon (TOC) and thus, higher TOC can indicate higher hydrocarbon inplace. Furthermore, productivity can be linked to TOC through the potential for overpressure and conversion of kerogen to pore space. However, estimation of the TOC is not a trivial problem, as it depends on geological factors such as depositional environment. In this study, we propose an integrated workflow using concepts of machine learning to estimate TOC.\n The workflow is divided into 3 sections which are area selection, sub-region categorization, and prediction modeling. Firstly, 3 active exploration and development areas (Kaybob, Pembina, and East shale basin) of the Duvernay Formation are highlighted and the geology of each specific area is analyzed. Thereafter, using the available core data and average properties of the attributes (Gamma Ray, resistivity, density, and distance from mean vitrinite reflectance line), each area is clustered into sub-regions using SVM, logistic regression, and k-means clustering. Finally, using Random Forest prediction, models for each sub-region are developed and ranked with average mean square errors and standard deviations.\n It is observed that the Kaybob area can be clustered into 2 regions. This observation is supported by the principal component plot (PC1 vs PC2), which shows a dual cloud structure. This is further supported through clustering analysis, which also revealed the same observation. Results of the prediction modeling found random forest as the best predictor, achieving a match wiht the core data with a error less than 10% and in some cases only a 1% deviation.\n Shale reservoir characterization requires estimation of the key parameters such as TOC. However, it is difficult to estimate TOC with purely physics-based or purely statistical models, as it requires limited specialized data and is impacted by subtle variations in the reservoir. This study suggests that TOC can be accurately estimated by combining geological interpretation and machine learning based algorithms without bearing cost of the specialized data.","PeriodicalId":146458,"journal":{"name":"Day 1 Wed, March 16, 2022","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Machine Learning Based Integrated Approach to Estimate Total Organic Carbon in Shale Reservoirs – A Case Study from Duvernay Formation, Alberta Canada\",\"authors\":\"Gaurav Sharma, Derek Hayes\",\"doi\":\"10.2118/208916-ms\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Shale gas reservoirs have become prominent contributors to the world's hydrocarbon resources and production. They exhibit multiple storage mechanisms, two of which are linked to the free and adsorbed gas phase. Since the adsorbed gas may be stored as a denser phase than the free gas, the contribution of the adsorbed phase can be significant. The adsorbed volume is related to the total organic carbon (TOC) and thus, higher TOC can indicate higher hydrocarbon inplace. Furthermore, productivity can be linked to TOC through the potential for overpressure and conversion of kerogen to pore space. However, estimation of the TOC is not a trivial problem, as it depends on geological factors such as depositional environment. In this study, we propose an integrated workflow using concepts of machine learning to estimate TOC.\\n The workflow is divided into 3 sections which are area selection, sub-region categorization, and prediction modeling. Firstly, 3 active exploration and development areas (Kaybob, Pembina, and East shale basin) of the Duvernay Formation are highlighted and the geology of each specific area is analyzed. Thereafter, using the available core data and average properties of the attributes (Gamma Ray, resistivity, density, and distance from mean vitrinite reflectance line), each area is clustered into sub-regions using SVM, logistic regression, and k-means clustering. Finally, using Random Forest prediction, models for each sub-region are developed and ranked with average mean square errors and standard deviations.\\n It is observed that the Kaybob area can be clustered into 2 regions. This observation is supported by the principal component plot (PC1 vs PC2), which shows a dual cloud structure. This is further supported through clustering analysis, which also revealed the same observation. Results of the prediction modeling found random forest as the best predictor, achieving a match wiht the core data with a error less than 10% and in some cases only a 1% deviation.\\n Shale reservoir characterization requires estimation of the key parameters such as TOC. However, it is difficult to estimate TOC with purely physics-based or purely statistical models, as it requires limited specialized data and is impacted by subtle variations in the reservoir. This study suggests that TOC can be accurately estimated by combining geological interpretation and machine learning based algorithms without bearing cost of the specialized data.\",\"PeriodicalId\":146458,\"journal\":{\"name\":\"Day 1 Wed, March 16, 2022\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 1 Wed, March 16, 2022\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/208916-ms\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 1 Wed, March 16, 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/208916-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
页岩气储层已成为世界油气资源和产量的重要贡献者。它们表现出多种储存机制,其中两种与自由气相和吸附气相有关。由于吸附气可以作为比自由气更致密的相储存,因此吸附相的贡献可能是显著的。吸附体积与总有机碳(TOC)有关,TOC越高,有机质含量越高。此外,通过潜在的超压和干酪根向孔隙空间的转化,可以将生产力与TOC联系起来。然而,TOC的估算并不是一个简单的问题,它取决于沉积环境等地质因素。在这项研究中,我们提出了一个使用机器学习概念来估计TOC的集成工作流。工作流分为区域选择、子区域分类和预测建模3个部分。首先,重点介绍了Duvernay组3个活跃勘探开发区域(Kaybob、Pembina和East shale basin),并对每个区域的地质情况进行了分析。然后,利用可用的岩心数据和属性的平均属性(伽马射线、电阻率、密度以及与平均镜质组反射率线的距离),利用支持向量机、逻辑回归和k-means聚类将每个区域聚成子区域。最后,利用随机森林预测,对每个子区域建立模型,并使用平均均方误差和标准差进行排序。观察到Kaybob区域可以聚集成2个区域。这一观测结果得到了主成分图(PC1 vs PC2)的支持,它显示了双云结构。聚类分析进一步支持了这一点,聚类分析也揭示了相同的观察结果。预测建模结果发现随机森林是最好的预测器,实现了与核心数据的匹配,误差小于10%,在某些情况下只有1%的偏差。页岩储层表征需要对TOC等关键参数进行估计。然而,纯物理模型或纯统计模型很难估计TOC,因为它需要有限的专业数据,并且受储层细微变化的影响。该研究表明,通过结合地质解释和基于机器学习的算法,可以准确估算TOC,而无需承担专业数据的成本。
Machine Learning Based Integrated Approach to Estimate Total Organic Carbon in Shale Reservoirs – A Case Study from Duvernay Formation, Alberta Canada
Shale gas reservoirs have become prominent contributors to the world's hydrocarbon resources and production. They exhibit multiple storage mechanisms, two of which are linked to the free and adsorbed gas phase. Since the adsorbed gas may be stored as a denser phase than the free gas, the contribution of the adsorbed phase can be significant. The adsorbed volume is related to the total organic carbon (TOC) and thus, higher TOC can indicate higher hydrocarbon inplace. Furthermore, productivity can be linked to TOC through the potential for overpressure and conversion of kerogen to pore space. However, estimation of the TOC is not a trivial problem, as it depends on geological factors such as depositional environment. In this study, we propose an integrated workflow using concepts of machine learning to estimate TOC.
The workflow is divided into 3 sections which are area selection, sub-region categorization, and prediction modeling. Firstly, 3 active exploration and development areas (Kaybob, Pembina, and East shale basin) of the Duvernay Formation are highlighted and the geology of each specific area is analyzed. Thereafter, using the available core data and average properties of the attributes (Gamma Ray, resistivity, density, and distance from mean vitrinite reflectance line), each area is clustered into sub-regions using SVM, logistic regression, and k-means clustering. Finally, using Random Forest prediction, models for each sub-region are developed and ranked with average mean square errors and standard deviations.
It is observed that the Kaybob area can be clustered into 2 regions. This observation is supported by the principal component plot (PC1 vs PC2), which shows a dual cloud structure. This is further supported through clustering analysis, which also revealed the same observation. Results of the prediction modeling found random forest as the best predictor, achieving a match wiht the core data with a error less than 10% and in some cases only a 1% deviation.
Shale reservoir characterization requires estimation of the key parameters such as TOC. However, it is difficult to estimate TOC with purely physics-based or purely statistical models, as it requires limited specialized data and is impacted by subtle variations in the reservoir. This study suggests that TOC can be accurately estimated by combining geological interpretation and machine learning based algorithms without bearing cost of the specialized data.