Separate Classification Prediction Model for Lithofacies Identification of Paleogene Yingxiongling Shale, Qaidam Basin

IF 5.2 3区 工程技术 Q2 ENERGY & FUELS
Yue Shen, Songtao Wu*, Yinghao Shen, Kunyu Wu, Yafeng Li, Di Zhang, Haoting Xing and Chanfei Wang, 
{"title":"Separate Classification Prediction Model for Lithofacies Identification of Paleogene Yingxiongling Shale, Qaidam Basin","authors":"Yue Shen,&nbsp;Songtao Wu*,&nbsp;Yinghao Shen,&nbsp;Kunyu Wu,&nbsp;Yafeng Li,&nbsp;Di Zhang,&nbsp;Haoting Xing and Chanfei Wang,&nbsp;","doi":"10.1021/acs.energyfuels.5c0041410.1021/acs.energyfuels.5c00414","DOIUrl":null,"url":null,"abstract":"<p >Abundant shale oil resources have been discovered in the upper member of the Paleogene Lower Ganchaigou Formation of the Yingxiongling area from the Qaidam Basin, China. The lithofacies of Yingxiongling shale oil exhibit strong heterogeneity vertically. Accurate lithofacies identification is the key to characterizing the potential of unconventional oil and gas resources. Traditional lithofacies identification is limited by factors such as the duration of experiments and the subjectivity of the scholars. Only a limited amount of coring section data is available for analysis, while a sea of logging data remains underutilized. Therefore, utilizing machine learning algorithms to effectively leverage logging data for constructing the accurate lithofacies identification model has become a crucial area in both academia and industry. In this paper, 15 basic logging curves were used, and algorithms of random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost) were selected through Python programming to establish machine learning classification models, identifying the lithofacies types of Yingxiongling shale and analyzing the results. The lithofacies classification scheme of Yingxiongling shale is based on “rock structure + mineral composition”, developing 8 lithofacies types: thin-bedded/laminated dolomitic limestone, thin-bedded/laminated limy dolostone, thin-bedded sandstone, laminated shale, and thin-bedded/laminated mixed rock. Due to the differing sensitivities of various logging data in identifying rock structures and mineral compositions, the corresponding algorithms and parameters vary accordingly. Hence, an innovative stepwise prediction model integrating “sedimentary structures and mineral composition” is proposed. The model first identified the rock structure through the genetic algorithm-RF and 15 logging curves, yielding thin-bedded/laminated structures. Then, SVM and 9 logging curves were used to identify mineral composition, yielding limy dolostone, dolomitic limestone, sandstone, shale, and mixed rock. The lithofacies were obtained by integrating the predicted results from the two models. The maximum accuracy of identifying rock structure and mineral composition can reach 87.3% and 78.7%, respectively, and the maximum prediction accuracy of the separate prediction model reached 73.2%, which is 22% higher than that of the direct prediction model. The relationship between the well logging curves and the predicted results is discussed, and the reasons for errors will be explained. These understandings can further help provide new ideas and methods for the identification of shale lithofacies types and can provide scientific guidance and technical support for the exploration and development of the Qaidam Basin.</p>","PeriodicalId":35,"journal":{"name":"Energy & Fuels","volume":"39 16","pages":"7751–7765 7751–7765"},"PeriodicalIF":5.2000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy & Fuels","FirstCategoryId":"5","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.energyfuels.5c00414","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

Abundant shale oil resources have been discovered in the upper member of the Paleogene Lower Ganchaigou Formation of the Yingxiongling area from the Qaidam Basin, China. The lithofacies of Yingxiongling shale oil exhibit strong heterogeneity vertically. Accurate lithofacies identification is the key to characterizing the potential of unconventional oil and gas resources. Traditional lithofacies identification is limited by factors such as the duration of experiments and the subjectivity of the scholars. Only a limited amount of coring section data is available for analysis, while a sea of logging data remains underutilized. Therefore, utilizing machine learning algorithms to effectively leverage logging data for constructing the accurate lithofacies identification model has become a crucial area in both academia and industry. In this paper, 15 basic logging curves were used, and algorithms of random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost) were selected through Python programming to establish machine learning classification models, identifying the lithofacies types of Yingxiongling shale and analyzing the results. The lithofacies classification scheme of Yingxiongling shale is based on “rock structure + mineral composition”, developing 8 lithofacies types: thin-bedded/laminated dolomitic limestone, thin-bedded/laminated limy dolostone, thin-bedded sandstone, laminated shale, and thin-bedded/laminated mixed rock. Due to the differing sensitivities of various logging data in identifying rock structures and mineral compositions, the corresponding algorithms and parameters vary accordingly. Hence, an innovative stepwise prediction model integrating “sedimentary structures and mineral composition” is proposed. The model first identified the rock structure through the genetic algorithm-RF and 15 logging curves, yielding thin-bedded/laminated structures. Then, SVM and 9 logging curves were used to identify mineral composition, yielding limy dolostone, dolomitic limestone, sandstone, shale, and mixed rock. The lithofacies were obtained by integrating the predicted results from the two models. The maximum accuracy of identifying rock structure and mineral composition can reach 87.3% and 78.7%, respectively, and the maximum prediction accuracy of the separate prediction model reached 73.2%, which is 22% higher than that of the direct prediction model. The relationship between the well logging curves and the predicted results is discussed, and the reasons for errors will be explained. These understandings can further help provide new ideas and methods for the identification of shale lithofacies types and can provide scientific guidance and technical support for the exploration and development of the Qaidam Basin.

Abstract Image

柴达木盆地古近系英雄岭页岩岩相分离分类预测模型
柴达木盆地雄雄岭地区古近系下干柴沟组上段发现了丰富的页岩油资源。雄雄岭页岩油岩相纵向上表现出较强的非均质性。准确的岩相识别是表征非常规油气资源潜力的关键。传统的岩相鉴定受到实验时间长、学者主观性等因素的限制。只有有限数量的取心剖面数据可用于分析,而大量的测井数据仍未得到充分利用。因此,利用机器学习算法有效地利用测井数据构建精确的岩相识别模型已成为学术界和工业界的一个重要研究领域。本文利用15条基本测井曲线,通过Python编程选择随机森林(random forest, RF)、支持向量机(support vector machine, SVM)和极限梯度提升(extreme gradient boost, XGBoost)算法,建立机器学习分类模型,识别英雄岭页岩岩相类型,并对结果进行分析。英雄岭页岩岩相分类方案以“岩石结构+矿物组成”为基础,发育薄层状/层状白云岩、薄层状/层状灰岩、薄层状砂岩、层状页岩、薄层状/层状混合岩8种岩相类型。由于各种测井资料在识别岩石结构和矿物成分方面的敏感性不同,相应的算法和参数也有所不同。在此基础上,提出了一种“沉积构造与矿物组成”相结合的分步预测模型。该模型首先通过遗传算法- rf和15条测井曲线识别岩石结构,得到薄层状/层状结构。然后利用支持向量机和9条测井曲线识别矿物组成,分别为灰质白云岩、白云岩、砂岩、页岩和混合岩。结合两种模型的预测结果,得到了相应的岩相。识别岩石结构和矿物成分的最大精度分别可达87.3%和78.7%,单独预测模型的最大预测精度达到73.2%,比直接预测模型提高了22%。讨论了测井曲线与预测结果的关系,并解释了误差产生的原因。这些认识有助于进一步为页岩岩相类型的识别提供新的思路和方法,为柴达木盆地的勘探开发提供科学指导和技术支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Energy & Fuels
Energy & Fuels 工程技术-工程:化工
CiteScore
9.20
自引率
13.20%
发文量
1101
审稿时长
2.1 months
期刊介绍: Energy & Fuels publishes reports of research in the technical area defined by the intersection of the disciplines of chemistry and chemical engineering and the application domain of non-nuclear energy and fuels. This includes research directed at the formation of, exploration for, and production of fossil fuels and biomass; the properties and structure or molecular composition of both raw fuels and refined products; the chemistry involved in the processing and utilization of fuels; fuel cells and their applications; and the analytical and instrumental techniques used in investigations of the foregoing areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信