Input Data Quality Influence On Lithoclass Predictions In Relation To Supervised Machine Learning

H. W. Bøe, K. B. Brandsegg, L. Marello, A. Črne
{"title":"Input Data Quality Influence On Lithoclass Predictions In Relation To Supervised Machine Learning","authors":"H. W. Bøe, K. B. Brandsegg, L. Marello, A. Črne","doi":"10.3997/2214-4609.201803032","DOIUrl":null,"url":null,"abstract":"We assess the importance of data availability and consistency prior to applying supervised machine learning for predicting lithoclasses from wireline logs. A dataset is pre-processed and used as training data by three machine learning models in order to investigate the sensitivity of the lithoclasses predictions. The first model uses the quality assured dataset without any modifications. The second model standardizes log signatures, whereas the third model uses the dataset in combination with additional features that dampens extreme outliers. The three models are evaluated against lithofacies interpretations based on CPI’s to show the varying predicting power of the models. The method is applied on a quality-controlled Jurassic interval dataset of ~100 exploration wells within a quadrant of the Norwegian part of the North Sea. The results shows that the number of wireline logs available has a direct influence on the prediction accuracy. For an acceptable prediction accuracy the wells should contain at least the gamma ray, density and neutron log. To distinguish between water-bearing and hydrocarbon-bearing intervals in sandstones the resistivity logs should also be present. When implementing machine learning on a regional scale we should consider varying burial depth and depositional environment in order to gain optimal predicting power.","PeriodicalId":231338,"journal":{"name":"First EAGE/PESGB Workshop Machine Learning","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"First EAGE/PESGB Workshop Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3997/2214-4609.201803032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We assess the importance of data availability and consistency prior to applying supervised machine learning for predicting lithoclasses from wireline logs. A dataset is pre-processed and used as training data by three machine learning models in order to investigate the sensitivity of the lithoclasses predictions. The first model uses the quality assured dataset without any modifications. The second model standardizes log signatures, whereas the third model uses the dataset in combination with additional features that dampens extreme outliers. The three models are evaluated against lithofacies interpretations based on CPI’s to show the varying predicting power of the models. The method is applied on a quality-controlled Jurassic interval dataset of ~100 exploration wells within a quadrant of the Norwegian part of the North Sea. The results shows that the number of wireline logs available has a direct influence on the prediction accuracy. For an acceptable prediction accuracy the wells should contain at least the gamma ray, density and neutron log. To distinguish between water-bearing and hydrocarbon-bearing intervals in sandstones the resistivity logs should also be present. When implementing machine learning on a regional scale we should consider varying burial depth and depositional environment in order to gain optimal predicting power.
与监督机器学习相关的输入数据质量对岩石层预测的影响
在应用监督式机器学习预测电缆测井岩石层之前,我们评估了数据可用性和一致性的重要性。为了研究岩石层预测的敏感性,对数据集进行预处理,并将其用作三个机器学习模型的训练数据。第一个模型使用没有任何修改的质量保证数据集。第二个模型将日志签名标准化,而第三个模型将数据集与其他特征结合使用,以抑制极端异常值。通过对比基于CPI的岩相解释,对三种模型进行了评价,表明模型的预测能力各不相同。该方法应用于北海挪威部分四象限内约100口探井的质量控制侏罗纪层数据集。结果表明,测井资料的数量直接影响预测精度。为了获得可接受的预测精度,井中至少应包含伽马射线、密度和中子测井。为了区分砂岩中的含水层和含油层,电阻率测井也应该存在。当在区域尺度上实施机器学习时,我们应该考虑不同的埋藏深度和沉积环境,以获得最佳的预测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信