与监督机器学习相关的输入数据质量对岩石层预测的影响

First EAGE/PESGB Workshop Machine Learning Pub Date : 2018-11-30 DOI:10.3997/2214-4609.201803032

H. W. Bøe, K. B. Brandsegg, L. Marello, A. Črne

{"title":"与监督机器学习相关的输入数据质量对岩石层预测的影响","authors":"H. W. Bøe, K. B. Brandsegg, L. Marello, A. Črne","doi":"10.3997/2214-4609.201803032","DOIUrl":null,"url":null,"abstract":"We assess the importance of data availability and consistency prior to applying supervised machine learning for predicting lithoclasses from wireline logs. A dataset is pre-processed and used as training data by three machine learning models in order to investigate the sensitivity of the lithoclasses predictions. The first model uses the quality assured dataset without any modifications. The second model standardizes log signatures, whereas the third model uses the dataset in combination with additional features that dampens extreme outliers. The three models are evaluated against lithofacies interpretations based on CPI’s to show the varying predicting power of the models. The method is applied on a quality-controlled Jurassic interval dataset of ~100 exploration wells within a quadrant of the Norwegian part of the North Sea. The results shows that the number of wireline logs available has a direct influence on the prediction accuracy. For an acceptable prediction accuracy the wells should contain at least the gamma ray, density and neutron log. To distinguish between water-bearing and hydrocarbon-bearing intervals in sandstones the resistivity logs should also be present. When implementing machine learning on a regional scale we should consider varying burial depth and depositional environment in order to gain optimal predicting power.","PeriodicalId":231338,"journal":{"name":"First EAGE/PESGB Workshop Machine Learning","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Input Data Quality Influence On Lithoclass Predictions In Relation To Supervised Machine Learning\",\"authors\":\"H. W. Bøe, K. B. Brandsegg, L. Marello, A. Črne\",\"doi\":\"10.3997/2214-4609.201803032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We assess the importance of data availability and consistency prior to applying supervised machine learning for predicting lithoclasses from wireline logs. A dataset is pre-processed and used as training data by three machine learning models in order to investigate the sensitivity of the lithoclasses predictions. The first model uses the quality assured dataset without any modifications. The second model standardizes log signatures, whereas the third model uses the dataset in combination with additional features that dampens extreme outliers. The three models are evaluated against lithofacies interpretations based on CPI’s to show the varying predicting power of the models. The method is applied on a quality-controlled Jurassic interval dataset of ~100 exploration wells within a quadrant of the Norwegian part of the North Sea. The results shows that the number of wireline logs available has a direct influence on the prediction accuracy. For an acceptable prediction accuracy the wells should contain at least the gamma ray, density and neutron log. To distinguish between water-bearing and hydrocarbon-bearing intervals in sandstones the resistivity logs should also be present. When implementing machine learning on a regional scale we should consider varying burial depth and depositional environment in order to gain optimal predicting power.\",\"PeriodicalId\":231338,\"journal\":{\"name\":\"First EAGE/PESGB Workshop Machine Learning\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"First EAGE/PESGB Workshop Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3997/2214-4609.201803032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"First EAGE/PESGB Workshop Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3997/2214-4609.201803032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在应用监督式机器学习预测电缆测井岩石层之前，我们评估了数据可用性和一致性的重要性。为了研究岩石层预测的敏感性，对数据集进行预处理，并将其用作三个机器学习模型的训练数据。第一个模型使用没有任何修改的质量保证数据集。第二个模型将日志签名标准化，而第三个模型将数据集与其他特征结合使用，以抑制极端异常值。通过对比基于CPI的岩相解释，对三种模型进行了评价，表明模型的预测能力各不相同。该方法应用于北海挪威部分四象限内约100口探井的质量控制侏罗纪层数据集。结果表明，测井资料的数量直接影响预测精度。为了获得可接受的预测精度，井中至少应包含伽马射线、密度和中子测井。为了区分砂岩中的含水层和含油层，电阻率测井也应该存在。当在区域尺度上实施机器学习时，我们应该考虑不同的埋藏深度和沉积环境，以获得最佳的预测能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Input Data Quality Influence On Lithoclass Predictions In Relation To Supervised Machine Learning

We assess the importance of data availability and consistency prior to applying supervised machine learning for predicting lithoclasses from wireline logs. A dataset is pre-processed and used as training data by three machine learning models in order to investigate the sensitivity of the lithoclasses predictions. The first model uses the quality assured dataset without any modifications. The second model standardizes log signatures, whereas the third model uses the dataset in combination with additional features that dampens extreme outliers. The three models are evaluated against lithofacies interpretations based on CPI’s to show the varying predicting power of the models. The method is applied on a quality-controlled Jurassic interval dataset of ~100 exploration wells within a quadrant of the Norwegian part of the North Sea. The results shows that the number of wireline logs available has a direct influence on the prediction accuracy. For an acceptable prediction accuracy the wells should contain at least the gamma ray, density and neutron log. To distinguish between water-bearing and hydrocarbon-bearing intervals in sandstones the resistivity logs should also be present. When implementing machine learning on a regional scale we should consider varying burial depth and depositional environment in order to gain optimal predicting power.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

First EAGE/PESGB Workshop Machine Learning

自引率

0.00%

发文量