Enhanced machine learning tree classifiers for lithology identification using Bayesian optimization

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Solomon Asante-Okyere , Chuanbo Shen , Harrison Osei
{"title":"Enhanced machine learning tree classifiers for lithology identification using Bayesian optimization","authors":"Solomon Asante-Okyere ,&nbsp;Chuanbo Shen ,&nbsp;Harrison Osei","doi":"10.1016/j.acags.2022.100100","DOIUrl":null,"url":null,"abstract":"<div><p>Lithology identification is a fundamental activity in oil and gas exploration. The application of artificial intelligence (AI) is currently being adopted as a state-of-the-art means of automating lithology identification. One aspect of this AI approach is the application of population search algorithms to optimise hyperparameters for enhanced prediction performance. For the first time, Bayesian optimization is deployed to determine the optimal learning parameters for more accurate tree and tree ensemble lithology classifiers. The aim is to rely on the ability of Bayesian optimization to consider previous classification results to improve the output of decision and ensemble tree lithology models using well logs as inputs. The proposed Bayesian optimised decision tree (BODT) generated the best classification accuracy of 89.8% as compared to 86.9%, 83.3% and 81.2% for fine, medium and coarse trees. For the ensembled trees, the Bayesian optimised AdaBoost (BO-AdaBoost) classifier generated the highest improved prediction accuracy of 94.2% while Bayesian optimised Bagged (BO-Bagged) and Bayesian optimised RUSBoost (BO-RUSBoost) had a lower accuracy rate of 94.0% and 77.1% respectively. Additionally, the performance of the Bayesian optimised classifiers offered higher reliability when compared with particle swarm optimization-based artificial neural networks (PSO-ANN). Hence, incorporating Bayesian optimization as a hyperparameter search algorithm will improve litholofacies recognition, leading to a higher accuracy rate and thereby provide an improved alternative for intelligent lithology identification.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100100"},"PeriodicalIF":2.6000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000222/pdfft?md5=70b41f43c359f6a72c8f285b2d646140&pid=1-s2.0-S2590197422000222-main.pdf","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Computing and Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590197422000222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 3

Abstract

Lithology identification is a fundamental activity in oil and gas exploration. The application of artificial intelligence (AI) is currently being adopted as a state-of-the-art means of automating lithology identification. One aspect of this AI approach is the application of population search algorithms to optimise hyperparameters for enhanced prediction performance. For the first time, Bayesian optimization is deployed to determine the optimal learning parameters for more accurate tree and tree ensemble lithology classifiers. The aim is to rely on the ability of Bayesian optimization to consider previous classification results to improve the output of decision and ensemble tree lithology models using well logs as inputs. The proposed Bayesian optimised decision tree (BODT) generated the best classification accuracy of 89.8% as compared to 86.9%, 83.3% and 81.2% for fine, medium and coarse trees. For the ensembled trees, the Bayesian optimised AdaBoost (BO-AdaBoost) classifier generated the highest improved prediction accuracy of 94.2% while Bayesian optimised Bagged (BO-Bagged) and Bayesian optimised RUSBoost (BO-RUSBoost) had a lower accuracy rate of 94.0% and 77.1% respectively. Additionally, the performance of the Bayesian optimised classifiers offered higher reliability when compared with particle swarm optimization-based artificial neural networks (PSO-ANN). Hence, incorporating Bayesian optimization as a hyperparameter search algorithm will improve litholofacies recognition, leading to a higher accuracy rate and thereby provide an improved alternative for intelligent lithology identification.

使用贝叶斯优化的增强机器学习树分类器用于岩性识别
岩性识别是油气勘探的一项基础性工作。人工智能(AI)的应用目前正被采用为自动化岩性识别的最先进手段。这种人工智能方法的一个方面是应用种群搜索算法来优化超参数以增强预测性能。第一次,贝叶斯优化被用于确定更准确的树和树集合岩性分类器的最佳学习参数。其目的是依靠贝叶斯优化的能力来考虑之前的分类结果,以提高决策的输出和使用测井作为输入的集成树岩性模型的输出。所提出的贝叶斯优化决策树(BODT)的分类准确率为89.8%,而细树、中树和粗树的分类准确率分别为86.9%、83.3%和81.2%。对于集成树,贝叶斯优化AdaBoost (BO-AdaBoost)分类器的预测准确率最高,达到94.2%,而贝叶斯优化Bagged (BO-Bagged)和贝叶斯优化RUSBoost (BO-RUSBoost)的准确率较低,分别为94.0%和77.1%。此外,与基于粒子群优化的人工神经网络(PSO-ANN)相比,贝叶斯优化分类器的性能具有更高的可靠性。因此,将贝叶斯优化作为一种超参数搜索算法,可以提高岩性识别的准确率,从而为智能岩性识别提供一种改进的选择。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Computing and Geosciences
Applied Computing and Geosciences Computer Science-General Computer Science
CiteScore
5.50
自引率
0.00%
发文量
23
审稿时长
5 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信