Wide & deep learning for predicting relative mineral compositions of sediment cores solely based on XRF scans, a case study from Pleistocene Paleolake Olduvai, Tanzania

IF 4.2

Artificial Intelligence in Geosciences Pub Date : 2024-08-22 DOI:10.1016/j.aiig.2024.100088

Gayantha R.L. Kodikara , Lindsay J. McHenry , Ian G. Stanistreet , Harald Stollhofen , Jackson K. Njau , Nicholas Toth , Kathy Schick

{"title":"Wide & deep learning for predicting relative mineral compositions of sediment cores solely based on XRF scans, a case study from Pleistocene Paleolake Olduvai, Tanzania","authors":"Gayantha R.L. Kodikara , Lindsay J. McHenry , Ian G. Stanistreet , Harald Stollhofen , Jackson K. Njau , Nicholas Toth , Kathy Schick","doi":"10.1016/j.aiig.2024.100088","DOIUrl":null,"url":null,"abstract":"<div><p>This study develops a method to use deep learning models to predict the mineral assemblages and their relative abundances in paleolake cores using high-resolution XRF core scan elemental data and X-ray diffraction (XRD) mineralogical results from the same core taken at coarser resolution. It uses the XRF core scan data along with published mineralogical information from the Olduvai Gorge Coring Project (OGCP) 2014 sediment cores 1A, 2A, and 3A from Paleolake Olduvai, Tanzania. Both regression and classification models were developed using a Keras deep learning framework to assess the predictability of mineral assemblages with their relative abundances (in regression models) or at least the mineral assemblages (in classification models) using XRF core scan data. Models were created using the Sequential class and Functional API with different model architectures. The correlation matrix of element ratios calculated from XRF element intensity records from the cores and XRD-derived mineralogical information was used to select the most useful features to train the models. 1057 training data records were used for the models. Lithological classes were also used for some models using Wide & Deep neural networks since those combine the benefits of memorization and generalization for mineral prediction. The results were validated using 265 validation data records unseen by the model and discuss the accuracy of models using six test records. The optimized Deep Neural Network (DNN) classification model achieved over 86% binary accuracy while the regression models were also able to predict the relative mineral abundances of samples with high accuracies. Overall, the study shows the efficacy of a carefully crafted Deep Learning (DL) model for predicting mineral assemblages and abundances using high-resolution XRF core scan data.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100088"},"PeriodicalIF":4.2000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000297/pdfft?md5=8bd7402c96f4a311b4dbf3ffa0c2ef1b&pid=1-s2.0-S2666544124000297-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544124000297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This study develops a method to use deep learning models to predict the mineral assemblages and their relative abundances in paleolake cores using high-resolution XRF core scan elemental data and X-ray diffraction (XRD) mineralogical results from the same core taken at coarser resolution. It uses the XRF core scan data along with published mineralogical information from the Olduvai Gorge Coring Project (OGCP) 2014 sediment cores 1A, 2A, and 3A from Paleolake Olduvai, Tanzania. Both regression and classification models were developed using a Keras deep learning framework to assess the predictability of mineral assemblages with their relative abundances (in regression models) or at least the mineral assemblages (in classification models) using XRF core scan data. Models were created using the Sequential class and Functional API with different model architectures. The correlation matrix of element ratios calculated from XRF element intensity records from the cores and XRD-derived mineralogical information was used to select the most useful features to train the models. 1057 training data records were used for the models. Lithological classes were also used for some models using Wide & Deep neural networks since those combine the benefits of memorization and generalization for mineral prediction. The results were validated using 265 validation data records unseen by the model and discuss the accuracy of models using six test records. The optimized Deep Neural Network (DNN) classification model achieved over 86% binary accuracy while the regression models were also able to predict the relative mineral abundances of samples with high accuracies. Overall, the study shows the efficacy of a carefully crafted Deep Learning (DL) model for predicting mineral assemblages and abundances using high-resolution XRF core scan data.

查看原文本刊更多论文

完全基于 XRF 扫描预测沉积岩芯相对矿物成分的广泛和深度学习，坦桑尼亚更新世古湖奥杜威案例研究

本研究开发了一种方法，利用高分辨率 XRF 岩心扫描元素数据和同一岩心较粗分辨率的 X 射线衍射 (XRD) 矿物学结果，使用深度学习模型预测古湖岩心中的矿物组合及其相对丰度。该研究使用了 XRF 岩心扫描数据以及已公布的来自坦桑尼亚奥杜威峡谷岩芯采集项目（OGCP）2014 年奥杜威古湖 1A、2A 和 3A 号沉积岩芯的矿物学信息。我们使用 Keras 深度学习框架开发了回归和分类模型，以评估使用 XRF 岩心扫描数据的矿物组合及其相对丰度（在回归模型中）或至少矿物组合（在分类模型中）的可预测性。使用具有不同模型结构的序列类和功能应用程序接口创建了模型。根据岩心的 XRF 元素强度记录和 XRD 衍生矿物学信息计算出的元素比率相关矩阵用于选择最有用的特征来训练模型。模型使用了 1057 条训练数据记录。由于深度神经网络结合了矿物预测的记忆性和概括性优势，因此一些模型还使用了宽amp；深度神经网络的岩性类别。使用模型未见过的 265 条验证数据对结果进行了验证，并使用 6 条测试数据对模型的准确性进行了讨论。经过优化的深度神经网络（DNN）分类模型达到了 86% 以上的二元准确率，而回归模型也能以较高的准确率预测样本的相对矿物丰度。总之，这项研究表明，精心设计的深度学习（DL）模型能够有效地利用高分辨率 XRF 岩心扫描数据预测矿物组合和丰度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence in Geosciences

CiteScore

4.20

自引率

0.00%

发文量