利用机器学习对高分辨率岩心图像、高光谱数据和地球化学进行蚀变组合表征

IF 1 4区 地球科学 Q3 GEOCHEMISTRY & GEOPHYSICS
McLean Trott, Cole Mooney, Shervin Azad, Sam Sattarzadeh, Britt Bluemel, Matthew Leybourne, Daniel Layton-Matthews
{"title":"利用机器学习对高分辨率岩心图像、高光谱数据和地球化学进行蚀变组合表征","authors":"McLean Trott, Cole Mooney, Shervin Azad, Sam Sattarzadeh, Britt Bluemel, Matthew Leybourne, Daniel Layton-Matthews","doi":"10.1144/geochem2023-032","DOIUrl":null,"url":null,"abstract":"Integration of multiple data types is beneficial for prediction of geological characteristics. From the perspective that geochemistry characterizes the composition of a rock mass, hyperspectral data characterizes alteration mineralogy, and image feature extraction characterizes texture, most geological classifications would be well-informed by the combination of these three features. The process of meaningfully integrating distinctly sourced datasets and producing scale-relevant predictions for geological classifications involves several steps. We demonstrate a workflow to comprehensively structure and integrate these three feature families, refine training data, predict alteration classes, and mitigate noise derived from scale mismatch in output predictions. The dataset, compiled from the Josemaria porphyry copper deposit in Argentina, is comprised of more than 14,000 intervals of approximately 2 m, taken from 36 drillholes, where geochemistry was merged with hyperspectral mineralogy represented as tabular pixel abundances, and textural metrics extracted from core imagery, structured into the geochemical interval. Feature engineering and principal component analysis provided insights into the behavior of the ore system during intermediate steps, as well as providing uncorrelated feature inputs for a random forest predictor. Training data were refined by producing an initial prediction, thresholding the predictions to >70% dominant class probability and using those (high probability) samples to produce a final model encoding better constrained separation between alteration assemblages. Prediction using the final model returned an accuracy of 82.5 %, as a function of model discrepancy combined with logging ambiguity and a scale mismatch between generalized logged intervals and much more granular (2 m) feature inputs. Noise reduction and generalization to desired resolution of output was achieved by applying the multiscale multivariate continuous wavelet transform tessellation method to class membership probabilities. Ultimately a large database of logged drill-core was homogenized using empirical methodologies. The described workflow is adaptable to distinct scenarios with some modification and is apt for integrating multiple input feature types and using them to systematically define geological classifications in drill-hole data.","PeriodicalId":55114,"journal":{"name":"Geochemistry-Exploration Environment Analysis","volume":"174 1","pages":"0"},"PeriodicalIF":1.0000,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Alteration assemblage characterization using machine learning applied to high resolution drill-core images, hyperspectral data, and geochemistry\",\"authors\":\"McLean Trott, Cole Mooney, Shervin Azad, Sam Sattarzadeh, Britt Bluemel, Matthew Leybourne, Daniel Layton-Matthews\",\"doi\":\"10.1144/geochem2023-032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Integration of multiple data types is beneficial for prediction of geological characteristics. From the perspective that geochemistry characterizes the composition of a rock mass, hyperspectral data characterizes alteration mineralogy, and image feature extraction characterizes texture, most geological classifications would be well-informed by the combination of these three features. The process of meaningfully integrating distinctly sourced datasets and producing scale-relevant predictions for geological classifications involves several steps. We demonstrate a workflow to comprehensively structure and integrate these three feature families, refine training data, predict alteration classes, and mitigate noise derived from scale mismatch in output predictions. The dataset, compiled from the Josemaria porphyry copper deposit in Argentina, is comprised of more than 14,000 intervals of approximately 2 m, taken from 36 drillholes, where geochemistry was merged with hyperspectral mineralogy represented as tabular pixel abundances, and textural metrics extracted from core imagery, structured into the geochemical interval. Feature engineering and principal component analysis provided insights into the behavior of the ore system during intermediate steps, as well as providing uncorrelated feature inputs for a random forest predictor. Training data were refined by producing an initial prediction, thresholding the predictions to >70% dominant class probability and using those (high probability) samples to produce a final model encoding better constrained separation between alteration assemblages. Prediction using the final model returned an accuracy of 82.5 %, as a function of model discrepancy combined with logging ambiguity and a scale mismatch between generalized logged intervals and much more granular (2 m) feature inputs. Noise reduction and generalization to desired resolution of output was achieved by applying the multiscale multivariate continuous wavelet transform tessellation method to class membership probabilities. Ultimately a large database of logged drill-core was homogenized using empirical methodologies. The described workflow is adaptable to distinct scenarios with some modification and is apt for integrating multiple input feature types and using them to systematically define geological classifications in drill-hole data.\",\"PeriodicalId\":55114,\"journal\":{\"name\":\"Geochemistry-Exploration Environment Analysis\",\"volume\":\"174 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2023-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geochemistry-Exploration Environment Analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1144/geochem2023-032\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geochemistry-Exploration Environment Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1144/geochem2023-032","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

摘要

多种数据类型的集成有利于地质特征的预测。从地球化学特征表征岩体成分、高光谱数据特征表征蚀变矿物学、图像特征提取特征表征纹理的角度来看,这三种特征的结合将为大多数地质分类提供良好的信息。有意义地整合不同来源的数据集并为地质分类产生与规模相关的预测的过程涉及几个步骤。我们演示了一个工作流,以全面构建和集成这三个特征族,改进训练数据,预测变化类别,并减轻输出预测中规模不匹配产生的噪声。该数据集来自阿根廷的Josemaria斑岩铜矿,由36个钻孔中约2米的14000多个层段组成,其中地球化学与高光谱矿物学相结合,表示为表格像素丰度,并从岩心图像中提取纹理指标,构成地球化学层段。特征工程和主成分分析在中间步骤中提供了对矿石系统行为的洞察,并为随机森林预测器提供了不相关的特征输入。训练数据通过产生一个初始预测,将预测阈值设定为70%的优势类概率,并使用这些(高概率)样本来产生一个最终模型,该模型编码更好地约束了蚀变组合之间的分离。由于模型差异、测井模糊性以及广义测井间隔与更细粒度(2米)特征输入之间的尺度不匹配,使用最终模型进行预测的准确率为82.5%。通过对类隶属概率进行多尺度连续小波变换细分,实现了输出的降噪和泛化。最终,使用经验方法对测井岩心的大型数据库进行了均质化。所描述的工作流经过一些修改,可以适应不同的场景,并且适合于集成多个输入特征类型并使用它们系统地定义钻孔数据中的地质分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Alteration assemblage characterization using machine learning applied to high resolution drill-core images, hyperspectral data, and geochemistry
Integration of multiple data types is beneficial for prediction of geological characteristics. From the perspective that geochemistry characterizes the composition of a rock mass, hyperspectral data characterizes alteration mineralogy, and image feature extraction characterizes texture, most geological classifications would be well-informed by the combination of these three features. The process of meaningfully integrating distinctly sourced datasets and producing scale-relevant predictions for geological classifications involves several steps. We demonstrate a workflow to comprehensively structure and integrate these three feature families, refine training data, predict alteration classes, and mitigate noise derived from scale mismatch in output predictions. The dataset, compiled from the Josemaria porphyry copper deposit in Argentina, is comprised of more than 14,000 intervals of approximately 2 m, taken from 36 drillholes, where geochemistry was merged with hyperspectral mineralogy represented as tabular pixel abundances, and textural metrics extracted from core imagery, structured into the geochemical interval. Feature engineering and principal component analysis provided insights into the behavior of the ore system during intermediate steps, as well as providing uncorrelated feature inputs for a random forest predictor. Training data were refined by producing an initial prediction, thresholding the predictions to >70% dominant class probability and using those (high probability) samples to produce a final model encoding better constrained separation between alteration assemblages. Prediction using the final model returned an accuracy of 82.5 %, as a function of model discrepancy combined with logging ambiguity and a scale mismatch between generalized logged intervals and much more granular (2 m) feature inputs. Noise reduction and generalization to desired resolution of output was achieved by applying the multiscale multivariate continuous wavelet transform tessellation method to class membership probabilities. Ultimately a large database of logged drill-core was homogenized using empirical methodologies. The described workflow is adaptable to distinct scenarios with some modification and is apt for integrating multiple input feature types and using them to systematically define geological classifications in drill-hole data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Geochemistry-Exploration Environment Analysis
Geochemistry-Exploration Environment Analysis 地学-地球化学与地球物理
CiteScore
3.60
自引率
16.70%
发文量
30
审稿时长
1 months
期刊介绍: Geochemistry: Exploration, Environment, Analysis (GEEA) is a co-owned journal of the Geological Society of London and the Association of Applied Geochemists (AAG). GEEA focuses on mineral exploration using geochemistry; related fields also covered include geoanalysis, the development of methods and techniques used to analyse geochemical materials such as rocks, soils, sediments, waters and vegetation, and environmental issues associated with mining and source apportionment. GEEA is well-known for its thematic sets on hot topics and regularly publishes papers from the biennial International Applied Geochemistry Symposium (IAGS). Papers that seek to integrate geological, geochemical and geophysical methods of exploration are particularly welcome, as are those that concern geochemical mapping and those that comprise case histories. Given the many links between exploration and environmental geochemistry, the journal encourages the exchange of concepts and data; in particular, to differentiate various sources of elements. GEEA publishes research articles; discussion papers; book reviews; editorial content and thematic sets.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信