Denoising of Geochemical Data using Deep Learning–Implications for Regional Surveys

IF 4.8 2区地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY

Natural Resources Research Pub Date : 2024-02-28 DOI:10.1007/s11053-024-10317-5

Steven E. Zhang, Julie E. Bourdeau, Glen T. Nwaila, Mohammad Parsa, Yousef Ghorbani

{"title":"Denoising of Geochemical Data using Deep Learning–Implications for Regional Surveys","authors":"Steven E. Zhang, Julie E. Bourdeau, Glen T. Nwaila, Mohammad Parsa, Yousef Ghorbani","doi":"10.1007/s11053-024-10317-5","DOIUrl":null,"url":null,"abstract":"<p>Regional geochemical surveys generate large amounts of data that can be used for a number of purposes such as to guide mineral exploration. Modern surveys are typically designed to permit quantification of data uncertainty through data quality metrics by using quality assurance and quality control (QA/QC) methods. However, these metrics, such as data accuracy and precision, are obtained through the data generation phase. Consequently, it is unclear how residual uncertainty in geochemical data can be minimized (denoised). This is a limitation to propagating uncertainty through downstream activities, particularly through complex models, which can result from the usage of artificial intelligence-based methods. This study aims to develop a deep learning-based method to examine and quantify uncertainty contained in geochemical survey data. Specifically, we demonstrate that: (1) autoencoders can reduce or modulate geochemical data uncertainty; (2) a reduction in uncertainty is observable in the spatial domain as a decrease of the nugget; and (3) a clear data reconstruction regime of the autoencoder can be identified that is strongly associated with data denoising, as opposed to the removal of useful events in data, such as meaningful geochemical anomalies. Our method to post-hoc denoising of geochemical data using deep learning is simple, clear and consistent, with the amount of denoising guided by highly interpretable metrics and existing frameworks of scientific data quality. Consequently, variably denoised data, as well as the original data, could be fed into a single downstream workflow (e.g., mapping, general data analysis or mineral prospectivity mapping), and the differences in the outcome can be subsequently quantified to propagate data uncertainty.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"90 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-024-10317-5","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Regional geochemical surveys generate large amounts of data that can be used for a number of purposes such as to guide mineral exploration. Modern surveys are typically designed to permit quantification of data uncertainty through data quality metrics by using quality assurance and quality control (QA/QC) methods. However, these metrics, such as data accuracy and precision, are obtained through the data generation phase. Consequently, it is unclear how residual uncertainty in geochemical data can be minimized (denoised). This is a limitation to propagating uncertainty through downstream activities, particularly through complex models, which can result from the usage of artificial intelligence-based methods. This study aims to develop a deep learning-based method to examine and quantify uncertainty contained in geochemical survey data. Specifically, we demonstrate that: (1) autoencoders can reduce or modulate geochemical data uncertainty; (2) a reduction in uncertainty is observable in the spatial domain as a decrease of the nugget; and (3) a clear data reconstruction regime of the autoencoder can be identified that is strongly associated with data denoising, as opposed to the removal of useful events in data, such as meaningful geochemical anomalies. Our method to post-hoc denoising of geochemical data using deep learning is simple, clear and consistent, with the amount of denoising guided by highly interpretable metrics and existing frameworks of scientific data quality. Consequently, variably denoised data, as well as the original data, could be fed into a single downstream workflow (e.g., mapping, general data analysis or mineral prospectivity mapping), and the differences in the outcome can be subsequently quantified to propagate data uncertainty.

Abstract Image

查看原文本刊更多论文

利用深度学习对地球化学数据去噪--对区域勘测的启示

区域地球化学勘测会产生大量数据，这些数据可用于多种目的，如指导矿产勘探。现代勘测的设计通常允许使用质量保证和质量控制（QA/QC）方法，通过数据质量指标对数据的不确定性进行量化。然而，这些指标，如数据准确度和精确度，是通过数据生成阶段获得的。因此，目前还不清楚如何将地球化学数据中的残余不确定性最小化（去噪）。这限制了通过下游活动传播不确定性，特别是通过复杂模型传播不确定性，而使用基于人工智能的方法可以产生这种不确定性。本研究旨在开发一种基于深度学习的方法，用于检查和量化地球化学勘测数据中包含的不确定性。具体来说，我们证明了(1) 自动编码器可以减少或调节地球化学数据的不确定性；(2) 不确定性的减少在空间域可以观察到，即金块的减少；以及 (3) 可以确定自动编码器的明确数据重建机制，该机制与数据去噪密切相关，而不是去除数据中的有用事件，如有意义的地球化学异常。我们利用深度学习对地球化学数据进行事后去噪的方法简单、清晰、一致，去噪量以高度可解释的指标和现有的科学数据质量框架为指导。因此，可变的去噪数据以及原始数据可被输入到单一的下游工作流程（如绘图、一般数据分析或矿产远景绘图）中，结果的差异随后可被量化，以传播数据的不确定性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Natural Resources Research Environmental Science-General Environmental Science

CiteScore

11.90

自引率

11.10%

发文量

151

期刊介绍： This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.