利用蒙特卡罗dropout法进行地球化学数据输入的不确定度量化

IF 3.4 2区 地球科学 Q1 GEOCHEMISTRY & GEOPHYSICS
Vladimir Puzyrev , Paul Duuring
{"title":"利用蒙特卡罗dropout法进行地球化学数据输入的不确定度量化","authors":"Vladimir Puzyrev ,&nbsp;Paul Duuring","doi":"10.1016/j.gexplo.2025.107695","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning models have shown their promise in geochemical data imputation tasks. However, being black-box solvers, these models require more confidence in their predictions. Using uncertainty quantification methods for deep neural networks can increase the reliability of their predictions. In this paper, we use Monte Carlo Dropout to estimate uncertainty in geochemical data imputation. Multiple forward passes with different dropout configurations yield a predictive distribution for the unknown analytes. The mean of this distribution is used as the prediction, while the standard deviation expresses the uncertainty of the neural networks. Two different scenarios, namely the WACHEM and WAMEX databases containing multi-element geochemical data for rock samples, illustrate the predictive accuracy of the method and its capability to measure the associated uncertainty. Dropout values of 0.1–0.2 were identified as a good balance in prediction accuracy and model uncertainty.</div></div>","PeriodicalId":16336,"journal":{"name":"Journal of Geochemical Exploration","volume":"272 ","pages":"Article 107695"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Uncertainty quantification of geochemical data imputation using Monte Carlo dropout\",\"authors\":\"Vladimir Puzyrev ,&nbsp;Paul Duuring\",\"doi\":\"10.1016/j.gexplo.2025.107695\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Machine learning models have shown their promise in geochemical data imputation tasks. However, being black-box solvers, these models require more confidence in their predictions. Using uncertainty quantification methods for deep neural networks can increase the reliability of their predictions. In this paper, we use Monte Carlo Dropout to estimate uncertainty in geochemical data imputation. Multiple forward passes with different dropout configurations yield a predictive distribution for the unknown analytes. The mean of this distribution is used as the prediction, while the standard deviation expresses the uncertainty of the neural networks. Two different scenarios, namely the WACHEM and WAMEX databases containing multi-element geochemical data for rock samples, illustrate the predictive accuracy of the method and its capability to measure the associated uncertainty. Dropout values of 0.1–0.2 were identified as a good balance in prediction accuracy and model uncertainty.</div></div>\",\"PeriodicalId\":16336,\"journal\":{\"name\":\"Journal of Geochemical Exploration\",\"volume\":\"272 \",\"pages\":\"Article 107695\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-02-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Geochemical Exploration\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0375674225000275\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Geochemical Exploration","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375674225000275","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

摘要

机器学习模型在地球化学数据输入任务中显示出其前景。然而,作为黑盒求解者,这些模型需要对其预测更有信心。对深度神经网络采用不确定性量化方法可以提高其预测的可靠性。本文利用蒙特卡罗Dropout法对地球化学数据的不确定度进行估计。具有不同漏出配置的多个正向通道产生未知分析物的预测分布。使用该分布的均值作为预测,而标准差表示神经网络的不确定性。两种不同的场景,即WACHEM和WAMEX数据库,包含岩石样品的多元素地球化学数据,说明了该方法的预测准确性及其测量相关不确定性的能力。Dropout值为0.1-0.2,在预测精度和模型不确定性之间取得了很好的平衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Uncertainty quantification of geochemical data imputation using Monte Carlo dropout
Machine learning models have shown their promise in geochemical data imputation tasks. However, being black-box solvers, these models require more confidence in their predictions. Using uncertainty quantification methods for deep neural networks can increase the reliability of their predictions. In this paper, we use Monte Carlo Dropout to estimate uncertainty in geochemical data imputation. Multiple forward passes with different dropout configurations yield a predictive distribution for the unknown analytes. The mean of this distribution is used as the prediction, while the standard deviation expresses the uncertainty of the neural networks. Two different scenarios, namely the WACHEM and WAMEX databases containing multi-element geochemical data for rock samples, illustrate the predictive accuracy of the method and its capability to measure the associated uncertainty. Dropout values of 0.1–0.2 were identified as a good balance in prediction accuracy and model uncertainty.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Geochemical Exploration
Journal of Geochemical Exploration 地学-地球化学与地球物理
CiteScore
7.40
自引率
7.70%
发文量
148
审稿时长
8.1 months
期刊介绍: Journal of Geochemical Exploration is mostly dedicated to publication of original studies in exploration and environmental geochemistry and related topics. Contributions considered of prevalent interest for the journal include researches based on the application of innovative methods to: define the genesis and the evolution of mineral deposits including transfer of elements in large-scale mineralized areas. analyze complex systems at the boundaries between bio-geochemistry, metal transport and mineral accumulation. evaluate effects of historical mining activities on the surface environment. trace pollutant sources and define their fate and transport models in the near-surface and surface environments involving solid, fluid and aerial matrices. assess and quantify natural and technogenic radioactivity in the environment. determine geochemical anomalies and set baseline reference values using compositional data analysis, multivariate statistics and geo-spatial analysis. assess the impacts of anthropogenic contamination on ecosystems and human health at local and regional scale to prioritize and classify risks through deterministic and stochastic approaches. Papers dedicated to the presentation of newly developed methods in analytical geochemistry to be applied in the field or in laboratory are also within the topics of interest for the journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信