Stéphane Belbèze , Jérémy Rohmer , Dominique Guyonnet , Philippe Négrel , Timo Tarvainen
{"title":"改进空间插值的异常分析存在稀疏,聚类或不精确的数据集","authors":"Stéphane Belbèze , Jérémy Rohmer , Dominique Guyonnet , Philippe Négrel , Timo Tarvainen","doi":"10.1016/j.gexplo.2025.107868","DOIUrl":null,"url":null,"abstract":"<div><div>In this study, we present a new method of interpolation and anomaly detection especially designed for sparse, clustered or imprecise environmental data (SIC). Such data cannot be processed by current state of the art spatial methods and models, including the most widely used, such as kriging. Indeed, the statistics obtained on SIC data (on the order of 5–30) do not allow us to define a covariance or to calibrate the numerous hyper-parameters of sophisticated Bayesian or deep image prior models. We therefore adapted an information dissemination algorithm to handle SIC data. This probabilistic model has been enriched (anisotropy, de-clustering, auto-variography, multi-support, treatment of covariates, and censored data) in a way that fully meets the needs for environmental SIC data and can be used in conjunction with hybrid propagation of epistemic and aleatoric uncertainties and anomaly detection, whatever their mathematical form. The new interpolator for anomaly detection was applied on a very small set of 13 sparse data points characteristic of small-scale environmental studies, on digital-challenge datasets and on two real datasets, i.e., a large-scale geochemical dataset and a SIC urban soil dataset. Results highlight the added value of the proposed algorithm, that is able to pinpoint anomalies in SIC data, while avoiding in particular the smoothing effects of certain previous methods.</div></div>","PeriodicalId":16336,"journal":{"name":"Journal of Geochemical Exploration","volume":"279 ","pages":"Article 107868"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving spatial interpolation for anomaly analysis in presence of sparse, clustered or imprecise data sets\",\"authors\":\"Stéphane Belbèze , Jérémy Rohmer , Dominique Guyonnet , Philippe Négrel , Timo Tarvainen\",\"doi\":\"10.1016/j.gexplo.2025.107868\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this study, we present a new method of interpolation and anomaly detection especially designed for sparse, clustered or imprecise environmental data (SIC). Such data cannot be processed by current state of the art spatial methods and models, including the most widely used, such as kriging. Indeed, the statistics obtained on SIC data (on the order of 5–30) do not allow us to define a covariance or to calibrate the numerous hyper-parameters of sophisticated Bayesian or deep image prior models. We therefore adapted an information dissemination algorithm to handle SIC data. This probabilistic model has been enriched (anisotropy, de-clustering, auto-variography, multi-support, treatment of covariates, and censored data) in a way that fully meets the needs for environmental SIC data and can be used in conjunction with hybrid propagation of epistemic and aleatoric uncertainties and anomaly detection, whatever their mathematical form. The new interpolator for anomaly detection was applied on a very small set of 13 sparse data points characteristic of small-scale environmental studies, on digital-challenge datasets and on two real datasets, i.e., a large-scale geochemical dataset and a SIC urban soil dataset. Results highlight the added value of the proposed algorithm, that is able to pinpoint anomalies in SIC data, while avoiding in particular the smoothing effects of certain previous methods.</div></div>\",\"PeriodicalId\":16336,\"journal\":{\"name\":\"Journal of Geochemical Exploration\",\"volume\":\"279 \",\"pages\":\"Article 107868\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Geochemical Exploration\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0375674225002006\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Geochemical Exploration","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375674225002006","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
Improving spatial interpolation for anomaly analysis in presence of sparse, clustered or imprecise data sets
In this study, we present a new method of interpolation and anomaly detection especially designed for sparse, clustered or imprecise environmental data (SIC). Such data cannot be processed by current state of the art spatial methods and models, including the most widely used, such as kriging. Indeed, the statistics obtained on SIC data (on the order of 5–30) do not allow us to define a covariance or to calibrate the numerous hyper-parameters of sophisticated Bayesian or deep image prior models. We therefore adapted an information dissemination algorithm to handle SIC data. This probabilistic model has been enriched (anisotropy, de-clustering, auto-variography, multi-support, treatment of covariates, and censored data) in a way that fully meets the needs for environmental SIC data and can be used in conjunction with hybrid propagation of epistemic and aleatoric uncertainties and anomaly detection, whatever their mathematical form. The new interpolator for anomaly detection was applied on a very small set of 13 sparse data points characteristic of small-scale environmental studies, on digital-challenge datasets and on two real datasets, i.e., a large-scale geochemical dataset and a SIC urban soil dataset. Results highlight the added value of the proposed algorithm, that is able to pinpoint anomalies in SIC data, while avoiding in particular the smoothing effects of certain previous methods.
期刊介绍:
Journal of Geochemical Exploration is mostly dedicated to publication of original studies in exploration and environmental geochemistry and related topics.
Contributions considered of prevalent interest for the journal include researches based on the application of innovative methods to:
define the genesis and the evolution of mineral deposits including transfer of elements in large-scale mineralized areas.
analyze complex systems at the boundaries between bio-geochemistry, metal transport and mineral accumulation.
evaluate effects of historical mining activities on the surface environment.
trace pollutant sources and define their fate and transport models in the near-surface and surface environments involving solid, fluid and aerial matrices.
assess and quantify natural and technogenic radioactivity in the environment.
determine geochemical anomalies and set baseline reference values using compositional data analysis, multivariate statistics and geo-spatial analysis.
assess the impacts of anthropogenic contamination on ecosystems and human health at local and regional scale to prioritize and classify risks through deterministic and stochastic approaches.
Papers dedicated to the presentation of newly developed methods in analytical geochemistry to be applied in the field or in laboratory are also within the topics of interest for the journal.