{"title":"AutoQS v1: automatic parametrization of QuickSampling based on training images analysis","authors":"Mathieu Gravey, Grégoire Mariethoz","doi":"10.5194/gmd-16-5265-2023","DOIUrl":null,"url":null,"abstract":"Abstract. Multiple-point geostatistics are widely used to simulate\ncomplex spatial structures based on a training image. The practical\napplicability of these methods relies on the possibility of finding optimal\ntraining images and parametrization of the simulation algorithms. While\nmethods for automatically selecting training images are available,\nparametrization can be cumbersome. Here, we propose to find an optimal set\nof parameters using only the training image as input. The difference between\nthis and previous work that used parametrization optimization is that it\ndoes not require the definition of an objective function. Our approach is\nbased on the analysis of the errors that occur when filling artificially\nconstructed patterns that have been borrowed from the training image. Its\nmain advantage is to eliminate the risk of overfitting an objective\nfunction, which may result in variance underestimation or in verbatim copy\nof the training image. Since it is not based on optimization, our approach\nfinds a set of acceptable parameters in a predictable manner by using the\nknowledge and understanding of how the simulation algorithms work. The\ntechnique is explored in the context of the recently developed QuickSampling\nalgorithm, but it can be easily adapted to other pixel-based multiple-point\nstatistics algorithms using pattern matching, such as direct sampling or\nsingle normal equation simulation (SNESIM).","PeriodicalId":12799,"journal":{"name":"Geoscientific Model Development","volume":"5 1","pages":"0"},"PeriodicalIF":4.0000,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscientific Model Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/gmd-16-5265-2023","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract. Multiple-point geostatistics are widely used to simulate
complex spatial structures based on a training image. The practical
applicability of these methods relies on the possibility of finding optimal
training images and parametrization of the simulation algorithms. While
methods for automatically selecting training images are available,
parametrization can be cumbersome. Here, we propose to find an optimal set
of parameters using only the training image as input. The difference between
this and previous work that used parametrization optimization is that it
does not require the definition of an objective function. Our approach is
based on the analysis of the errors that occur when filling artificially
constructed patterns that have been borrowed from the training image. Its
main advantage is to eliminate the risk of overfitting an objective
function, which may result in variance underestimation or in verbatim copy
of the training image. Since it is not based on optimization, our approach
finds a set of acceptable parameters in a predictable manner by using the
knowledge and understanding of how the simulation algorithms work. The
technique is explored in the context of the recently developed QuickSampling
algorithm, but it can be easily adapted to other pixel-based multiple-point
statistics algorithms using pattern matching, such as direct sampling or
single normal equation simulation (SNESIM).
期刊介绍:
Geoscientific Model Development (GMD) is an international scientific journal dedicated to the publication and public discussion of the description, development, and evaluation of numerical models of the Earth system and its components. The following manuscript types can be considered for peer-reviewed publication:
* geoscientific model descriptions, from statistical models to box models to GCMs;
* development and technical papers, describing developments such as new parameterizations or technical aspects of running models such as the reproducibility of results;
* new methods for assessment of models, including work on developing new metrics for assessing model performance and novel ways of comparing model results with observational data;
* papers describing new standard experiments for assessing model performance or novel ways of comparing model results with observational data;
* model experiment descriptions, including experimental details and project protocols;
* full evaluations of previously published models.