Mark S. Pleasants, Michael N. Fienen, Hedeff I. Essaid, Joel D. Blomquist, Jing Yang, Ming Ye
Journal: Water Resources Research (Impact Factor 5.0, JCR Q2, Environmental Sciences)
DOI: 10.1029/2025wr040323
Publication date: 2025-09-13
Toward a New Framework to Evaluate Process‐Based Model Configurations and Quantify Data Worth Prior to Calibration
Model criticism, discrimination, and selection methods often rely on calibrated model outputs. Because calibration can be computationally expensive, model criticism can first be undertaken by assessing model outputs obtained from limited prior parameter ensembles. However, such prior‐based methods are often heuristic and do not formalize the notion of balancing model consistency with data and model complexity (i.e., model adequacy). We present a new framework to discriminate among candidate models prior to calibration that formalizes prior‐to‐calibration model adequacy into a metric to implicitly balance prior model output data coverage with model complexity represented by prior output (co)variance. The prior model adequacy metric “Mahalanobis distance deviation” quantifies the deviation of (a) the set of squared Mahalanobis distances of data from a prior model output distribution from (b) the set of squared Mahalanobis distances of data from their own distribution. A new data worth metric “discernment value” is also presented which quantifies the value of data for screening less‐adequate models prior to calibration. Discernment value is calculated from the change in variance of a weighted average of prior model outputs from all candidate models due to less‐adequate model outputs receiving lower weight. The framework is demonstrated using a one‐dimensional groundwater flow model with eight possible configurations. A synthetic data network is used to test the framework. Results show the framework identifies the candidate models most similar to the true model used to create the synthetic data. Discernment values show variation in the value of different data types and locations for screening less‐adequate models.
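The two metrics described in the abstract can be illustrated with a short numerical sketch. The example below is for demonstration only and is not the paper's implementation: the ensemble sizes, data values, adequacy weights, and the use of a mean absolute difference to summarize the deviation between the two sets of squared Mahalanobis distances are all assumptions made here.

```python
import numpy as np

def sq_mahalanobis(points, mean, cov):
    """Squared Mahalanobis distance of each row of `points` from N(mean, cov)."""
    diff = points - mean
    return np.einsum("ij,jk,ik->i", diff, np.linalg.inv(cov), diff)

rng = np.random.default_rng(0)
data = rng.normal(0.0, 1.0, size=(50, 3))            # hypothetical observations (50 vectors, 3 data types)
prior_outputs = rng.normal(0.2, 1.5, size=(200, 3))  # hypothetical prior ensemble for one candidate model

# (a) squared Mahalanobis distances of the data from the prior model output distribution
d2_model = sq_mahalanobis(data, prior_outputs.mean(axis=0),
                          np.cov(prior_outputs, rowvar=False))

# (b) squared Mahalanobis distances of the data from their own distribution
d2_self = sq_mahalanobis(data, data.mean(axis=0),
                         np.cov(data, rowvar=False))

# One plausible summary of the deviation of set (a) from set (b);
# the paper's exact functional form may differ.
mdd = np.mean(np.abs(d2_model - d2_self))

# Discernment value (sketch): reduction in the variance of a weighted
# average of prior outputs when less-adequate models are down-weighted.
prior_means = np.array([1.0, 1.2, 3.5, 0.9])     # hypothetical prior output means, 4 candidate models
equal_w = np.full(4, 0.25)                       # before screening: equal weights
adequacy_w = np.array([0.40, 0.35, 0.05, 0.20])  # hypothetical adequacy-based weights

def weighted_var(x, w):
    """Variance of x under weights w (weights sum to 1)."""
    m = np.sum(w * x)
    return np.sum(w * (x - m) ** 2)

discernment = weighted_var(prior_means, equal_w) - weighted_var(prior_means, adequacy_w)
```

Down-weighting the outlying third model shrinks the spread of the weighted ensemble, so `discernment` is positive in this toy case, consistent with the abstract's description of data that help screen less-adequate models.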
Journal introduction:
Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, as well as social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.