How far can the statistical error estimation problem be closed by collocated data?

IF 1.7 4区 地球科学 Q3 GEOSCIENCES, MULTIDISCIPLINARY
Annika Vogel, Richard Ménard
{"title":"How far can the statistical error estimation problem be closed by collocated data?","authors":"Annika Vogel, Richard Ménard","doi":"10.5194/npg-30-375-2023","DOIUrl":null,"url":null,"abstract":"Abstract. Accurate specification of the error statistics required for data assimilation remains an ongoing challenge, partly because their estimation is an underdetermined problem that requires statistical assumptions. Even with the common assumption that background and observation errors are uncorrelated, the problem remains underdetermined. One natural question that could arise is as follows: can the increasing amount of overlapping observations or other datasets help to reduce the total number of statistical assumptions, or do they introduce more statistical unknowns? In order to answer this question, this paper provides a conceptual view on the statistical error estimation problem for multiple collocated datasets, including a generalized mathematical formulation, an illustrative demonstration with synthetic data, and guidelines for setting up and solving the problem. It is demonstrated that the required number of statistical assumptions increases linearly with the number of datasets. However, the number of error statistics that can be estimated increases quadratically, allowing for an estimation of an increasing number of error cross-statistics between datasets for more than three datasets. The presented generalized estimation of full error covariance and cross-covariance matrices between datasets does not necessarily accumulate the uncertainties of assumptions among error estimations of multiple datasets.","PeriodicalId":54714,"journal":{"name":"Nonlinear Processes in Geophysics","volume":"169 1","pages":"0"},"PeriodicalIF":1.7000,"publicationDate":"2023-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nonlinear Processes in Geophysics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/npg-30-375-2023","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract. Accurate specification of the error statistics required for data assimilation remains an ongoing challenge, partly because their estimation is an underdetermined problem that requires statistical assumptions. Even with the common assumption that background and observation errors are uncorrelated, the problem remains underdetermined. One natural question that could arise is as follows: can the increasing amount of overlapping observations or other datasets help to reduce the total number of statistical assumptions, or do they introduce more statistical unknowns? In order to answer this question, this paper provides a conceptual view on the statistical error estimation problem for multiple collocated datasets, including a generalized mathematical formulation, an illustrative demonstration with synthetic data, and guidelines for setting up and solving the problem. It is demonstrated that the required number of statistical assumptions increases linearly with the number of datasets. However, the number of error statistics that can be estimated increases quadratically, allowing for an estimation of an increasing number of error cross-statistics between datasets for more than three datasets. The presented generalized estimation of full error covariance and cross-covariance matrices between datasets does not necessarily accumulate the uncertainties of assumptions among error estimations of multiple datasets.
统计误差估计问题在多大程度上可以用并置数据来解决?
摘要准确地说明数据同化所需的误差统计仍然是一个持续的挑战,部分原因是它们的估计是一个需要统计假设的未确定问题。即使通常假设背景误差和观测误差是不相关的,这个问题仍然是不确定的。一个自然的问题可能出现如下:重叠观察或其他数据集的数量的增加是否有助于减少统计假设的总数,或者它们是否引入了更多的统计未知数?为了回答这一问题,本文对多数据集的统计误差估计问题进行了概念性的阐述,包括一个广义的数学公式,一个综合数据的说明,以及建立和解决问题的指导方针。结果表明,所需的统计假设数量随着数据集数量的增加而线性增加。然而,可以估计的错误统计数量呈二次增长,允许对超过三个数据集的数据集之间不断增加的错误交叉统计数量进行估计。本文提出的数据集间全误差协方差和交叉协方差矩阵的广义估计,并不一定会在多个数据集的误差估计中累积假设的不确定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nonlinear Processes in Geophysics
Nonlinear Processes in Geophysics 地学-地球化学与地球物理
CiteScore
4.00
自引率
0.00%
发文量
21
审稿时长
6-12 weeks
期刊介绍: Nonlinear Processes in Geophysics (NPG) is an international, inter-/trans-disciplinary, non-profit journal devoted to breaking the deadlocks often faced by standard approaches in Earth and space sciences. It therefore solicits disruptive and innovative concepts and methodologies, as well as original applications of these to address the ubiquitous complexity in geoscience systems, and in interacting social and biological systems. Such systems are nonlinear, with responses strongly non-proportional to perturbations, and show an associated extreme variability across scales.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信