完成1951 - 2019年意大利中部日降水仪器数据系列

IF 2.4 3区地球科学 Q2 GEOSCIENCES, MULTIDISCIPLINARY

Geoscience Data Journal Pub Date : 2024-09-24 DOI:10.1002/gdj3.267

Gamal AbdElNasser Allam Abouzied, Guoqiang Tang, Simon Michael Papalexiou, Martyn P. Clark, Eleonora Aruffo, Piero Di Carlo

{"title":"完成1951 - 2019年意大利中部日降水仪器数据系列","authors":"Gamal AbdElNasser Allam Abouzied, Guoqiang Tang, Simon Michael Papalexiou, Martyn P. Clark, Eleonora Aruffo, Piero Di Carlo","doi":"10.1002/gdj3.267","DOIUrl":null,"url":null,"abstract":"Precipitation is a critical part of the global hydrological cycle that determines the distribution of water resources. It is also an essential meteorological variable used as input for hydroclimatic models and projections. However, precipitation data frequently lack complete series, especially at daily and sub-daily precipitation stations, which are usually large, bulky, and complex. To address this, gap filling is commonly used to produce complete hydrometeorological data series without missing values. Several gap-filling methods have been developed and improved. This study seeks to fill the gaps of 201 daily precipitation time series in Central Italy by localizing the approach used to generate the Serially Complete dataset for the Planet Earth (SC-Earth). This method combines the outcome of 15 strategies based on four various gap-filling techniques (quantile mapping, spatial interpolation, machine learning, and multi-strategy merging). These strategies employ the daily dataset of the neighbouring stations and the matched ERA5 data to estimate missing values at the target stations. Both raw data and the final serially complete station datasets (SCDs) underwent comprehensive quality control. Many accuracy indicators have been utilized to evaluate the performance of the strategies' estimations and the final SCD, such as Correlation Coefficient (CC), Root mean square error (RMSE), Relative bias (Bias %), and Kling-Gupta efficiency (KGE″). Multi-strategy merging strategy based on the Modified Kling-Gupta efficiency (MS1) shows the highest performance as an individual precipitation gap-filling strategy. However, the machine learning strategy using random forest (ML3) has the most outstanding share in the final estimates among all other strategies. In the end, the temporal–spatial performance of the final SCD is promising and depends on the pattern of the missing values (MV%). The mean values of KGE″, CC, variability (α), and bias term (β) are 0.9, 0.93, 1.064, and 4.98 × 10−7, respectively.","PeriodicalId":54351,"journal":{"name":"Geoscience Data Journal","volume":"12 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gdj3.267","citationCount":"0","resultStr":"{\"title\":\"Completion of the Central Italy daily precipitation instrumental data series from 1951 to 2019\",\"authors\":\"Gamal AbdElNasser Allam Abouzied, Guoqiang Tang, Simon Michael Papalexiou, Martyn P. Clark, Eleonora Aruffo, Piero Di Carlo\",\"doi\":\"10.1002/gdj3.267\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Precipitation is a critical part of the global hydrological cycle that determines the distribution of water resources. It is also an essential meteorological variable used as input for hydroclimatic models and projections. However, precipitation data frequently lack complete series, especially at daily and sub-daily precipitation stations, which are usually large, bulky, and complex. To address this, gap filling is commonly used to produce complete hydrometeorological data series without missing values. Several gap-filling methods have been developed and improved. This study seeks to fill the gaps of 201 daily precipitation time series in Central Italy by localizing the approach used to generate the Serially Complete dataset for the Planet Earth (SC-Earth). This method combines the outcome of 15 strategies based on four various gap-filling techniques (quantile mapping, spatial interpolation, machine learning, and multi-strategy merging). These strategies employ the daily dataset of the neighbouring stations and the matched ERA5 data to estimate missing values at the target stations. Both raw data and the final serially complete station datasets (SCDs) underwent comprehensive quality control. Many accuracy indicators have been utilized to evaluate the performance of the strategies' estimations and the final SCD, such as Correlation Coefficient (CC), Root mean square error (RMSE), Relative bias (Bias %), and Kling-Gupta efficiency (KGE″). Multi-strategy merging strategy based on the Modified Kling-Gupta efficiency (MS1) shows the highest performance as an individual precipitation gap-filling strategy. However, the machine learning strategy using random forest (ML3) has the most outstanding share in the final estimates among all other strategies. In the end, the temporal–spatial performance of the final SCD is promising and depends on the pattern of the missing values (MV%). The mean values of KGE″, CC, variability (α), and bias term (β) are 0.9, 0.93, 1.064, and 4.98 × 10−7, respectively.\",\"PeriodicalId\":54351,\"journal\":{\"name\":\"Geoscience Data Journal\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gdj3.267\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoscience Data Journal\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://rmets.onlinelibrary.wiley.com/doi/10.1002/gdj3.267\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscience Data Journal","FirstCategoryId":"89","ListUrlMain":"https://rmets.onlinelibrary.wiley.com/doi/10.1002/gdj3.267","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

降水是全球水循环的重要组成部分，决定了水资源的分布。它也是一个重要的气象变量，用作水文气候模式和预测的输入。然而，降水资料往往缺乏完整的序列，特别是日降水和次日降水台站，这些台站通常是庞大、笨重和复杂的。为了解决这个问题，通常使用空白填充来生成完整的水文气象数据序列，而不存在缺失值。已经开发和改进了几种间隙填充方法。本研究试图通过本地化用于生成行星地球序列完整数据集（SC-Earth）的方法来填补意大利中部201日降水时间序列的空白。该方法结合了基于四种不同空白填充技术（分位数映射、空间插值、机器学习和多策略合并）的15种策略的结果。这些策略利用邻近站点的每日数据集和匹配的ERA5数据来估计目标站点的缺失值。原始数据和最终的连续完整站数据集（SCDs）都进行了全面的质量控制。许多准确度指标被用来评估策略估计的性能和最终SCD，如相关系数（CC）、均方根误差（RMSE）、相对偏差（bias %）和克林-古普塔效率（KGE″）。基于修正克林-古普塔效率（MS1）的多策略合并策略作为单个降水缺口填充策略表现出最高的性能。然而，在所有其他策略中，使用随机森林（ML3）的机器学习策略在最终估计中占有最突出的份额。最后，最终的SCD的时空性能是有希望的，这取决于缺失值（MV%）的模式。KGE″、CC、变异性（α）和偏倚项（β）的平均值分别为0.9、0.93、1.064和4.98 × 10−7。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Completion of the Central Italy daily precipitation instrumental data series from 1951 to 2019

查看原文本刊更多论文

Completion of the Central Italy daily precipitation instrumental data series from 1951 to 2019

Precipitation is a critical part of the global hydrological cycle that determines the distribution of water resources. It is also an essential meteorological variable used as input for hydroclimatic models and projections. However, precipitation data frequently lack complete series, especially at daily and sub-daily precipitation stations, which are usually large, bulky, and complex. To address this, gap filling is commonly used to produce complete hydrometeorological data series without missing values. Several gap-filling methods have been developed and improved. This study seeks to fill the gaps of 201 daily precipitation time series in Central Italy by localizing the approach used to generate the Serially Complete dataset for the Planet Earth (SC-Earth). This method combines the outcome of 15 strategies based on four various gap-filling techniques (quantile mapping, spatial interpolation, machine learning, and multi-strategy merging). These strategies employ the daily dataset of the neighbouring stations and the matched ERA5 data to estimate missing values at the target stations. Both raw data and the final serially complete station datasets (SCDs) underwent comprehensive quality control. Many accuracy indicators have been utilized to evaluate the performance of the strategies' estimations and the final SCD, such as Correlation Coefficient (CC), Root mean square error (RMSE), Relative bias (Bias %), and Kling-Gupta efficiency (KGE″). Multi-strategy merging strategy based on the Modified Kling-Gupta efficiency (MS₁) shows the highest performance as an individual precipitation gap-filling strategy. However, the machine learning strategy using random forest (ML₃) has the most outstanding share in the final estimates among all other strategies. In the end, the temporal–spatial performance of the final SCD is promising and depends on the pattern of the missing values (MV%). The mean values of KGE″, CC, variability (α), and bias term (β) are 0.9, 0.93, 1.064, and 4.98 × 10⁻⁷, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Geoscience Data Journal GEOSCIENCES, MULTIDISCIPLINARYMETEOROLOGY-METEOROLOGY & ATMOSPHERIC SCIENCES

CiteScore

5.90

自引率

9.40%

发文量

审稿时长

4 weeks

期刊介绍： Geoscience Data Journal provides an Open Access platform where scientific data can be formally published, in a way that includes scientific peer-review. Thus the dataset creator attains full credit for their efforts, while also improving the scientific record, providing version control for the community and allowing major datasets to be fully described, cited and discovered. An online-only journal, GDJ publishes short data papers cross-linked to – and citing – datasets that have been deposited in approved data centres and awarded DOIs. The journal will also accept articles on data services, and articles which support and inform data publishing best practices. Data is at the heart of science and scientific endeavour. The curation of data and the science associated with it is as important as ever in our understanding of the changing earth system and thereby enabling us to make future predictions. Geoscience Data Journal is working with recognised Data Centres across the globe to develop the future strategy for data publication, the recognition of the value of data and the communication and exploitation of data to the wider science and stakeholder communities.