A comparison of missing values imputation methods applied to precipitation of two semi-arid and humid regions of México

IF 1 4区 地球科学 Q4 METEOROLOGY & ATMOSPHERIC SCIENCES
Atmosfera Pub Date : 2022-08-02 DOI:10.20937/atm.53095
Juan Manuel Navarro Céspedes, Jesús Horacio Hernández, Pedro Camilo Alcántara Concepción, Jorge Luis Morales Martínez, Gilberto Carreño Aguilera, Francisco Padilla Benítez
{"title":"A comparison of missing values imputation methods applied to precipitation of two\n semi-arid and humid regions of México","authors":"Juan Manuel Navarro Céspedes, Jesús Horacio Hernández, Pedro Camilo Alcántara Concepción, Jorge Luis Morales Martínez, Gilberto Carreño Aguilera, Francisco Padilla Benítez","doi":"10.20937/atm.53095","DOIUrl":null,"url":null,"abstract":"Climatological data with unreliable or missing values is an important area of\n research, and multiple methods are available to fill in missing data and evaluate data\n quality. Our study aims to compare the performance of different methods for estimating\n missing values that are explicitly designed for precipitation and multipurpose\n hydrological data. The climate variable used for the analysis was daily precipitation.\n We considered two different climate and orographic regions to evaluate the effects of\n altitude, precipitation regime and percentage of missing data on the Mean Absolute Error\n of imputed values and using a homogeneity evaluation of meteorological stations. We\n excluded from the analysis meteorological stations with more than 25% missing data. In\n the semi-arid region, ReddPrec (optimal for 9 stations), and GCIDW (optimal for 8) were\n the best performing methods for the 23 stations, with average MAE values of 1.63 mm/day\n and 1.46 mm/day, respectively. In the humid region, GCIDW was optimal in ~59% of\n stations, EM in ~24%, and ReddPrec in ~17%, with average MAE values of ~6.0 mm/day, 6.5\n mm/day and ~9.8 mm/day, respectively. This research makes an important contribution to\n identifying the most appropriate methods to impute daily precipitation in different\n climatic regions of Mexico based on efficiency indicators and homogeneity\n evaluation.","PeriodicalId":55576,"journal":{"name":"Atmosfera","volume":"1 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2022-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmosfera","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.20937/atm.53095","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Climatological data with unreliable or missing values is an important area of research, and multiple methods are available to fill in missing data and evaluate data quality. Our study aims to compare the performance of different methods for estimating missing values that are explicitly designed for precipitation and multipurpose hydrological data. The climate variable used for the analysis was daily precipitation. We considered two different climate and orographic regions to evaluate the effects of altitude, precipitation regime and percentage of missing data on the Mean Absolute Error of imputed values and using a homogeneity evaluation of meteorological stations. We excluded from the analysis meteorological stations with more than 25% missing data. In the semi-arid region, ReddPrec (optimal for 9 stations), and GCIDW (optimal for 8) were the best performing methods for the 23 stations, with average MAE values of 1.63 mm/day and 1.46 mm/day, respectively. In the humid region, GCIDW was optimal in ~59% of stations, EM in ~24%, and ReddPrec in ~17%, with average MAE values of ~6.0 mm/day, 6.5 mm/day and ~9.8 mm/day, respectively. This research makes an important contribution to identifying the most appropriate methods to impute daily precipitation in different climatic regions of Mexico based on efficiency indicators and homogeneity evaluation.
缺失值估算方法在青海两个半干湿地区降水中的应用比较
不可靠或缺失值的气候数据是一个重要的研究领域,有多种方法可以填补缺失数据和评估数据质量。我们的研究旨在比较为降水和多用途水文数据明确设计的估算缺失值的不同方法的性能。用于分析的气候变量为日降水量。我们考虑了两个不同的气候和地形区域,利用气象站的同质性评估,评估了海拔、降水状况和缺失数据百分比对估算值平均绝对误差的影响。我们将丢失数据超过25%的气象站排除在分析之外。在半干旱区,23个站点的平均MAE值分别为1.63 mm/d和1.46 mm/d,其中ReddPrec(9个站点最优)和GCIDW(8个站点最优)表现最好。在湿润地区,GCIDW、EM和ReddPrec分别有59%、24%和17%的站点最优,平均MAE值分别为6.0 mm/day、6.5 mm/day和9.8 mm/day。本研究为确定基于效率指标和均匀性评价的墨西哥不同气候区日降水量估算方法做出了重要贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Atmosfera
Atmosfera 地学-气象与大气科学
CiteScore
2.20
自引率
0.00%
发文量
46
审稿时长
6 months
期刊介绍: ATMÓSFERA seeks contributions on theoretical, basic, empirical and applied research in all the areas of atmospheric sciences, with emphasis on meteorology, climatology, aeronomy, physics, chemistry, and aerobiology. Interdisciplinary contributions are also accepted; especially those related with oceanography, hydrology, climate variability and change, ecology, forestry, glaciology, agriculture, environmental pollution, and other topics related to economy and society as they are affected by atmospheric hazards.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信