An evaluation of the performance of imputation methods for missing meteorological data in Burkina Faso and Senegal

Diouf Semou, Deme Abdoulaye, Hadji Deme El, Fall Papa, Diouf Ibrahima
{"title":"An evaluation of the performance of imputation methods for missing meteorological data in Burkina Faso and Senegal","authors":"Diouf Semou, Deme Abdoulaye, Hadji Deme El, Fall Papa, Diouf Ibrahima","doi":"10.5897/ajest2023.3221","DOIUrl":null,"url":null,"abstract":"Addressing data incompleteness issues is crucial for reliable climate studies, especially in regions like Africa that commonly experience data gaps. This study aims to evaluate the performance of five imputation methods (knn, ppca, mice, imputeTS, and missForest) on meteorological data from stations in Burkina Faso and Senegal. The imputed data is compared with ERA5 reanalysis data to validate its accuracy. Temperature, relative humidity, and precipitation observations from the GSOD dataset (1973-2020) were used, creating subsets with missing rates of 5, 10, 20, 30 and 40%. An evaluation was conducted using the Taylor diagram and Kling-Gupta Efficiency (KGE). The results show a good estimation of temperature and relative humidity time series, with missForest performing the best for handling missing values. Precipitation estimation was less accurate, but there was strong agreement between estimated and observed data. ImputeTS was recommended for precipitation. Spatial consistency between imputed data and ERA5 reanalysis products was found. This research improves the quality of meteorological data, provides essential information about climatic characteristics, and serves as a foundation for climate change and weather modeling studies.   Key words: Meteorological data, imputation methods, Senegal, Burkina Faso.","PeriodicalId":7483,"journal":{"name":"African Journal of Environmental Science and Technology","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"African Journal of Environmental Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5897/ajest2023.3221","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Addressing data incompleteness issues is crucial for reliable climate studies, especially in regions like Africa that commonly experience data gaps. This study aims to evaluate the performance of five imputation methods (knn, ppca, mice, imputeTS, and missForest) on meteorological data from stations in Burkina Faso and Senegal. The imputed data is compared with ERA5 reanalysis data to validate its accuracy. Temperature, relative humidity, and precipitation observations from the GSOD dataset (1973-2020) were used, creating subsets with missing rates of 5, 10, 20, 30 and 40%. An evaluation was conducted using the Taylor diagram and Kling-Gupta Efficiency (KGE). The results show a good estimation of temperature and relative humidity time series, with missForest performing the best for handling missing values. Precipitation estimation was less accurate, but there was strong agreement between estimated and observed data. ImputeTS was recommended for precipitation. Spatial consistency between imputed data and ERA5 reanalysis products was found. This research improves the quality of meteorological data, provides essential information about climatic characteristics, and serves as a foundation for climate change and weather modeling studies.   Key words: Meteorological data, imputation methods, Senegal, Burkina Faso.
对布基纳法索和塞内加尔气象数据缺失的归算方法的性能评价
解决数据不完整问题对于可靠的气候研究至关重要,特别是在非洲等通常存在数据缺口的地区。本研究旨在评估5种估算方法(knn、ppca、mice、imputeTS和missForest)在布基纳法索和塞内加尔气象站气象数据上的表现。将输入数据与ERA5再分析数据进行对比,验证其准确性。使用GSOD数据集(1973-2020)的温度、相对湿度和降水观测数据,创建缺失率分别为5%、10%、20%、30%和40%的子集。采用泰勒图和克林-古普塔效率(KGE)进行评价。结果表明,misforest可以很好地估计温度和相对湿度时间序列,其中misforest在处理缺失值方面表现最好。降水估计不太准确,但估计数据与观测数据之间有很强的一致性。推荐使用ImputeTS进行沉淀。输入数据与ERA5再分析结果在空间上一致。这项研究提高了气象数据的质量,提供了有关气候特征的重要信息,并为气候变化和天气模式研究奠定了基础。,关键词:气象资料;归算方法;塞内加尔;
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信