利用轨道遥感和再分析数据改进日降水估算的机器学习模型评估

IF 2.8 3区地球科学 Q2 METEOROLOGY & ATMOSPHERIC SCIENCES

International Journal of Climatology Pub Date : 2025-07-08 DOI:10.1002/joc.70036

Emerson da Silva Freitas, Victor Hugo Rabelo Coelho, Guillaume Francis Bertrand, Filipe Carvalho Lemos, Cristiano das Neves Almeida

{"title":"利用轨道遥感和再分析数据改进日降水估算的机器学习模型评估","authors":"Emerson da Silva Freitas, Victor Hugo Rabelo Coelho, Guillaume Francis Bertrand, Filipe Carvalho Lemos, Cristiano das Neves Almeida","doi":"10.1002/joc.70036","DOIUrl":null,"url":null,"abstract":"<p>This study focuses on the evolution of the monthly IMERG (Integrated Multi-satellitE Retrievals for GPM) BraMaL (Brazilian Machine Learning) product, the so-called IMERG BraMaL-M, to produce accurate daily precipitation estimations in Brazil (IMERG BraMaL-D) without dependence on ground-based local or regional data. IMERG BraMaL-D uses the satellite-based precipitation product IMERG Early Run and 53 re-analysis variables from MERRA-2 (Global Modelling and Assimilation Office) as inputs for calibration. To achieve the main goal, we evaluated the performance of single (SML), double (DML) and multiple (MML) machine learning methods, using combinations of 6 regression and 8 classification models. The evaluation showed that the K-nearest neighbours (KNN) and the random forest (RF) were the best regression and classification models, respectively. The MML method, combining 5 regression models, was chosen to produce the IMERG BraMaL-D product because it performed statistically better when compared with the observed data from 3227 rain gauges. Compared with the original calibrated product of IMERG (i.e., the Final Run) and three other global satellite-based precipitation products (i.e., PERSIANN-CDR, MSWEP and CHIRPS), IMERG BraMaL-D statistically presented a better performance for almost all analyses. For instance, IMERG BraMaL-D exhibited a KGE (Kling-Gupta Efficiency) of 0.70 for daily estimations, against values ranging from 0.05 (PERSSIANN-DCR) to 0.66 (IMERG Final Run) for the other analysed global products. The monthly accumulated estimations of IMERG BraMaL-D also presented better performance, with smaller data dispersion and KGE rising from 0.86 to 0.95 when compared with IMERG BraMaL-M. Like IMERG BraMaL-M, the main advantages of the IMERG BraMaL-D product are the non-dependency on ground-based datasets after the model's calibration, the improvement of precipitation estimations where the satellite-based products usually underestimate the rain gauge data, and the faster availability to the end-users.</p>","PeriodicalId":13779,"journal":{"name":"International Journal of Climatology","volume":"45 12","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rmets.onlinelibrary.wiley.com/doi/epdf/10.1002/joc.70036","citationCount":"0","resultStr":"{\"title\":\"Evaluation of Machine Learning Models to Improve Daily Precipitation Estimations From Orbital Remote Sensing and Reanalysis Data\",\"authors\":\"Emerson da Silva Freitas, Victor Hugo Rabelo Coelho, Guillaume Francis Bertrand, Filipe Carvalho Lemos, Cristiano das Neves Almeida\",\"doi\":\"10.1002/joc.70036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This study focuses on the evolution of the monthly IMERG (Integrated Multi-satellitE Retrievals for GPM) BraMaL (Brazilian Machine Learning) product, the so-called IMERG BraMaL-M, to produce accurate daily precipitation estimations in Brazil (IMERG BraMaL-D) without dependence on ground-based local or regional data. IMERG BraMaL-D uses the satellite-based precipitation product IMERG Early Run and 53 re-analysis variables from MERRA-2 (Global Modelling and Assimilation Office) as inputs for calibration. To achieve the main goal, we evaluated the performance of single (SML), double (DML) and multiple (MML) machine learning methods, using combinations of 6 regression and 8 classification models. The evaluation showed that the K-nearest neighbours (KNN) and the random forest (RF) were the best regression and classification models, respectively. The MML method, combining 5 regression models, was chosen to produce the IMERG BraMaL-D product because it performed statistically better when compared with the observed data from 3227 rain gauges. Compared with the original calibrated product of IMERG (i.e., the Final Run) and three other global satellite-based precipitation products (i.e., PERSIANN-CDR, MSWEP and CHIRPS), IMERG BraMaL-D statistically presented a better performance for almost all analyses. For instance, IMERG BraMaL-D exhibited a KGE (Kling-Gupta Efficiency) of 0.70 for daily estimations, against values ranging from 0.05 (PERSSIANN-DCR) to 0.66 (IMERG Final Run) for the other analysed global products. The monthly accumulated estimations of IMERG BraMaL-D also presented better performance, with smaller data dispersion and KGE rising from 0.86 to 0.95 when compared with IMERG BraMaL-M. Like IMERG BraMaL-M, the main advantages of the IMERG BraMaL-D product are the non-dependency on ground-based datasets after the model's calibration, the improvement of precipitation estimations where the satellite-based products usually underestimate the rain gauge data, and the faster availability to the end-users.</p>\",\"PeriodicalId\":13779,\"journal\":{\"name\":\"International Journal of Climatology\",\"volume\":\"45 12\",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://rmets.onlinelibrary.wiley.com/doi/epdf/10.1002/joc.70036\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Climatology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://rmets.onlinelibrary.wiley.com/doi/10.1002/joc.70036\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Climatology","FirstCategoryId":"89","ListUrlMain":"https://rmets.onlinelibrary.wiley.com/doi/10.1002/joc.70036","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

本研究的重点是每月IMERG（综合多卫星检索GPM） BraMaL（巴西机器学习）产品的演变，即所谓的IMERG BraMaL- m，在不依赖地面本地或区域数据的情况下，产生准确的巴西日降水估计（IMERG BraMaL- d）。IMERG BraMaL-D使用基于卫星的降水产品IMERG Early Run和来自MERRA-2（全球模拟和同化办公室）的53个再分析变量作为校准输入。为了实现主要目标，我们评估了单（SML），双（DML）和多（MML）机器学习方法的性能，使用6种回归和8种分类模型的组合。评价结果表明，k近邻模型（KNN）和随机森林模型（RF）分别是最佳的回归模型和分类模型。由于与3227个雨量计的观测数据相比，在统计上表现更好，因此选择结合5个回归模型的MML方法来生成IMERG BraMaL-D产品。与IMERG的原始校准产品（即Final Run）和其他三种全球卫星降水产品（即PERSIANN-CDR， MSWEP和CHIRPS）相比，IMERG BraMaL-D在几乎所有分析中都具有更好的统计性能。例如，IMERG BraMaL-D显示每日估计的KGE（克林-古普塔效率）为0.70，而其他分析的全球产品的值从0.05 （perssian - dcr）到0.66 （IMERG Final Run）不等。与IMERG BraMaL-M相比，IMERG BraMaL-D的月累积估计也表现出更好的性能，数据离散度更小，KGE从0.86上升到0.95。与IMERG BraMaL-M一样，IMERG BraMaL-D产品的主要优点是在模式定标后不依赖地面数据集，改进了卫星产品通常低估雨量计数据的降水估计，以及更快地提供给最终用户。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Evaluation of Machine Learning Models to Improve Daily Precipitation Estimations From Orbital Remote Sensing and Reanalysis Data

查看原文本刊更多论文

Evaluation of Machine Learning Models to Improve Daily Precipitation Estimations From Orbital Remote Sensing and Reanalysis Data

This study focuses on the evolution of the monthly IMERG (Integrated Multi-satellitE Retrievals for GPM) BraMaL (Brazilian Machine Learning) product, the so-called IMERG BraMaL-M, to produce accurate daily precipitation estimations in Brazil (IMERG BraMaL-D) without dependence on ground-based local or regional data. IMERG BraMaL-D uses the satellite-based precipitation product IMERG Early Run and 53 re-analysis variables from MERRA-2 (Global Modelling and Assimilation Office) as inputs for calibration. To achieve the main goal, we evaluated the performance of single (SML), double (DML) and multiple (MML) machine learning methods, using combinations of 6 regression and 8 classification models. The evaluation showed that the K-nearest neighbours (KNN) and the random forest (RF) were the best regression and classification models, respectively. The MML method, combining 5 regression models, was chosen to produce the IMERG BraMaL-D product because it performed statistically better when compared with the observed data from 3227 rain gauges. Compared with the original calibrated product of IMERG (i.e., the Final Run) and three other global satellite-based precipitation products (i.e., PERSIANN-CDR, MSWEP and CHIRPS), IMERG BraMaL-D statistically presented a better performance for almost all analyses. For instance, IMERG BraMaL-D exhibited a KGE (Kling-Gupta Efficiency) of 0.70 for daily estimations, against values ranging from 0.05 (PERSSIANN-DCR) to 0.66 (IMERG Final Run) for the other analysed global products. The monthly accumulated estimations of IMERG BraMaL-D also presented better performance, with smaller data dispersion and KGE rising from 0.86 to 0.95 when compared with IMERG BraMaL-M. Like IMERG BraMaL-M, the main advantages of the IMERG BraMaL-D product are the non-dependency on ground-based datasets after the model's calibration, the improvement of precipitation estimations where the satellite-based products usually underestimate the rain gauge data, and the faster availability to the end-users.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Climatology 地学-气象与大气科学

CiteScore

7.50

自引率

7.70%

发文量

417

审稿时长

4 months

期刊介绍： The International Journal of Climatology aims to span the well established but rapidly growing field of climatology, through the publication of research papers, short communications, major reviews of progress and reviews of new books and reports in the area of climate science. The Journal’s main role is to stimulate and report research in climatology, from the expansive fields of the atmospheric, biophysical, engineering and social sciences. Coverage includes: Climate system science; Local to global scale climate observations and modelling; Seasonal to interannual climate prediction; Climatic variability and climate change; Synoptic, dynamic and urban climatology, hydroclimatology, human bioclimatology, ecoclimatology, dendroclimatology, palaeoclimatology, marine climatology and atmosphere-ocean interactions; Application of climatological knowledge to environmental assessment and management and economic production; Climate and society interactions