Machine Learning Methods Based on Limited Meteorological Data to Simulate Potential Evapotranspiration: A Case Study of Source Region of Yellow River Basin
IF 3.5 3区 地球科学Q2 METEOROLOGY & ATMOSPHERIC SCIENCES
{"title":"Machine Learning Methods Based on Limited Meteorological Data to Simulate Potential Evapotranspiration: A Case Study of Source Region of Yellow River Basin","authors":"Yinan Pei, Shengqi Jian, Guodong Zhang","doi":"10.1002/joc.8717","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Simulation of potential evapotranspiration (<i>PET</i>) is an important part of drought warning and water resource planning. However, the commonly used empirical models need to input a large number of meteorological elements. Therefore, to improve the efficiency and accuracy of <i>PET</i> simulation in areas lacking meteorological data, this study evaluated the performance of Extreme learning machine (ELM), multi-layer perceptron (MLP), and Random Forest (RF) three machine learning methods to simulate daily <i>PET</i> using limited meteorological data in the source region of the Yellow River (SYRB). Two bionic optimization algorithms, Grey Wolf Optimizer (GWO) and Sparrow Search Algorithm (SSA), were used to optimise the hyperparameters of the model to improve the accuracy of the model. In addition, the effect of months on daily <i>PET</i> simulations was evaluated. The <b>results</b> showed that the daily maximum temperature (<i>T</i>\n <sub>max</sub>) was the most important factor affecting the <i>PET</i> simulation, and the daily average relative humidity (<i>RH</i>) and wind speed (<i>U</i>\n <sub>10</sub>) were the secondary factors. It is recommended to use <i>T</i>\n <sub>max</sub>, <i>RH</i>, <i>U</i>\n <sub>10</sub>, and sunshine duration as the optimum combination of input (<i>R</i>\n <sup>2</sup> > 0.95). In the case of limited meteorological data, the input combination of <i>T</i>\n <sub>max</sub>, RH, <i>U</i>\n <sub>10</sub>, or <i>T</i>\n <sub>max</sub>, <i>RH</i> (<i>R</i>\n <sup>2</sup> > 0.75) was considered. Considering the accuracy and the time and space overhead of the model, the ELM-GWO model is recommended. When month information was used as an input factor, model performance improved in all scenarios, and June to July was the most accurate month for the model to simulate daily <i>PET</i>. This research resultwill allow researchers to choose the appropriate meteorological factor when simulating the <i>PET</i> to provide the reference.</p>\n </div>","PeriodicalId":13779,"journal":{"name":"International Journal of Climatology","volume":"45 3","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Climatology","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/joc.8717","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Simulation of potential evapotranspiration (PET) is an important part of drought warning and water resource planning. However, the commonly used empirical models need to input a large number of meteorological elements. Therefore, to improve the efficiency and accuracy of PET simulation in areas lacking meteorological data, this study evaluated the performance of Extreme learning machine (ELM), multi-layer perceptron (MLP), and Random Forest (RF) three machine learning methods to simulate daily PET using limited meteorological data in the source region of the Yellow River (SYRB). Two bionic optimization algorithms, Grey Wolf Optimizer (GWO) and Sparrow Search Algorithm (SSA), were used to optimise the hyperparameters of the model to improve the accuracy of the model. In addition, the effect of months on daily PET simulations was evaluated. The results showed that the daily maximum temperature (Tmax) was the most important factor affecting the PET simulation, and the daily average relative humidity (RH) and wind speed (U10) were the secondary factors. It is recommended to use Tmax, RH, U10, and sunshine duration as the optimum combination of input (R2 > 0.95). In the case of limited meteorological data, the input combination of Tmax, RH, U10, or Tmax, RH (R2 > 0.75) was considered. Considering the accuracy and the time and space overhead of the model, the ELM-GWO model is recommended. When month information was used as an input factor, model performance improved in all scenarios, and June to July was the most accurate month for the model to simulate daily PET. This research resultwill allow researchers to choose the appropriate meteorological factor when simulating the PET to provide the reference.
潜在蒸散量(PET)模拟是干旱预警和水资源规划的重要组成部分。然而,常用的经验模型需要输入大量气象要素。因此,为了提高在缺乏气象数据地区模拟 PET 的效率和准确性,本研究评估了极限学习机(ELM)、多层感知器(MLP)和随机森林(RF)三种机器学习方法的性能,以利用黄河源区(SYRB)有限的气象数据模拟日 PET。灰狼优化器(GWO)和麻雀搜索算法(SSA)这两种仿生优化算法用于优化模型的超参数,以提高模型的准确性。此外,还评估了月份对每日 PET 模拟的影响。结果表明,日最高气温(T max)是影响 PET 模拟的最重要因素,日平均相对湿度(RH)和风速(U 10)是次要因素。建议使用最大温度、相对湿度、U 10 和日照时间作为输入的最佳组合(R 2 > 0.95)。在气象数据有限的情况下,考虑使用 T max、RH、U 10 或 T max、RH 的输入组合(R 2 > 0.75)。考虑到模型的精度和时空开销,推荐使用 ELM-GWO 模型。当使用月份信息作为输入因素时,模型的性能在所有情况下都有所改善,6 月至 7 月是模型模拟日 PET 最准确的月份。这一研究成果将为研究人员在模拟 PET 时选择合适的气象因子提供参考。
期刊介绍:
The International Journal of Climatology aims to span the well established but rapidly growing field of climatology, through the publication of research papers, short communications, major reviews of progress and reviews of new books and reports in the area of climate science. The Journal’s main role is to stimulate and report research in climatology, from the expansive fields of the atmospheric, biophysical, engineering and social sciences. Coverage includes: Climate system science; Local to global scale climate observations and modelling; Seasonal to interannual climate prediction; Climatic variability and climate change; Synoptic, dynamic and urban climatology, hydroclimatology, human bioclimatology, ecoclimatology, dendroclimatology, palaeoclimatology, marine climatology and atmosphere-ocean interactions; Application of climatological knowledge to environmental assessment and management and economic production; Climate and society interactions