N. L. Kushwaha, Jitendra Rajput, D. Sena, A. Elbeltagi, Dhananjai Singh, I. Mani
{"title":"Evaluation of Data-driven Hybrid Machine Learning Algorithms for Modelling Daily Reference Evapotranspiration","authors":"N. L. Kushwaha, Jitendra Rajput, D. Sena, A. Elbeltagi, Dhananjai Singh, I. Mani","doi":"10.1080/07055900.2022.2087589","DOIUrl":null,"url":null,"abstract":"ABSTRACT Reference evapotranspiration (ET0) is one of the crucial variables used for irrigation scheduling, agricultural production, and water balance studies. This study compares six different models with sequential inclusion of six meteorological input variables such as minimum temperature (Tmin), maximum temperature (Tmax), mean relative humidity (RH), wind speed (SW), sunshine hours (HSS), and solar radiation (RS), which are necessarily used in physical or empirical-based models to estimate ET0. Each model utilized three variants of machine learning algorithms, i.e. Additive Regression (AdR), Random Subspace (RSS), M5 Pruning tree (M5P) independently and four novel permutated hybrid combinations of these algorithms. To evaluate the efficacy of these hybridizations and the stability of machine learning models, a comprehensive evaluation of independent and hybrid models was performed. With more input variables, the model performances were found to be superior in terms of prediction accuracies. The model AdR6 that included all the 6 selected meteorological variables outperformed other models during the testing period, exhibiting statistical performance of MAPE (1.30), RMSE (0.07), RAE (2.41), RRSE (3.10), and R 2 (0.998). However, the AdR algorithm, alone, was found to capture about 86% of variance in the observed data conforming to the 95% confidence band across all models irrespective of the number of input variables used to predict ET0. The RSS algorithm, in comparison to other algorithms, failed to capture the observed trends even with all the input variables. The hybrid combinations of algorithms with AdR as a constituent were better performers in terms of their prediction accuracies but remained inferior to AdR as an individual performer. All the algorithms are better predictors of the higher values of ET0 that included values beyond the 75% quartile.","PeriodicalId":55434,"journal":{"name":"Atmosphere-Ocean","volume":"60 1","pages":"519 - 540"},"PeriodicalIF":1.6000,"publicationDate":"2022-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmosphere-Ocean","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1080/07055900.2022.2087589","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 16
Abstract
ABSTRACT Reference evapotranspiration (ET0) is one of the crucial variables used for irrigation scheduling, agricultural production, and water balance studies. This study compares six different models with sequential inclusion of six meteorological input variables such as minimum temperature (Tmin), maximum temperature (Tmax), mean relative humidity (RH), wind speed (SW), sunshine hours (HSS), and solar radiation (RS), which are necessarily used in physical or empirical-based models to estimate ET0. Each model utilized three variants of machine learning algorithms, i.e. Additive Regression (AdR), Random Subspace (RSS), M5 Pruning tree (M5P) independently and four novel permutated hybrid combinations of these algorithms. To evaluate the efficacy of these hybridizations and the stability of machine learning models, a comprehensive evaluation of independent and hybrid models was performed. With more input variables, the model performances were found to be superior in terms of prediction accuracies. The model AdR6 that included all the 6 selected meteorological variables outperformed other models during the testing period, exhibiting statistical performance of MAPE (1.30), RMSE (0.07), RAE (2.41), RRSE (3.10), and R 2 (0.998). However, the AdR algorithm, alone, was found to capture about 86% of variance in the observed data conforming to the 95% confidence band across all models irrespective of the number of input variables used to predict ET0. The RSS algorithm, in comparison to other algorithms, failed to capture the observed trends even with all the input variables. The hybrid combinations of algorithms with AdR as a constituent were better performers in terms of their prediction accuracies but remained inferior to AdR as an individual performer. All the algorithms are better predictors of the higher values of ET0 that included values beyond the 75% quartile.
期刊介绍:
Atmosphere-Ocean is the principal scientific journal of the Canadian Meteorological and Oceanographic Society (CMOS). It contains results of original research, survey articles, notes and comments on published papers in all fields of the atmospheric, oceanographic and hydrological sciences. Arctic, coastal and mid- to high-latitude regions are areas of particular interest. Applied or fundamental research contributions in English or French on the following topics are welcomed:
climate and climatology;
observation technology, remote sensing;
forecasting, modelling, numerical methods;
physics, dynamics, chemistry, biogeochemistry;
boundary layers, pollution, aerosols;
circulation, cloud physics, hydrology, air-sea interactions;
waves, ice, energy exchange and related environmental topics.