精准农业气象预报的进展:从统计建模到基于变压器的架构

IF 3.9 3区 环境科学与生态学 Q1 ENGINEERING, CIVIL
Chouaib El Hachimi, Salwa Belaqziz, Saïd Khabba, Bouchra Ait Hssaine, Mohamed Hakim Kharrou, Abdelghani Chehbouni
{"title":"精准农业气象预报的进展:从统计建模到基于变压器的架构","authors":"Chouaib El Hachimi, Salwa Belaqziz, Saïd Khabba, Bouchra Ait Hssaine, Mohamed Hakim Kharrou, Abdelghani Chehbouni","doi":"10.1007/s00477-024-02778-0","DOIUrl":null,"url":null,"abstract":"<p>As precision agriculture (PA) advances, the demand for accurate and high-resolution weather forecasts becomes critical for optimizing agricultural management practices. Despite improvements in Numerical Weather Prediction (NWP) models, they lack the granularity and efficiency needed for PA. Data-driven models offer a promising alternative by integrating predictive capabilities closer to IoT edge data sources, but their efficacy requires evaluation. Here, this paper evaluates six models from three data-driven eras (statistical, machine learning, and deep learning) using agrometeorological data from an Automatic Weather Station (AWS) in Sidi Rahal, East Marrakech, central Morocco, covering 2013–2020 at half-hour intervals, including air temperature, solar radiation, and relative humidity. First, the data is quality-controlled through imputation using ERA5-Land. Then, the dataset was split into training (2013–2019) and evaluation (2020) sets, with validation horizons of 1 day, 3 days, and 1 week. Statistical models generally perform well in air temperature forecasting, occasionally surpassing other models. However, the Temporal Convolutional Neural Network (TCNN) consistently demonstrates superior performance for challenging variables, balancing low RMSE and high R<sup>2</sup> across various horizons, with some exceptions. Specifically, for relative humidity, the linear regression model achieves slightly lower RMSE (3,96% and 6,05%) compared to TCNN (4,00% and 6,79%) for 1 day and 3 days, respectively. Additionally, CatBoost outperforms TCNN for 1-week forecasts. In terms of training time, the Transformer requires the longest, followed by AutoARIMA and CatBoost. Uncertainty analysis of stochastic models using solar radiation showed the stable performance of TCNN with 0,80 and 0,01 for the RMSE and R<sup>2</sup> standard deviations, respectively. Considering the trade-off between performance, training time, and capturing complex relationships, TCNN emerges as the optimal choice. ANOVA, Tukey’s HSD and Mann-Whitney U statistical tests also confirmed TCNN’s performance. Finally, a comparison with the Global Forecast System (GFS) reveals TCNN’s clear superiority in all metrics, particularly evident for the RMSE of 3 days air temperature forecasts (TCNN: 1,96 °C, GFS: 3,59 °C).</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"59 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancements in weather forecasting for precision agriculture: From statistical modeling to transformer-based architectures\",\"authors\":\"Chouaib El Hachimi, Salwa Belaqziz, Saïd Khabba, Bouchra Ait Hssaine, Mohamed Hakim Kharrou, Abdelghani Chehbouni\",\"doi\":\"10.1007/s00477-024-02778-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>As precision agriculture (PA) advances, the demand for accurate and high-resolution weather forecasts becomes critical for optimizing agricultural management practices. Despite improvements in Numerical Weather Prediction (NWP) models, they lack the granularity and efficiency needed for PA. Data-driven models offer a promising alternative by integrating predictive capabilities closer to IoT edge data sources, but their efficacy requires evaluation. Here, this paper evaluates six models from three data-driven eras (statistical, machine learning, and deep learning) using agrometeorological data from an Automatic Weather Station (AWS) in Sidi Rahal, East Marrakech, central Morocco, covering 2013–2020 at half-hour intervals, including air temperature, solar radiation, and relative humidity. First, the data is quality-controlled through imputation using ERA5-Land. Then, the dataset was split into training (2013–2019) and evaluation (2020) sets, with validation horizons of 1 day, 3 days, and 1 week. Statistical models generally perform well in air temperature forecasting, occasionally surpassing other models. However, the Temporal Convolutional Neural Network (TCNN) consistently demonstrates superior performance for challenging variables, balancing low RMSE and high R<sup>2</sup> across various horizons, with some exceptions. Specifically, for relative humidity, the linear regression model achieves slightly lower RMSE (3,96% and 6,05%) compared to TCNN (4,00% and 6,79%) for 1 day and 3 days, respectively. Additionally, CatBoost outperforms TCNN for 1-week forecasts. In terms of training time, the Transformer requires the longest, followed by AutoARIMA and CatBoost. Uncertainty analysis of stochastic models using solar radiation showed the stable performance of TCNN with 0,80 and 0,01 for the RMSE and R<sup>2</sup> standard deviations, respectively. Considering the trade-off between performance, training time, and capturing complex relationships, TCNN emerges as the optimal choice. ANOVA, Tukey’s HSD and Mann-Whitney U statistical tests also confirmed TCNN’s performance. Finally, a comparison with the Global Forecast System (GFS) reveals TCNN’s clear superiority in all metrics, particularly evident for the RMSE of 3 days air temperature forecasts (TCNN: 1,96 °C, GFS: 3,59 °C).</p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"59 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02778-0\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02778-0","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

随着精准农业(PA)的发展,对精确和高分辨率天气预报的需求成为优化农业管理实践的关键。尽管数值天气预报(NWP)模型有所改进,但仍缺乏精准农业所需的粒度和效率。数据驱动模型通过将预测能力整合到更接近物联网边缘数据源的地方,提供了一种有前途的替代方案,但其功效需要评估。本文利用摩洛哥中部马拉喀什东部 Sidi Rahal 自动气象站(AWS)提供的 2013-2020 年半小时间隔的农业气象数据,评估了三个数据驱动时代(统计、机器学习和深度学习)的六个模型,包括气温、太阳辐射和相对湿度。首先,通过使用ERA5-Land进行估算,对数据进行质量控制。然后,将数据集分为训练集(2013-2019 年)和评估集(2020 年),验证范围分别为 1 天、3 天和 1 周。统计模型在气温预报中通常表现良好,有时甚至超过其他模型。然而,时空卷积神经网络(TCNN)在具有挑战性的变量方面始终表现出卓越的性能,在各种范围内都兼顾了低 RMSE 和高 R2,但也有一些例外。具体来说,对于相对湿度,线性回归模型的 RMSE(3.96% 和 6.05%)略低于 TCNN(4.00% 和 6.79%)(分别为 1 天和 3 天)。此外,在 1 周预测方面,CatBoost 优于 TCNN。就训练时间而言,Transformer 需要的时间最长,其次是 AutoARIMA 和 CatBoost。利用太阳辐射对随机模型进行的不确定性分析表明,TCNN 性能稳定,RMSE 和 R2 标准偏差分别为 0.80 和 0.01。考虑到性能、训练时间和捕捉复杂关系之间的权衡,TCNN 成为最佳选择。方差分析、Tukey's HSD 和 Mann-Whitney U 统计检验也证实了 TCNN 的性能。最后,与全球预报系统(GFS)的比较显示,TCNN 在所有指标上都具有明显优势,尤其是 3 天气温预报的均方根误差(TCNN:1.96 °C,GFS:3.59 °C)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Advancements in weather forecasting for precision agriculture: From statistical modeling to transformer-based architectures

Advancements in weather forecasting for precision agriculture: From statistical modeling to transformer-based architectures

As precision agriculture (PA) advances, the demand for accurate and high-resolution weather forecasts becomes critical for optimizing agricultural management practices. Despite improvements in Numerical Weather Prediction (NWP) models, they lack the granularity and efficiency needed for PA. Data-driven models offer a promising alternative by integrating predictive capabilities closer to IoT edge data sources, but their efficacy requires evaluation. Here, this paper evaluates six models from three data-driven eras (statistical, machine learning, and deep learning) using agrometeorological data from an Automatic Weather Station (AWS) in Sidi Rahal, East Marrakech, central Morocco, covering 2013–2020 at half-hour intervals, including air temperature, solar radiation, and relative humidity. First, the data is quality-controlled through imputation using ERA5-Land. Then, the dataset was split into training (2013–2019) and evaluation (2020) sets, with validation horizons of 1 day, 3 days, and 1 week. Statistical models generally perform well in air temperature forecasting, occasionally surpassing other models. However, the Temporal Convolutional Neural Network (TCNN) consistently demonstrates superior performance for challenging variables, balancing low RMSE and high R2 across various horizons, with some exceptions. Specifically, for relative humidity, the linear regression model achieves slightly lower RMSE (3,96% and 6,05%) compared to TCNN (4,00% and 6,79%) for 1 day and 3 days, respectively. Additionally, CatBoost outperforms TCNN for 1-week forecasts. In terms of training time, the Transformer requires the longest, followed by AutoARIMA and CatBoost. Uncertainty analysis of stochastic models using solar radiation showed the stable performance of TCNN with 0,80 and 0,01 for the RMSE and R2 standard deviations, respectively. Considering the trade-off between performance, training time, and capturing complex relationships, TCNN emerges as the optimal choice. ANOVA, Tukey’s HSD and Mann-Whitney U statistical tests also confirmed TCNN’s performance. Finally, a comparison with the Global Forecast System (GFS) reveals TCNN’s clear superiority in all metrics, particularly evident for the RMSE of 3 days air temperature forecasts (TCNN: 1,96 °C, GFS: 3,59 °C).

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.10
自引率
9.50%
发文量
189
审稿时长
3.8 months
期刊介绍: Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas: - Spatiotemporal analysis and mapping of natural processes. - Enviroinformatics. - Environmental risk assessment, reliability analysis and decision making. - Surface and subsurface hydrology and hydraulics. - Multiphase porous media domains and contaminant transport modelling. - Hazardous waste site characterization. - Stochastic turbulence and random hydrodynamic fields. - Chaotic and fractal systems. - Random waves and seafloor morphology. - Stochastic atmospheric and climate processes. - Air pollution and quality assessment research. - Modern geostatistics. - Mechanisms of pollutant formation, emission, exposure and absorption. - Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection. - Bioinformatics. - Probabilistic methods in ecology and population biology. - Epidemiological investigations. - Models using stochastic differential equations stochastic or partial differential equations. - Hazardous waste site characterization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信