Impacts of meteorological variables and machine learning algorithms on rice yield prediction in Korea

IF 3 3区 地球科学 Q2 BIOPHYSICS
Subin Ha, Yong-Tak Kim, Eun-Soon Im, Jina Hur, Sera Jo, Yong-Seok Kim, Kyo‑Moon Shim
{"title":"Impacts of meteorological variables and machine learning algorithms on rice yield prediction in Korea","authors":"Subin Ha,&nbsp;Yong-Tak Kim,&nbsp;Eun-Soon Im,&nbsp;Jina Hur,&nbsp;Sera Jo,&nbsp;Yong-Seok Kim,&nbsp;Kyo‑Moon Shim","doi":"10.1007/s00484-023-02544-x","DOIUrl":null,"url":null,"abstract":"<div><p>As crop productivity is greatly influenced by weather conditions, many attempts have been made to estimate crop yields using meteorological data and have achieved great progress with the development of machine learning. However, most yield prediction models are developed based on observational data, and the utilization of climate model output in yield prediction has been addressed in very few studies. In this study, we estimate rice yields in South Korea using the meteorological variables provided by ERA5 reanalysis data (ERA-O) and its dynamically downscaled data (ERA-DS). After ERA-O and ERA-DS are validated against observations (OBS), two different machine learning models, Support Vector Machine (SVM) and Long Short-Term Memory (LSTM), are trained with different combinations of eight meteorological variables (mean temperature, maximum temperature, minimum temperature, precipitation, diurnal temperature range, solar irradiance, mean wind speed, and relative humidity) obtained from OBS, ERA-O, and ERA-DS at weekly and monthly timescales from May to September. Regardless of the model type and the source of the input data, training a model with weekly datasets leads to better yield estimates compared to monthly datasets. LSTM generally outperforms SVM, especially when the model is trained with ERA-DS data at a weekly timescale. The best yield estimates are produced by the LSTM model trained with all eight variables at a weekly timescale. Altogether this study shows the significance of high spatial and temporal resolution of input meteorological data in yield prediction, which can also serve to substantiate the added value of dynamical downscaling.</p></div>","PeriodicalId":588,"journal":{"name":"International Journal of Biometeorology","volume":"67 11","pages":"1825 - 1838"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biometeorology","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s00484-023-02544-x","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

As crop productivity is greatly influenced by weather conditions, many attempts have been made to estimate crop yields using meteorological data and have achieved great progress with the development of machine learning. However, most yield prediction models are developed based on observational data, and the utilization of climate model output in yield prediction has been addressed in very few studies. In this study, we estimate rice yields in South Korea using the meteorological variables provided by ERA5 reanalysis data (ERA-O) and its dynamically downscaled data (ERA-DS). After ERA-O and ERA-DS are validated against observations (OBS), two different machine learning models, Support Vector Machine (SVM) and Long Short-Term Memory (LSTM), are trained with different combinations of eight meteorological variables (mean temperature, maximum temperature, minimum temperature, precipitation, diurnal temperature range, solar irradiance, mean wind speed, and relative humidity) obtained from OBS, ERA-O, and ERA-DS at weekly and monthly timescales from May to September. Regardless of the model type and the source of the input data, training a model with weekly datasets leads to better yield estimates compared to monthly datasets. LSTM generally outperforms SVM, especially when the model is trained with ERA-DS data at a weekly timescale. The best yield estimates are produced by the LSTM model trained with all eight variables at a weekly timescale. Altogether this study shows the significance of high spatial and temporal resolution of input meteorological data in yield prediction, which can also serve to substantiate the added value of dynamical downscaling.

Abstract Image

气象变量和机器学习算法对韩国水稻产量预测的影响。
由于作物生产力在很大程度上受到天气条件的影响,人们已经尝试使用气象数据来估计作物产量,并随着机器学习的发展取得了巨大进展。然而,大多数产量预测模型都是基于观测数据开发的,很少有研究涉及气候模型输出在产量预测中的应用。在本研究中,我们使用ERA5再分析数据(ERA-O)及其动态缩减数据(ERA-DS)提供的气象变量来估计韩国的水稻产量。在ERA-O和ERA-DS根据观察结果(OBS)进行验证后,两种不同的机器学习模型,支持向量机(SVM)和长短期记忆(LSTM),使用从OBS、ERA-O和ERA-DS获得的八个气象变量(平均温度、最高温度、最低温度、降水量、昼夜温度范围、太阳辐照度、平均风速和相对湿度)的不同组合,在5月至9月的每周和每月时间尺度上进行训练。无论模型类型和输入数据的来源如何,与月度数据集相比,用每周数据集训练模型可以获得更好的产量估计。LSTM通常优于SVM,尤其是当模型在每周的时间尺度上使用ERA-DS数据进行训练时。最佳产量估计是由LSTM模型在每周的时间尺度上用所有八个变量进行训练产生的。总之,本研究表明了输入气象数据的高时空分辨率在产量预测中的重要性,这也有助于证实动态降尺度的附加值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.40
自引率
9.40%
发文量
183
审稿时长
1 months
期刊介绍: The Journal publishes original research papers, review articles and short communications on studies examining the interactions between living organisms and factors of the natural and artificial atmospheric environment. Living organisms extend from single cell organisms, to plants and animals, including humans. The atmospheric environment includes climate and weather, electromagnetic radiation, and chemical and biological pollutants. The journal embraces basic and applied research and practical aspects such as living conditions, agriculture, forestry, and health. The journal is published for the International Society of Biometeorology, and most membership categories include a subscription to the Journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信