Durum wheat yield forecasting using machine learning

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY
Nabila Chergui
{"title":"Durum wheat yield forecasting using machine learning","authors":"Nabila Chergui","doi":"10.1016/j.aiia.2022.09.003","DOIUrl":null,"url":null,"abstract":"<div><p>A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector. Machine learning approaches allow for building such predictive models, but the quality of predictions decreases if data is scarce. In this work, we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria. We first increased the dimension of each data set by adding more features, and then we augmented the size of the data by merging the two data sets. To assess the effectiveness of data-augmentation approaches, we conducted three sets of experiments based on three data sets: the primary data sets, data sets with additional features and the augmented data sets obtained by merging, using five regression models (Support Vector Regression, Random Forest, Extreme Learning Machine, Artificial Neural Network, Deep Neural Network). To evaluate the models, we used cross-validation; the results showed an overall increase in performance with the augmented data. DNN outperformed the other models for the first Province with a Root Mean Square Error (RMSE) of 0.04 q/ha and R_Squared (<em>R</em><sup>2</sup>) of 0.96, whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.</p></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"6 ","pages":"Pages 156-166"},"PeriodicalIF":8.2000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2589721722000137/pdfft?md5=4964a697dabfe27531e6ff34bdc2d2dd&pid=1-s2.0-S2589721722000137-main.pdf","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Agriculture","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589721722000137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 5

Abstract

A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector. Machine learning approaches allow for building such predictive models, but the quality of predictions decreases if data is scarce. In this work, we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria. We first increased the dimension of each data set by adding more features, and then we augmented the size of the data by merging the two data sets. To assess the effectiveness of data-augmentation approaches, we conducted three sets of experiments based on three data sets: the primary data sets, data sets with additional features and the augmented data sets obtained by merging, using five regression models (Support Vector Regression, Random Forest, Extreme Learning Machine, Artificial Neural Network, Deep Neural Network). To evaluate the models, we used cross-validation; the results showed an overall increase in performance with the augmented data. DNN outperformed the other models for the first Province with a Root Mean Square Error (RMSE) of 0.04 q/ha and R_Squared (R2) of 0.96, whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.

利用机器学习预测硬粒小麦产量
一个可靠和准确的作物产量预测模型对于每个农业部门的有效决策至关重要。机器学习方法允许建立这样的预测模型,但如果数据稀缺,预测的质量会下降。在这项工作中,我们建议在阿尔及利亚两个不同省份的小数据集存在的情况下,对小麦产量预测进行数据增强。我们首先通过添加更多的特征来增加每个数据集的维度,然后通过合并两个数据集来增加数据的大小。为了评估数据增强方法的有效性,我们使用五种回归模型(支持向量回归、随机森林、极限学习机、人工神经网络、深度神经网络),基于三个数据集进行了三组实验:原始数据集、附加特征数据集和合并后的增强数据集。为了评估模型,我们使用交叉验证;结果显示,随着数据的增强,性能总体上有所提高。DNN在第一个省的表现优于其他模型,RMSE为0.04 q/ha, R_Squared (R2)为0.96,而随机森林在第二个省的表现优于其他模型,RMSE为0.05 q/ha。本研究提出的数据增强方法取得了令人鼓舞的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Artificial Intelligence in Agriculture
Artificial Intelligence in Agriculture Engineering-Engineering (miscellaneous)
CiteScore
21.60
自引率
0.00%
发文量
18
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信