Machine Learning predictive model of grapevine yield based on agroclimatic patterns

Q2 Engineering
Manisha S. Sirsat , João Mendes-Moreira , Carlos Ferreira , Mario Cunha
{"title":"Machine Learning predictive model of grapevine yield based on agroclimatic patterns","authors":"Manisha S. Sirsat ,&nbsp;João Mendes-Moreira ,&nbsp;Carlos Ferreira ,&nbsp;Mario Cunha","doi":"10.1016/j.eaef.2019.07.003","DOIUrl":null,"url":null,"abstract":"<div><p>Grapevine yield prediction during phenostage and particularly, before harvest is highly significant as advanced forecasting could be a great value for superior grapevine management. The main contribution of the current study is to develop predictive model for each phenology that predicts yield during growing stages of grapevine and to identify highly relevant predictive variables. Current study uses climatic conditions, grapevine yield, phenological dates, fertilizer information, soil analysis and maturation index data to construct the relational dataset. After words, we use several approaches to pre-process the data to put it into tabular format. For instance, generalization of climatic variables using phenological dates. Random Forest, LASSO and Elasticnet in generalized linear models, and Spikeslab are feature selection embedded methods which are used to overcome dataset dimensionality issue. We used 10-fold cross validation to evaluate predictive model by partitioning the dataset into training set to train the model and test set to evaluate it by calculating Root Mean Squared Error (RMSE) and Relative Root Mean Squared Error (RRMSE). Results of the study show that rf_PF, rf_PC and rf_MH are optimal models for flowering (PF), colouring (PC) and harvest (MH) phenology respectively which estimate 1484.5, 1504.2 and 1459.4 (Kg/ha) low RMSE and 24.6%, 24.9% and 24.2% RRMSE, respectively as compared to other models. These models also identify some derived climatic variables as major variables for grapevine yield prediction. The reliability and early-indication ability of these forecast models justify their use by institutions and economists in decision making, adoption of technical improvements, and fraud detection.</p></div>","PeriodicalId":38965,"journal":{"name":"Engineering in Agriculture, Environment and Food","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.eaef.2019.07.003","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering in Agriculture, Environment and Food","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1881836618302106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 20

Abstract

Grapevine yield prediction during phenostage and particularly, before harvest is highly significant as advanced forecasting could be a great value for superior grapevine management. The main contribution of the current study is to develop predictive model for each phenology that predicts yield during growing stages of grapevine and to identify highly relevant predictive variables. Current study uses climatic conditions, grapevine yield, phenological dates, fertilizer information, soil analysis and maturation index data to construct the relational dataset. After words, we use several approaches to pre-process the data to put it into tabular format. For instance, generalization of climatic variables using phenological dates. Random Forest, LASSO and Elasticnet in generalized linear models, and Spikeslab are feature selection embedded methods which are used to overcome dataset dimensionality issue. We used 10-fold cross validation to evaluate predictive model by partitioning the dataset into training set to train the model and test set to evaluate it by calculating Root Mean Squared Error (RMSE) and Relative Root Mean Squared Error (RRMSE). Results of the study show that rf_PF, rf_PC and rf_MH are optimal models for flowering (PF), colouring (PC) and harvest (MH) phenology respectively which estimate 1484.5, 1504.2 and 1459.4 (Kg/ha) low RMSE and 24.6%, 24.9% and 24.2% RRMSE, respectively as compared to other models. These models also identify some derived climatic variables as major variables for grapevine yield prediction. The reliability and early-indication ability of these forecast models justify their use by institutions and economists in decision making, adoption of technical improvements, and fraud detection.

基于农业气候模式的葡萄产量机器学习预测模型
葡萄在表型期,特别是收获前的产量预测非常重要,因为先进的预测对葡萄的优质管理有很大的价值。本研究的主要贡献是建立了葡萄各物候期产量预测模型,并确定了高度相关的预测变量。目前研究利用气候条件、葡萄产量、物候日期、肥料信息、土壤分析和成熟指数数据构建相关数据集。在单词之后,我们使用几种方法对数据进行预处理,将其放入表格格式。例如,利用物候日期概括气候变量。随机森林、广义线性模型中的LASSO和Elasticnet以及Spikeslab是用来克服数据集维数问题的特征选择嵌入方法。我们通过将数据集划分为训练集来训练模型,并通过计算均方根误差(RMSE)和相对均方根误差(RRMSE)来评估测试集,使用10倍交叉验证来评估预测模型。结果表明,rf_PF、rf_PC和rf_MH分别是开花(PF)、显色(PC)和收获(MH)物候的最优模型,其RMSE值分别为1484.5、1504.2和1459.4 (Kg/ha), RRMSE值分别为24.6%、24.9%和24.2%。这些模型还确定了一些衍生的气候变量作为葡萄产量预测的主要变量。这些预测模型的可靠性和早期指示能力证明了机构和经济学家在决策、采用技术改进和欺诈检测中使用它们的合理性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering in Agriculture, Environment and Food
Engineering in Agriculture, Environment and Food Engineering-Industrial and Manufacturing Engineering
CiteScore
1.00
自引率
0.00%
发文量
4
期刊介绍: Engineering in Agriculture, Environment and Food (EAEF) is devoted to the advancement and dissemination of scientific and technical knowledge concerning agricultural machinery, tillage, terramechanics, precision farming, agricultural instrumentation, sensors, bio-robotics, systems automation, processing of agricultural products and foods, quality evaluation and food safety, waste treatment and management, environmental control, energy utilization agricultural systems engineering, bio-informatics, computer simulation, computational mechanics, farm work systems and mechanized cropping. It is an international English E-journal published and distributed by the Asian Agricultural and Biological Engineering Association (AABEA). Authors should submit the manuscript file written by MS Word through a web site. The manuscript must be approved by the author''s organization prior to submission if required. Contact the societies which you belong to, if you have any question on manuscript submission or on the Journal EAEF.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信