Grape vine (Vitis vinifera) yield prediction using optimized weighted ensemble machine learning approach

IF 5.7 Q1 AGRICULTURAL ENGINEERING
Nobin Chandra Paul, Pratapsingh S. Khapte, Navyasree Ponnaganti, Sushil S. Changan, Sangram B. Chavan, K. Ravi Kumar, Dhananjay D. Nangare, K. Sammi Reddy
{"title":"Grape vine (Vitis vinifera) yield prediction using optimized weighted ensemble machine learning approach","authors":"Nobin Chandra Paul,&nbsp;Pratapsingh S. Khapte,&nbsp;Navyasree Ponnaganti,&nbsp;Sushil S. Changan,&nbsp;Sangram B. Chavan,&nbsp;K. Ravi Kumar,&nbsp;Dhananjay D. Nangare,&nbsp;K. Sammi Reddy","doi":"10.1016/j.atech.2025.101151","DOIUrl":null,"url":null,"abstract":"<div><div>Grape vine (<em>Vitis vinifera</em>) plays a significant role in the agricultural industry, contributing substantially to the global economy through the production of table grapes, wine, and raisins. With increasing demand for high-quality grapes, both for domestic consumption and export, there is a pressing need to improve yield prediction models for better resource management. In this study, we propose an optimized weighted ensemble machine learning approach for predicting grape vine yield, integrating multiple morphological, physiological, and berry quality parameters. A diverse set of machine learning (ML) models, including Random Forest (RF), Artificial Neural Network (ANN), Extreme Gradient Boosting (XgBoost), Support Vector Regression (SVR), Gaussian Process Regression (GPR), Cubist and Multivariate Adaptive Regression Splines (MARS), were employed to model the grapevine yield. A Minimum Data Set (MDS) selection was performed using Principal Component Analysis (PCA), followed by data normalization to enhance model efficiency. Additionally, three ensemble approaches-Simple Averaging, Weighted Averaging, and Ridge Regression-based ensemble models were implemented to improve prediction accuracy. The dataset was divided into training and testing subsets, with hyperparameters of each model tuned using repeated k-fold cross-validation. The ensemble approach demonstrated superior performance, with improved accuracy in yield prediction compared to individual base models. This study highlights the effectiveness of ensemble learning in precision viticulture, offering a reliable framework for yield prediction in grapevine cultivation. The proposed approach offers a practical framework for vineyard managers and growers to optimize resource allocation and improve decision-making.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"12 ","pages":"Article 101151"},"PeriodicalIF":5.7000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375525003831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

Grape vine (Vitis vinifera) plays a significant role in the agricultural industry, contributing substantially to the global economy through the production of table grapes, wine, and raisins. With increasing demand for high-quality grapes, both for domestic consumption and export, there is a pressing need to improve yield prediction models for better resource management. In this study, we propose an optimized weighted ensemble machine learning approach for predicting grape vine yield, integrating multiple morphological, physiological, and berry quality parameters. A diverse set of machine learning (ML) models, including Random Forest (RF), Artificial Neural Network (ANN), Extreme Gradient Boosting (XgBoost), Support Vector Regression (SVR), Gaussian Process Regression (GPR), Cubist and Multivariate Adaptive Regression Splines (MARS), were employed to model the grapevine yield. A Minimum Data Set (MDS) selection was performed using Principal Component Analysis (PCA), followed by data normalization to enhance model efficiency. Additionally, three ensemble approaches-Simple Averaging, Weighted Averaging, and Ridge Regression-based ensemble models were implemented to improve prediction accuracy. The dataset was divided into training and testing subsets, with hyperparameters of each model tuned using repeated k-fold cross-validation. The ensemble approach demonstrated superior performance, with improved accuracy in yield prediction compared to individual base models. This study highlights the effectiveness of ensemble learning in precision viticulture, offering a reliable framework for yield prediction in grapevine cultivation. The proposed approach offers a practical framework for vineyard managers and growers to optimize resource allocation and improve decision-making.

Abstract Image

利用优化加权集成机器学习方法预测葡萄产量
葡萄藤(Vitis vinifera)在农业中扮演着重要的角色,通过生产鲜食葡萄、葡萄酒和葡萄干,为全球经济做出了重大贡献。随着国内消费和出口对优质葡萄的需求不断增加,迫切需要改进产量预测模型,以便更好地进行资源管理。在这项研究中,我们提出了一种优化的加权集成机器学习方法,用于预测葡萄藤产量,整合多种形态,生理和浆果质量参数。采用随机森林(RF)、人工神经网络(ANN)、极端梯度增强(XgBoost)、支持向量回归(SVR)、高斯过程回归(GPR)、立体主义和多元自适应回归样条(MARS)等多种机器学习(ML)模型对葡萄产量进行建模。使用主成分分析(PCA)进行最小数据集(MDS)选择,然后进行数据归一化以提高模型效率。此外,为了提高预测精度,采用了简单平均、加权平均和基于Ridge回归的三种集成方法。将数据集分为训练子集和测试子集,每个模型的超参数使用重复的k-fold交叉验证进行调整。与单个基本模型相比,集成方法在产率预测方面具有更高的准确性。该研究强调了集成学习在葡萄精确栽培中的有效性,为葡萄种植产量预测提供了可靠的框架。提出的方法为葡萄园管理者和种植者优化资源配置和改进决策提供了一个实用的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信