Grape vine (Vitis vinifera) yield prediction using optimized weighted ensemble machine learning approach

IF 5.7 Q1 AGRICULTURAL ENGINEERING

Smart agricultural technology Pub Date : 2025-07-01 DOI:10.1016/j.atech.2025.101151

Nobin Chandra Paul, Pratapsingh S. Khapte, Navyasree Ponnaganti, Sushil S. Changan, Sangram B. Chavan, K. Ravi Kumar, Dhananjay D. Nangare, K. Sammi Reddy

{"title":"Grape vine (Vitis vinifera) yield prediction using optimized weighted ensemble machine learning approach","authors":"Nobin Chandra Paul, Pratapsingh S. Khapte, Navyasree Ponnaganti, Sushil S. Changan, Sangram B. Chavan, K. Ravi Kumar, Dhananjay D. Nangare, K. Sammi Reddy","doi":"10.1016/j.atech.2025.101151","DOIUrl":null,"url":null,"abstract":"<div><div>Grape vine (<em>Vitis vinifera</em>) plays a significant role in the agricultural industry, contributing substantially to the global economy through the production of table grapes, wine, and raisins. With increasing demand for high-quality grapes, both for domestic consumption and export, there is a pressing need to improve yield prediction models for better resource management. In this study, we propose an optimized weighted ensemble machine learning approach for predicting grape vine yield, integrating multiple morphological, physiological, and berry quality parameters. A diverse set of machine learning (ML) models, including Random Forest (RF), Artificial Neural Network (ANN), Extreme Gradient Boosting (XgBoost), Support Vector Regression (SVR), Gaussian Process Regression (GPR), Cubist and Multivariate Adaptive Regression Splines (MARS), were employed to model the grapevine yield. A Minimum Data Set (MDS) selection was performed using Principal Component Analysis (PCA), followed by data normalization to enhance model efficiency. Additionally, three ensemble approaches-Simple Averaging, Weighted Averaging, and Ridge Regression-based ensemble models were implemented to improve prediction accuracy. The dataset was divided into training and testing subsets, with hyperparameters of each model tuned using repeated k-fold cross-validation. The ensemble approach demonstrated superior performance, with improved accuracy in yield prediction compared to individual base models. This study highlights the effectiveness of ensemble learning in precision viticulture, offering a reliable framework for yield prediction in grapevine cultivation. The proposed approach offers a practical framework for vineyard managers and growers to optimize resource allocation and improve decision-making.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"12 ","pages":"Article 101151"},"PeriodicalIF":5.7000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375525003831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Grape vine (Vitis vinifera) plays a significant role in the agricultural industry, contributing substantially to the global economy through the production of table grapes, wine, and raisins. With increasing demand for high-quality grapes, both for domestic consumption and export, there is a pressing need to improve yield prediction models for better resource management. In this study, we propose an optimized weighted ensemble machine learning approach for predicting grape vine yield, integrating multiple morphological, physiological, and berry quality parameters. A diverse set of machine learning (ML) models, including Random Forest (RF), Artificial Neural Network (ANN), Extreme Gradient Boosting (XgBoost), Support Vector Regression (SVR), Gaussian Process Regression (GPR), Cubist and Multivariate Adaptive Regression Splines (MARS), were employed to model the grapevine yield. A Minimum Data Set (MDS) selection was performed using Principal Component Analysis (PCA), followed by data normalization to enhance model efficiency. Additionally, three ensemble approaches-Simple Averaging, Weighted Averaging, and Ridge Regression-based ensemble models were implemented to improve prediction accuracy. The dataset was divided into training and testing subsets, with hyperparameters of each model tuned using repeated k-fold cross-validation. The ensemble approach demonstrated superior performance, with improved accuracy in yield prediction compared to individual base models. This study highlights the effectiveness of ensemble learning in precision viticulture, offering a reliable framework for yield prediction in grapevine cultivation. The proposed approach offers a practical framework for vineyard managers and growers to optimize resource allocation and improve decision-making.

Abstract Image

查看原文本刊更多论文

利用优化加权集成机器学习方法预测葡萄产量

葡萄藤（Vitis vinifera）在农业中扮演着重要的角色，通过生产鲜食葡萄、葡萄酒和葡萄干，为全球经济做出了重大贡献。随着国内消费和出口对优质葡萄的需求不断增加，迫切需要改进产量预测模型，以便更好地进行资源管理。在这项研究中，我们提出了一种优化的加权集成机器学习方法，用于预测葡萄藤产量，整合多种形态，生理和浆果质量参数。采用随机森林（RF）、人工神经网络（ANN）、极端梯度增强（XgBoost）、支持向量回归（SVR）、高斯过程回归（GPR）、立体主义和多元自适应回归样条（MARS）等多种机器学习（ML）模型对葡萄产量进行建模。使用主成分分析（PCA）进行最小数据集（MDS）选择，然后进行数据归一化以提高模型效率。此外，为了提高预测精度，采用了简单平均、加权平均和基于Ridge回归的三种集成方法。将数据集分为训练子集和测试子集，每个模型的超参数使用重复的k-fold交叉验证进行调整。与单个基本模型相比，集成方法在产率预测方面具有更高的准确性。该研究强调了集成学习在葡萄精确栽培中的有效性，为葡萄种植产量预测提供了可靠的框架。提出的方法为葡萄园管理者和种植者优化资源配置和改进决策提供了一个实用的框架。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Smart agricultural technology

CiteScore

4.20

自引率

0.00%

发文量