Machine learning techniques for tomato yield prediction: A comprehensive analysis

IF 5.7 Q1 AGRICULTURAL ENGINEERING

Smart agricultural technology Pub Date : 2025-06-06 DOI:10.1016/j.atech.2025.101067

Kodjo Abel Odah , Sèton Calmette Ariane Houetohossou , Vinasetan Ratheil Houndji , Romain Lucas Glèlè Kakaï

{"title":"Machine learning techniques for tomato yield prediction: A comprehensive analysis","authors":"Kodjo Abel Odah , Sèton Calmette Ariane Houetohossou , Vinasetan Ratheil Houndji , Romain Lucas Glèlè Kakaï","doi":"10.1016/j.atech.2025.101067","DOIUrl":null,"url":null,"abstract":"<div><div>Effective yield prediction is crucial for farmers and the agricultural sector. It allows producers to enhance control over their operations and better align with market supply and demand. With the emergence of Artificial Intelligence (AI), various Machine Learning (ML) models have been developed to predict crop yield. In this study, we conducted a systematic literature review to examine the ML models used for predicting tomato yield, the features associated with the most effective models, and the challenges faced by users. We retrieved 1,486 scientific papers from six electronic databases. Following the PRISMA guidelines, we included 57 studies in our analysis. The results showed that 66.67% of the models achieving the best performance in predicting or estimating tomato yield are Deep Learning (DL) models, with neural networks accounting for 42.11% of these. Specifically, Long Short-Term Memory (LSTM), Artificial Neural Networks (ANN), and Support Vector Regression (SVR) are the models most commonly used, demonstrating strong performance when considering factors such as climate, soil conditions, plant growth, fertilization, and irrigation. Additionally, when using computed vegetation indices from image data, Random Forest Regression (RFR) is frequently applied with notable success. The YOLO-Tomato and R-CNN methods are commonly used for detecting tomato fruits prior to yield estimation. Furthermore, DeepSort and linear regression are the predominant methods employed for counting and estimating tomato yield. For future research, it is important to conduct a comparative analysis of models such as LSTM, ANN, SVR, and RFR specifically for predicting tomato yield using data from Africa.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"12 ","pages":"Article 101067"},"PeriodicalIF":5.7000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375525003004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Effective yield prediction is crucial for farmers and the agricultural sector. It allows producers to enhance control over their operations and better align with market supply and demand. With the emergence of Artificial Intelligence (AI), various Machine Learning (ML) models have been developed to predict crop yield. In this study, we conducted a systematic literature review to examine the ML models used for predicting tomato yield, the features associated with the most effective models, and the challenges faced by users. We retrieved 1,486 scientific papers from six electronic databases. Following the PRISMA guidelines, we included 57 studies in our analysis. The results showed that 66.67% of the models achieving the best performance in predicting or estimating tomato yield are Deep Learning (DL) models, with neural networks accounting for 42.11% of these. Specifically, Long Short-Term Memory (LSTM), Artificial Neural Networks (ANN), and Support Vector Regression (SVR) are the models most commonly used, demonstrating strong performance when considering factors such as climate, soil conditions, plant growth, fertilization, and irrigation. Additionally, when using computed vegetation indices from image data, Random Forest Regression (RFR) is frequently applied with notable success. The YOLO-Tomato and R-CNN methods are commonly used for detecting tomato fruits prior to yield estimation. Furthermore, DeepSort and linear regression are the predominant methods employed for counting and estimating tomato yield. For future research, it is important to conduct a comparative analysis of models such as LSTM, ANN, SVR, and RFR specifically for predicting tomato yield using data from Africa.

查看原文本刊更多论文

番茄产量预测的机器学习技术：综合分析

有效的产量预测对农民和农业部门至关重要。它使生产商能够加强对其运营的控制，并更好地与市场供需保持一致。随着人工智能（AI）的出现，人们开发了各种机器学习（ML）模型来预测作物产量。在这项研究中，我们进行了系统的文献综述，以检查用于预测番茄产量的ML模型，与最有效模型相关的特征，以及用户面临的挑战。我们从6个电子数据库中检索了1486篇科学论文。按照PRISMA指南，我们在分析中纳入了57项研究。结果表明，在预测或估计番茄产量方面，66.67%的模型是深度学习（DL）模型，其中神经网络模型占42.11%。具体来说，长短期记忆（LSTM）、人工神经网络（ANN）和支持向量回归（SVR）是最常用的模型，在考虑气候、土壤条件、植物生长、施肥和灌溉等因素时表现出很强的性能。此外，当使用从图像数据中计算的植被指数时，随机森林回归（RFR）经常得到显著的成功应用。YOLO-Tomato和R-CNN方法通常用于在产量估计之前检测番茄果实。此外，深度排序和线性回归是用于计数和估计番茄产量的主要方法。对于未来的研究，重要的是对LSTM、ANN、SVR和RFR等模型进行比较分析，专门用于利用非洲数据预测番茄产量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Smart agricultural technology

CiteScore

4.20

自引率

0.00%

发文量