Zhenfei Zhang , Jing Guo , Yingzhi Gao , Fei Zhang , Zhengqing Hou , Qi An , An Yan , Lei Zhang
{"title":"Increasing yield estimation accuracy for individual apple trees via ensemble learning and growth stage stacking","authors":"Zhenfei Zhang , Jing Guo , Yingzhi Gao , Fei Zhang , Zhengqing Hou , Qi An , An Yan , Lei Zhang","doi":"10.1016/j.compag.2025.110648","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate prediction of individual apple tree yields during the preharvest stage is essential for precision orchard management and market planning. However, systematic studies focusing on apple yield estimation are scarce. To address this gap, this study targets Fuji apples in the Aksu region of Xinjiang. Multiple images were captured via a UAV during four key growth stages: flowering, fruit formation, fruit expansion, and ripening. On the basis of the extracted vegetation indices, we first developed yield estimation models using random forest (RF), support vector regression (SVR), partial least squares regression (PLS), and ridge regression (RR) methods. We subsequently combined these four models to construct a stacking ensemble learning (SEL) model. To further increase the accuracy of apple yield estimation, we refined the growth stage stacking method and developed a new model, the growth stage stacking ensemble (GSSE). This model maximises the use of spectral information from multiple apple growth stages by employing various machine learning algorithms and integrating multistage spectral data to improve yield estimation accuracy. The results indicate that the optimal period for yield estimation occurs during the fruit expansion stage, with the support vector regression (SVR) model achieving the best performance (R<sup>2</sup> = 0.654, RMSE = 5.307 kg). Compared with individual machine learning models, the SEL approach enhances yield estimation accuracy, reaching a maximum R<sup>2</sup> of 0.686 and an RMSE of 5.058 kg. Furthermore, GSSE significantly enhanced accuracy compared with the single-growth stage estimation models and SEL, with the combination of fruit expansion and fruit ripening stages yielding the best results, with an R<sup>2</sup> of 0.759 and an RMSE of 4.431 kg, with the fruit expansion stage contributing the most. This study is the first to apply the GSSE to apple yield estimation, offering novel insights for UAV-based apple yield estimation.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"237 ","pages":"Article 110648"},"PeriodicalIF":8.9000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925007549","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction of individual apple tree yields during the preharvest stage is essential for precision orchard management and market planning. However, systematic studies focusing on apple yield estimation are scarce. To address this gap, this study targets Fuji apples in the Aksu region of Xinjiang. Multiple images were captured via a UAV during four key growth stages: flowering, fruit formation, fruit expansion, and ripening. On the basis of the extracted vegetation indices, we first developed yield estimation models using random forest (RF), support vector regression (SVR), partial least squares regression (PLS), and ridge regression (RR) methods. We subsequently combined these four models to construct a stacking ensemble learning (SEL) model. To further increase the accuracy of apple yield estimation, we refined the growth stage stacking method and developed a new model, the growth stage stacking ensemble (GSSE). This model maximises the use of spectral information from multiple apple growth stages by employing various machine learning algorithms and integrating multistage spectral data to improve yield estimation accuracy. The results indicate that the optimal period for yield estimation occurs during the fruit expansion stage, with the support vector regression (SVR) model achieving the best performance (R2 = 0.654, RMSE = 5.307 kg). Compared with individual machine learning models, the SEL approach enhances yield estimation accuracy, reaching a maximum R2 of 0.686 and an RMSE of 5.058 kg. Furthermore, GSSE significantly enhanced accuracy compared with the single-growth stage estimation models and SEL, with the combination of fruit expansion and fruit ripening stages yielding the best results, with an R2 of 0.759 and an RMSE of 4.431 kg, with the fruit expansion stage contributing the most. This study is the first to apply the GSSE to apple yield estimation, offering novel insights for UAV-based apple yield estimation.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.