Boosting grapevine phenological stages prediction based on climatic data by pseudo-labeling approach

IF 12.4 Q1 AGRICULTURE, MULTIDISCIPLINARY
Mehdi Fasihi , Mirko Sodini , Alex Falcon , Francesco Degano , Paolo Sivilotti , Giuseppe Serra
{"title":"Boosting grapevine phenological stages prediction based on climatic data by pseudo-labeling approach","authors":"Mehdi Fasihi ,&nbsp;Mirko Sodini ,&nbsp;Alex Falcon ,&nbsp;Francesco Degano ,&nbsp;Paolo Sivilotti ,&nbsp;Giuseppe Serra","doi":"10.1016/j.aiia.2025.03.003","DOIUrl":null,"url":null,"abstract":"<div><div>Predicting grapevine phenological stages (GPHS) is critical for precisely managing vineyard operations, including plant disease treatments, pruning, and harvest. Solutions commonly used to address viticulture challenges rely on image processing techniques, which have achieved significant results. However, they require the installation of dedicated hardware in the vineyard, making it invasive and difficult to maintain. Moreover, accurate prediction is influenced by the interplay of climatic factors, especially temperature, and the impact of global warming, which are difficult to model using images. Another problem frequently found in GPHS prediction is the persistent issue of missing values in viticultural datasets, particularly in phenological stages. This paper proposes a semi-supervised approach that begins with a small set of labeled phenological stage examples and automatically generates new annotations for large volumes of unlabeled climatic data. This approach aims to address key challenges in phenological analysis. This novel climatic data-based approach offers advantages over common image processing methods, as it is non-intrusive, cost-effective, and adaptable for vineyards of various sizes and technological levels. To ensure the robustness of the proposed Pseudo-labelling strategy, we integrated it into eight machine-learning algorithms. We evaluated its performance across seven diverse datasets, each exhibiting varying percentages of missing values. Performance metrics, including the coefficient of determination (R<sup>2</sup>) and root-mean-square error (RMSE), are employed to assess the effectiveness of the models. The study demonstrates that integrating the proposed Pseudo-labeling strategy with supervised learning approaches significantly improves predictive accuracy. Moreover, the study shows that the proposed methodology can also be integrated with explainable artificial intelligence techniques to determine the importance of the input features. In particular, the investigation highlights that growing degree days are crucial for improved GPHS prediction.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 3","pages":"Pages 550-563"},"PeriodicalIF":12.4000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Agriculture","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589721725000303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting grapevine phenological stages (GPHS) is critical for precisely managing vineyard operations, including plant disease treatments, pruning, and harvest. Solutions commonly used to address viticulture challenges rely on image processing techniques, which have achieved significant results. However, they require the installation of dedicated hardware in the vineyard, making it invasive and difficult to maintain. Moreover, accurate prediction is influenced by the interplay of climatic factors, especially temperature, and the impact of global warming, which are difficult to model using images. Another problem frequently found in GPHS prediction is the persistent issue of missing values in viticultural datasets, particularly in phenological stages. This paper proposes a semi-supervised approach that begins with a small set of labeled phenological stage examples and automatically generates new annotations for large volumes of unlabeled climatic data. This approach aims to address key challenges in phenological analysis. This novel climatic data-based approach offers advantages over common image processing methods, as it is non-intrusive, cost-effective, and adaptable for vineyards of various sizes and technological levels. To ensure the robustness of the proposed Pseudo-labelling strategy, we integrated it into eight machine-learning algorithms. We evaluated its performance across seven diverse datasets, each exhibiting varying percentages of missing values. Performance metrics, including the coefficient of determination (R2) and root-mean-square error (RMSE), are employed to assess the effectiveness of the models. The study demonstrates that integrating the proposed Pseudo-labeling strategy with supervised learning approaches significantly improves predictive accuracy. Moreover, the study shows that the proposed methodology can also be integrated with explainable artificial intelligence techniques to determine the importance of the input features. In particular, the investigation highlights that growing degree days are crucial for improved GPHS prediction.
伪标记法提高葡萄物候期预测的气候数据
预测葡萄物候阶段(GPHS)是精确管理葡萄园操作,包括植物病害治疗,修剪和收获的关键。通常用于解决葡萄栽培挑战的解决方案依赖于图像处理技术,该技术已经取得了显著的成果。然而,它们需要在葡萄园中安装专用硬件,使其具有侵入性且难以维护。此外,准确的预测受到气候因素,特别是温度和全球变暖的影响的相互作用的影响,这些因素很难利用图像进行建模。在GPHS预测中经常发现的另一个问题是葡萄栽培数据集中持续存在的缺失值问题,特别是在物候阶段。本文提出了一种半监督方法,该方法从一小组标记物候阶段示例开始,并为大量未标记的气候数据自动生成新的注释。这种方法旨在解决物候分析中的关键挑战。这种新颖的基于气候数据的方法比普通的图像处理方法具有优势,因为它是非侵入性的,具有成本效益,并且适用于各种规模和技术水平的葡萄园。为了确保提出的伪标签策略的鲁棒性,我们将其集成到八种机器学习算法中。我们在七个不同的数据集上评估了它的性能,每个数据集都显示了不同的缺失值百分比。采用决策系数(R2)和均方根误差(RMSE)等绩效指标来评估模型的有效性。研究表明,将伪标注策略与监督学习方法相结合,可以显著提高预测精度。此外,研究表明,所提出的方法也可以与可解释的人工智能技术相结合,以确定输入特征的重要性。该调查特别强调,生长度日对于改进GPHS预测至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Artificial Intelligence in Agriculture
Artificial Intelligence in Agriculture Engineering-Engineering (miscellaneous)
CiteScore
21.60
自引率
0.00%
发文量
18
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信