Rui Mao, Ying Shi, Tao Zhu, Jia Zhou, Junjuan Wang, Liangwei Zhao, Meiyan Liu, Xiangming Xu, Xiaoping Hu
{"title":"Machine Learning Approaches for Predicting Fusarium Head Blight Epidemic Levels from Climatological Time Series Features.","authors":"Rui Mao, Ying Shi, Tao Zhu, Jia Zhou, Junjuan Wang, Liangwei Zhao, Meiyan Liu, Xiangming Xu, Xiaoping Hu","doi":"10.1094/PHYTO-06-25-0221-FI","DOIUrl":null,"url":null,"abstract":"<p><p>Fusarium head blight (FHB), caused by the FHB species complex, is one of the most damaging diseases affecting wheat. Accurately predicting FHB occurrence prior to infection is crucial for preventing outbreaks, minimizing crop losses, and reducing the risks of mycotoxins entering the food chain. This study utilized 55 years of historical weather data and the level of the primary <i>Fusarium</i> inoculum in crop debris to predict FHB severity. Time series features were extracted from daily average temperature, relative humidity, precipitation, and sunshine hours recorded from 1 January to 31 March each year. Random forest (RF), statistical analysis, binary enumeration, and feature interpretability analysis were employed to identify features most strongly associated with FHB occurrence. Six machine learning (ML) models, including artificial neural networks (ANNs), logistic regression, <i>K</i>-nearest neighbors, support vector machine, RF, and extreme gradient boosting, were applied to the selected features to classify FHB epidemic levels. The results revealed that (i) the inclusion of inoculum strength did not significantly enhance the predictive accuracy of models based solely on climatological data; (ii) the nine complex time series features derived from the four types of climatological data were effective in classifying FHB epidemic levels through cross-regional validation by capturing critical conditions linked to the pathogen lifecycle; and (iii) ANNs outperformed the other five ML models in classifying observed FHB epidemic levels using the selected nine features. Further research should focus on applying this model to larger datasets to enhance its practical utility for FHB management.</p>","PeriodicalId":20410,"journal":{"name":"Phytopathology","volume":" ","pages":"PHYTO06250221FI"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phytopathology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1094/PHYTO-06-25-0221-FI","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Fusarium head blight (FHB), caused by the FHB species complex, is one of the most damaging diseases affecting wheat. Accurately predicting FHB occurrence prior to infection is crucial for preventing outbreaks, minimizing crop losses, and reducing the risks of mycotoxins entering the food chain. This study utilized 55 years of historical weather data and the level of the primary Fusarium inoculum in crop debris to predict FHB severity. Time series features were extracted from daily average temperature, relative humidity, precipitation, and sunshine hours recorded from 1 January to 31 March each year. Random forest (RF), statistical analysis, binary enumeration, and feature interpretability analysis were employed to identify features most strongly associated with FHB occurrence. Six machine learning (ML) models, including artificial neural networks (ANNs), logistic regression, K-nearest neighbors, support vector machine, RF, and extreme gradient boosting, were applied to the selected features to classify FHB epidemic levels. The results revealed that (i) the inclusion of inoculum strength did not significantly enhance the predictive accuracy of models based solely on climatological data; (ii) the nine complex time series features derived from the four types of climatological data were effective in classifying FHB epidemic levels through cross-regional validation by capturing critical conditions linked to the pathogen lifecycle; and (iii) ANNs outperformed the other five ML models in classifying observed FHB epidemic levels using the selected nine features. Further research should focus on applying this model to larger datasets to enhance its practical utility for FHB management.
小麦赤霉病(Fusarium head blight, FHB)是小麦最具破坏性的病害之一,由赤霉病菌种复合体引起。在感染之前准确预测食物毒素的发生对于预防暴发、最大限度地减少作物损失和减少真菌毒素进入食物链的风险至关重要。本研究利用55年的历史天气数据和作物碎屑中初级镰刀菌的接种水平来预测FHB的严重程度。时间序列特征提取自每年1月1日至3月31日的日平均气温、相对湿度、降水和日照时数。采用随机森林(RF)、统计分析、二进制枚举和特征可解释性分析来识别与FHB发生最密切相关的特征。采用人工神经网络(ANN)、逻辑回归(LR)、k近邻(KNN)、支持向量机(SVM)、RF和极端梯度增强(XGBoost)等6种机器学习(ML)模型对所选特征进行FHB流行程度分类。结果表明:(1)纳入接种量并没有显著提高单纯基于气候资料的模式的预测精度;(2)通过捕获与病原体生命周期相关的关键条件,通过跨区域验证,从4种气候数据中获得的9个复杂时间序列特征可有效分类FHB流行水平;(3) ANN在使用选定的9个特征对观察到的FHB流行水平进行分类方面优于其他5种ML模型。进一步的研究应侧重于将该模型应用于更大的数据集,以提高其在食品中毒管理中的实际效用。
期刊介绍:
Phytopathology publishes articles on fundamental research that advances understanding of the nature of plant diseases, the agents that cause them, their spread, the losses they cause, and measures that can be used to control them. Phytopathology considers manuscripts covering all aspects of plant diseases including bacteriology, host-parasite biochemistry and cell biology, biological control, disease control and pest management, description of new pathogen species description of new pathogen species, ecology and population biology, epidemiology, disease etiology, host genetics and resistance, mycology, nematology, plant stress and abiotic disorders, postharvest pathology and mycotoxins, and virology. Papers dealing mainly with taxonomy, such as descriptions of new plant pathogen taxa are acceptable if they include plant disease research results such as pathogenicity, host range, etc. Taxonomic papers that focus on classification, identification, and nomenclature below the subspecies level may also be submitted to Phytopathology.