Marwa Radwan, Amel Ali Alhussan, Abdelhameed Ibrahim, Sayed M. Tawfeek
{"title":"利用优化的机器学习模型和特征选择技术进行马铃薯叶病分类","authors":"Marwa Radwan, Amel Ali Alhussan, Abdelhameed Ibrahim, Sayed M. Tawfeek","doi":"10.1007/s11540-024-09763-8","DOIUrl":null,"url":null,"abstract":"<p>The diseases that particularly affect potato leaves are early blight and the late blight, and they are dangerous as they reduce yield and quality of the potatoes. In this paper, different machine learning (ML) models for predicting these diseases are analysed based on a detailed database of more than 4000 records of weather conditions. Some of the critical factors that have been investigated to determine correlations with disease prevalence include temperature, humidity, wind speed, and atmospheric pressure. These types of data relationships were comprehensively identified through sophisticated means of analysis such as <i>K</i>-means clustering, PCA, and copula analysis. To achieve this, several machine learning models were used in the study: logistic regression, gradient boosting, multilayer perceptron (MLP), and support vector machine (SVM), as well as <i>K</i>-nearest neighbor (KNN) models both with and without feature selection. Feature selection methods such as the binary Greylag Goose Optimization (bGGO) were applied to improve the predictive performance of the models by identifying feature sets pertinent to the models. Results demonstrated that the MLP model, with feature selection, achieved an accuracy of 98.3%, underscoring the critical role of feature selection in improving model performance. These findings highlight the importance of optimized ML models in proactive agricultural disease management, aiming to minimize crop loss and promote sustainable farming practices.</p>","PeriodicalId":20378,"journal":{"name":"Potato Research","volume":"1045 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Potato Leaf Disease Classification Using Optimized Machine Learning Models and Feature Selection Techniques\",\"authors\":\"Marwa Radwan, Amel Ali Alhussan, Abdelhameed Ibrahim, Sayed M. Tawfeek\",\"doi\":\"10.1007/s11540-024-09763-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The diseases that particularly affect potato leaves are early blight and the late blight, and they are dangerous as they reduce yield and quality of the potatoes. In this paper, different machine learning (ML) models for predicting these diseases are analysed based on a detailed database of more than 4000 records of weather conditions. Some of the critical factors that have been investigated to determine correlations with disease prevalence include temperature, humidity, wind speed, and atmospheric pressure. These types of data relationships were comprehensively identified through sophisticated means of analysis such as <i>K</i>-means clustering, PCA, and copula analysis. To achieve this, several machine learning models were used in the study: logistic regression, gradient boosting, multilayer perceptron (MLP), and support vector machine (SVM), as well as <i>K</i>-nearest neighbor (KNN) models both with and without feature selection. Feature selection methods such as the binary Greylag Goose Optimization (bGGO) were applied to improve the predictive performance of the models by identifying feature sets pertinent to the models. Results demonstrated that the MLP model, with feature selection, achieved an accuracy of 98.3%, underscoring the critical role of feature selection in improving model performance. These findings highlight the importance of optimized ML models in proactive agricultural disease management, aiming to minimize crop loss and promote sustainable farming practices.</p>\",\"PeriodicalId\":20378,\"journal\":{\"name\":\"Potato Research\",\"volume\":\"1045 1\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-07-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Potato Research\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1007/s11540-024-09763-8\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Potato Research","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11540-024-09763-8","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
摘要
特别影响马铃薯叶片的病害是早疫病和晚疫病,它们会降低马铃薯的产量和质量,因此非常危险。本文基于一个包含 4000 多条天气条件记录的详细数据库,分析了预测这些病害的不同机器学习 (ML) 模型。为了确定与病害发生率的相关性,对一些关键因素进行了调查,其中包括温度、湿度、风速和大气压力。通过 K 均值聚类、PCA 和 copula 分析等复杂的分析手段,全面确定了这些类型的数据关系。为此,研究中使用了多种机器学习模型:逻辑回归、梯度提升、多层感知器(MLP)和支持向量机(SVM),以及有特征选择和无特征选择的 K 近邻(KNN)模型。特征选择方法(如二元灰雁优化法(bGGO))通过识别与模型相关的特征集来提高模型的预测性能。结果表明,经过特征选择的 MLP 模型的准确率达到了 98.3%,突出了特征选择在提高模型性能方面的关键作用。这些发现凸显了优化的 ML 模型在前瞻性农业疾病管理中的重要性,其目的是最大限度地减少作物损失,促进可持续的农业实践。
Potato Leaf Disease Classification Using Optimized Machine Learning Models and Feature Selection Techniques
The diseases that particularly affect potato leaves are early blight and the late blight, and they are dangerous as they reduce yield and quality of the potatoes. In this paper, different machine learning (ML) models for predicting these diseases are analysed based on a detailed database of more than 4000 records of weather conditions. Some of the critical factors that have been investigated to determine correlations with disease prevalence include temperature, humidity, wind speed, and atmospheric pressure. These types of data relationships were comprehensively identified through sophisticated means of analysis such as K-means clustering, PCA, and copula analysis. To achieve this, several machine learning models were used in the study: logistic regression, gradient boosting, multilayer perceptron (MLP), and support vector machine (SVM), as well as K-nearest neighbor (KNN) models both with and without feature selection. Feature selection methods such as the binary Greylag Goose Optimization (bGGO) were applied to improve the predictive performance of the models by identifying feature sets pertinent to the models. Results demonstrated that the MLP model, with feature selection, achieved an accuracy of 98.3%, underscoring the critical role of feature selection in improving model performance. These findings highlight the importance of optimized ML models in proactive agricultural disease management, aiming to minimize crop loss and promote sustainable farming practices.
期刊介绍:
Potato Research, the journal of the European Association for Potato Research (EAPR), promotes the exchange of information on all aspects of this fast-evolving global industry. It offers the latest developments in innovative research to scientists active in potato research. The journal includes authoritative coverage of new scientific developments, publishing original research and review papers on such topics as:
Molecular sciences;
Breeding;
Physiology;
Pathology;
Nematology;
Virology;
Agronomy;
Engineering and Utilization.