Machine Learning Classification Models Comparison for Crop Damage Identification

Douaer Abdelmalek, Kourgli Assia
{"title":"Machine Learning Classification Models Comparison for Crop Damage Identification","authors":"Douaer Abdelmalek, Kourgli Assia","doi":"10.1109/ICAEE53772.2022.9962088","DOIUrl":null,"url":null,"abstract":"Plants provide over eighty percent of the food consumed by humans and livestock. Its safety is threatened by diseases and pests. The main objective of the paper is to identify health status of crop state using machine learning classification models. We manage to compare eight approaches: Logistic Regression (LG), K-nearest neighbors (KNN), Decision Tree (DT), Random Forest (RF), Extreme Gradient Boosting (XGB), Gaussian Naive Bayes (GNB), Support Vector Machine (SVM) and Multilayer perceptron (MLP), on Indian agriculture dataset. We also used the Principal component analysis (PCA) and attempted to re-apply the same models. This paper includes an overview of these previously mentioned models, comparison between them according to data used, and a conclusion about which model leads to the best performance. Tests conducted show that the Extreme Gradient Boosting (XGB) is the best model for these data and problem for all cases then Decision Tree (DT), Logistic Regression (LR), and K-nearest neighbors (KNN). The last three models being very close in results.","PeriodicalId":206584,"journal":{"name":"2022 2nd International Conference on Advanced Electrical Engineering (ICAEE)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Advanced Electrical Engineering (ICAEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAEE53772.2022.9962088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Plants provide over eighty percent of the food consumed by humans and livestock. Its safety is threatened by diseases and pests. The main objective of the paper is to identify health status of crop state using machine learning classification models. We manage to compare eight approaches: Logistic Regression (LG), K-nearest neighbors (KNN), Decision Tree (DT), Random Forest (RF), Extreme Gradient Boosting (XGB), Gaussian Naive Bayes (GNB), Support Vector Machine (SVM) and Multilayer perceptron (MLP), on Indian agriculture dataset. We also used the Principal component analysis (PCA) and attempted to re-apply the same models. This paper includes an overview of these previously mentioned models, comparison between them according to data used, and a conclusion about which model leads to the best performance. Tests conducted show that the Extreme Gradient Boosting (XGB) is the best model for these data and problem for all cases then Decision Tree (DT), Logistic Regression (LR), and K-nearest neighbors (KNN). The last three models being very close in results.
作物损害识别的机器学习分类模型比较
植物提供了人类和牲畜消耗的80%以上的食物。其安全性受到病虫害的威胁。本文的主要目的是利用机器学习分类模型来识别作物状态的健康状态。我们在印度农业数据集上比较了八种方法:逻辑回归(LG)、k近邻(KNN)、决策树(DT)、随机森林(RF)、极端梯度增强(XGB)、高斯朴素贝叶斯(GNB)、支持向量机(SVM)和多层感知器(MLP)。我们还使用了主成分分析(PCA),并试图重新应用相同的模型。本文包括对前面提到的这些模型的概述,根据所使用的数据对它们进行比较,并得出关于哪种模型可以获得最佳性能的结论。所进行的测试表明,极端梯度增强(XGB)是这些数据和问题的最佳模型,然后是决策树(DT)、逻辑回归(LR)和k近邻(KNN)。最后三个模型的结果非常接近。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信