基于CNN特征提取与选择的EPO优化棉花叶病分类

Mehwish Zafar, Javeria Amin, Muhammad Sharif, Muhammad Almas Anjum, Seifedine Kadry, Jungeun Kim
{"title":"基于CNN特征提取与选择的EPO优化棉花叶病分类","authors":"Mehwish Zafar, Javeria Amin, Muhammad Sharif, Muhammad Almas Anjum, Seifedine Kadry, Jungeun Kim","doi":"10.32604/cmc.2023.035860","DOIUrl":null,"url":null,"abstract":"Worldwide cotton is the most profitable cash crop. Each year the production of this crop suffers because of several diseases. At an early stage, computerized methods are used for disease detection that may reduce the loss in the production of cotton. Although several methods are proposed for the detection of cotton diseases, however, still there are limitations because of low-quality images, size, shape, variations in orientation, and complex background. Due to these factors, there is a need for novel methods for features extraction/selection for the accurate cotton disease classification. Therefore in this research, an optimized features fusion-based model is proposed, in which two pre-trained architectures called EfficientNet-b0 and Inception-v3 are utilized to extract features, each model extracts the feature vector of length N 1000. After that, the extracted features are serially concatenated having a feature vector length N 2000. The most prominent features are selected using Emperor Penguin Optimizer (EPO) method. The method is evaluated on two publically available datasets, such as Kaggle cotton disease dataset-I, and Kaggle cotton-leaf-infection-II. The EPO method returns the feature vector of length 1 755, and 1 824 using dataset-I, and dataset-II, respectively. The classification is performed using 5, 7, and 10 folds cross-validation. The Quadratic Discriminant Analysis (QDA) classifier provides an accuracy of 98.9% on 5 fold, 98.96% on 7 fold, and 99.07% on 10 fold using Kaggle cotton disease dataset-I while the Ensemble Subspace K Nearest Neighbor (KNN) provides 99.16% on 5 fold, 98.99% on 7 fold, and 99.27% on 10 fold using Kaggle cotton-leaf-infection dataset-II.","PeriodicalId":93535,"journal":{"name":"Computers, materials & continua","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CNN Based Features Extraction and Selection Using EPO Optimizer for Cotton Leaf Diseases Classification\",\"authors\":\"Mehwish Zafar, Javeria Amin, Muhammad Sharif, Muhammad Almas Anjum, Seifedine Kadry, Jungeun Kim\",\"doi\":\"10.32604/cmc.2023.035860\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Worldwide cotton is the most profitable cash crop. Each year the production of this crop suffers because of several diseases. At an early stage, computerized methods are used for disease detection that may reduce the loss in the production of cotton. Although several methods are proposed for the detection of cotton diseases, however, still there are limitations because of low-quality images, size, shape, variations in orientation, and complex background. Due to these factors, there is a need for novel methods for features extraction/selection for the accurate cotton disease classification. Therefore in this research, an optimized features fusion-based model is proposed, in which two pre-trained architectures called EfficientNet-b0 and Inception-v3 are utilized to extract features, each model extracts the feature vector of length N 1000. After that, the extracted features are serially concatenated having a feature vector length N 2000. The most prominent features are selected using Emperor Penguin Optimizer (EPO) method. The method is evaluated on two publically available datasets, such as Kaggle cotton disease dataset-I, and Kaggle cotton-leaf-infection-II. The EPO method returns the feature vector of length 1 755, and 1 824 using dataset-I, and dataset-II, respectively. The classification is performed using 5, 7, and 10 folds cross-validation. The Quadratic Discriminant Analysis (QDA) classifier provides an accuracy of 98.9% on 5 fold, 98.96% on 7 fold, and 99.07% on 10 fold using Kaggle cotton disease dataset-I while the Ensemble Subspace K Nearest Neighbor (KNN) provides 99.16% on 5 fold, 98.99% on 7 fold, and 99.27% on 10 fold using Kaggle cotton-leaf-infection dataset-II.\",\"PeriodicalId\":93535,\"journal\":{\"name\":\"Computers, materials & continua\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers, materials & continua\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.32604/cmc.2023.035860\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers, materials & continua","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32604/cmc.2023.035860","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

棉花是全世界最赚钱的经济作物。这种作物的产量每年都受到几种病害的影响。在早期阶段,计算机方法被用于疾病检测,这可能会减少棉花生产中的损失。虽然提出了几种检测棉花病害的方法,但由于图像质量低、大小、形状、方向变化和背景复杂等原因,仍然存在局限性。由于这些因素,需要新的特征提取/选择方法来实现棉花病害的准确分类。因此,本研究提出了一种优化的基于特征融合的模型,其中利用两个预训练的架构effentnet -b0和Inception-v3进行特征提取,每个模型提取长度为n1000的特征向量。之后,将提取的特征进行序列拼接,特征向量长度为n2000。使用帝企鹅优化(EPO)方法选择最突出的特征。该方法在两个公开的数据集上进行了评估,如Kaggle棉花疾病数据集i和Kaggle棉花叶片感染数据集ii。EPO方法分别使用数据集i和数据集ii返回长度为1 755和1 824的特征向量。分类使用5倍、7倍和10倍交叉验证执行。二次判别分析(Quadratic Discriminant Analysis, QDA)分类器在Kaggle棉叶侵染数据集i上的5次准确率为98.9%,7次准确率为98.96%,10次准确率为99.07%,而集成子空间K最邻近(KNN)在Kaggle棉叶侵染数据集ii上的5次准确率为99.16%,7次准确率为98.99%,10次准确率为99.27%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
CNN Based Features Extraction and Selection Using EPO Optimizer for Cotton Leaf Diseases Classification
Worldwide cotton is the most profitable cash crop. Each year the production of this crop suffers because of several diseases. At an early stage, computerized methods are used for disease detection that may reduce the loss in the production of cotton. Although several methods are proposed for the detection of cotton diseases, however, still there are limitations because of low-quality images, size, shape, variations in orientation, and complex background. Due to these factors, there is a need for novel methods for features extraction/selection for the accurate cotton disease classification. Therefore in this research, an optimized features fusion-based model is proposed, in which two pre-trained architectures called EfficientNet-b0 and Inception-v3 are utilized to extract features, each model extracts the feature vector of length N 1000. After that, the extracted features are serially concatenated having a feature vector length N 2000. The most prominent features are selected using Emperor Penguin Optimizer (EPO) method. The method is evaluated on two publically available datasets, such as Kaggle cotton disease dataset-I, and Kaggle cotton-leaf-infection-II. The EPO method returns the feature vector of length 1 755, and 1 824 using dataset-I, and dataset-II, respectively. The classification is performed using 5, 7, and 10 folds cross-validation. The Quadratic Discriminant Analysis (QDA) classifier provides an accuracy of 98.9% on 5 fold, 98.96% on 7 fold, and 99.07% on 10 fold using Kaggle cotton disease dataset-I while the Ensemble Subspace K Nearest Neighbor (KNN) provides 99.16% on 5 fold, 98.99% on 7 fold, and 99.27% on 10 fold using Kaggle cotton-leaf-infection dataset-II.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信