A Spatial Model of K-Nearest Neighbors for Classification of Cotton (Gossypium) Varieties based on Image Segmentation

Salman Qadri
{"title":"A Spatial Model of K-Nearest Neighbors for Classification of Cotton (Gossypium) Varieties based on Image Segmentation","authors":"Salman Qadri","doi":"10.54692/lgurjcsit.2021.0501173","DOIUrl":null,"url":null,"abstract":"In this study, we describe a technique that used a machine learning (ML) approach to classify four (4) different cotton leaf varieties namely; BS-15, S-32, Z-31, and Z-32. Each variety of cotton leaves were collected from 500 Farmers. These image datasets are captured by using the cell phone camera in the open agricultural field area, and every image was captured from both sides (Front and Back) of the cotton leaf. Each variety of cotton has used over 300 (150 Front Side and 150 Back Side of the leaves) leaf images and the total calculated cotton leaves are 1200 (300 x 4) as leaf image samples. These sample datasets have analyzed through image preprocessing and image segmentation process. Each image was employing four different non-over-lapping regions of interest (ROI’s) and calculated a total of 4800 (1200 x 4) ROI’s. The acquired datasets are employed different machine learning features such as Scalability, Texture, Spectral, Binary, Histogram, Rotational, and translational (R-S-T). A total of fifty-seven (57) machine learning features were evaluated on each ROI and a total calculated 273,600 (4800 x 57) features. Furthermore, the Correlation-Based Feature Selection (CFS) genetic algorithm technique was employed for feature optimization. It has been evaluated 22 optimized features and applying different machine learning (M-L) classifiers namely; K-Nearest Neighbor (K-NN), K*, Random Forest (RF) Tree, and Naive Bayes (NB) Tree. The resulting accuracy produced by K-NN presented is 98.9167% on (512 x 512) ROI’s. The individually overall result accuracy dataset values by using K-NN classifier on the four varieties of cotton leaf namely; BS-15, S-32, Z-31, and Z-32 were evaluated 97.83%, 99.50%, 99%, and 99.33%, respectively.","PeriodicalId":197260,"journal":{"name":"Lahore Garrison University Research Journal of Computer Science and Information Technology","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lahore Garrison University Research Journal of Computer Science and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54692/lgurjcsit.2021.0501173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

In this study, we describe a technique that used a machine learning (ML) approach to classify four (4) different cotton leaf varieties namely; BS-15, S-32, Z-31, and Z-32. Each variety of cotton leaves were collected from 500 Farmers. These image datasets are captured by using the cell phone camera in the open agricultural field area, and every image was captured from both sides (Front and Back) of the cotton leaf. Each variety of cotton has used over 300 (150 Front Side and 150 Back Side of the leaves) leaf images and the total calculated cotton leaves are 1200 (300 x 4) as leaf image samples. These sample datasets have analyzed through image preprocessing and image segmentation process. Each image was employing four different non-over-lapping regions of interest (ROI’s) and calculated a total of 4800 (1200 x 4) ROI’s. The acquired datasets are employed different machine learning features such as Scalability, Texture, Spectral, Binary, Histogram, Rotational, and translational (R-S-T). A total of fifty-seven (57) machine learning features were evaluated on each ROI and a total calculated 273,600 (4800 x 57) features. Furthermore, the Correlation-Based Feature Selection (CFS) genetic algorithm technique was employed for feature optimization. It has been evaluated 22 optimized features and applying different machine learning (M-L) classifiers namely; K-Nearest Neighbor (K-NN), K*, Random Forest (RF) Tree, and Naive Bayes (NB) Tree. The resulting accuracy produced by K-NN presented is 98.9167% on (512 x 512) ROI’s. The individually overall result accuracy dataset values by using K-NN classifier on the four varieties of cotton leaf namely; BS-15, S-32, Z-31, and Z-32 were evaluated 97.83%, 99.50%, 99%, and 99.33%, respectively.
基于图像分割的棉花品种分类空间k近邻模型
在本研究中,我们描述了一种使用机器学习(ML)方法对四(4)种不同的棉花叶片品种进行分类的技术,即;BS-15, S-32, Z-31和Z-32。每个品种的棉花叶子都是从500个农民那里收集来的。这些图像数据集是利用手机相机在开阔的农田地区拍摄的,每张图像都是从棉花叶片的正面和背面拍摄的。每个棉花品种使用了300多个叶片图像(正面150个,背面150个),计算出的棉花叶片总数为1200个(300 × 4)作为叶片图像样本。这些样本数据集通过图像预处理和图像分割过程进行分析。每张图像采用四个不同的非重叠感兴趣区域(ROI 's),并计算出总共4800 (1200 x 4)个ROI 's。获取的数据集采用不同的机器学习特征,如可伸缩性、纹理、光谱、二进制、直方图、旋转和平移(R-S-T)。在每个ROI上总共评估了57个机器学习特征,总共计算了273600个(4800 x 57)特征。在此基础上,采用基于关联的特征选择(CFS)遗传算法进行特征优化。它已经评估了22个优化的特征,并应用了不同的机器学习(M-L)分类器,即;K-最近邻(K- nn)、K*、随机森林(RF)树和朴素贝叶斯(NB)树。在(512 × 512) ROI上,K-NN产生的准确率为98.9167%。K-NN分类器在4种棉花叶片上的单独总体结果精度数据集值为;BS-15、S-32、Z-31和Z-32的评价分别为97.83%、99.50%、99%和99.33%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信