Classification of Dog and Cat Images using the CNN Method

Teguh Adriyanto, Risky Aswi Ramadhani, R. Helilintar, Aidina Ristyawan
{"title":"Classification of Dog and Cat Images using the CNN Method","authors":"Teguh Adriyanto, Risky Aswi Ramadhani, R. Helilintar, Aidina Ristyawan","doi":"10.33096/ilkom.v14i3.1116.203-208","DOIUrl":null,"url":null,"abstract":"Blind people can be defined as those people who are unable to see objects or pictures around them with their eyes. This inability becomes an issue for them when dealing with objects or images in front of them. These problems lead to the novelty of this study that is to recognize objects or images around blind people with the CNN algorithm. Dogs and cats were used as objects in this study. These object recognitions used Deep Learning, a relatively new science in the field of machine learning. Deep learning works like the human brain's ability to recognize an object. In this study, the objects that were used were pictures of a dog and a cat. This study used 3 types of data, namely training, validation, and testing data. The data training consisted of dog data with a total of 1000 images and cat data with a total of 1000 images. Data validation consisted of 500 dog data  and 500 cat data. The CCN architecture employed 3 convolution layers. The layer was convolution 1 using 16 filters of kernel size 3x3, the second convolution using 32 filters of  kernel size 3x3 and the third using 64 filters of kernel size 3x3. While the data testing consisted of 51dog data and 27 cat data. The method used to analyze the image was CNN. The input was an image with a size of 150x150 pixels with 3 channels, namely R, G, and B. This classification went through a performance test with the Confusion Matrix and it obtained 45% precision, 45% recall and 45% f1-score. From these results it can be concluded that the accuracy values should be improved.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ilkom Jurnal Ilmiah","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33096/ilkom.v14i3.1116.203-208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Blind people can be defined as those people who are unable to see objects or pictures around them with their eyes. This inability becomes an issue for them when dealing with objects or images in front of them. These problems lead to the novelty of this study that is to recognize objects or images around blind people with the CNN algorithm. Dogs and cats were used as objects in this study. These object recognitions used Deep Learning, a relatively new science in the field of machine learning. Deep learning works like the human brain's ability to recognize an object. In this study, the objects that were used were pictures of a dog and a cat. This study used 3 types of data, namely training, validation, and testing data. The data training consisted of dog data with a total of 1000 images and cat data with a total of 1000 images. Data validation consisted of 500 dog data  and 500 cat data. The CCN architecture employed 3 convolution layers. The layer was convolution 1 using 16 filters of kernel size 3x3, the second convolution using 32 filters of  kernel size 3x3 and the third using 64 filters of kernel size 3x3. While the data testing consisted of 51dog data and 27 cat data. The method used to analyze the image was CNN. The input was an image with a size of 150x150 pixels with 3 channels, namely R, G, and B. This classification went through a performance test with the Confusion Matrix and it obtained 45% precision, 45% recall and 45% f1-score. From these results it can be concluded that the accuracy values should be improved.
利用CNN方法对猫狗图像进行分类
盲人可以定义为那些不能用眼睛看到周围物体或图片的人。当他们处理面前的物体或图像时,这种无能就成了一个问题。这些问题导致了本研究的新颖之处,即使用CNN算法识别盲人周围的物体或图像。本研究以猫和狗为研究对象。这些对象识别使用了深度学习,这是机器学习领域的一门相对较新的科学。深度学习的工作原理类似于人类大脑识别物体的能力。在这项研究中,使用的对象是一只狗和一只猫的照片。本研究使用了3种数据,即训练数据、验证数据和测试数据。数据训练由总共1000张图片的狗数据和总共1000张图片的猫数据组成。数据验证包括500条狗数据和500条猫数据。CCN架构采用3个卷积层。该层是卷积1,使用16个内核大小为3x3的滤波器,第二次卷积使用32个内核大小为3x3的滤波器,第三次卷积使用64个内核大小为3x3的滤波器。而数据测试包括51条狗数据和27条猫数据。对图像进行分析的方法是CNN。输入为150x150像素大小的图像,有3个通道,分别是R、G、b。该分类通过混淆矩阵进行性能测试,获得了45%的准确率、45%的召回率和45%的f1-score。从这些结果可以得出结论,精度值有待提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
4 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信