Visual Recognition of Local Kashmiri Objects with Limited Image Data using Transfer Learning

Asrar Nehvi, Rayees M. Dar, Assif Assad
{"title":"Visual Recognition of Local Kashmiri Objects with Limited Image Data using Transfer Learning","authors":"Asrar Nehvi, Rayees M. Dar, Assif Assad","doi":"10.1109/ICETCI51973.2021.9574047","DOIUrl":null,"url":null,"abstract":"Learning to recognize object categories is a challenging task for computers and the task becomes more difficult if the image data is small in size. Traditional machine learning methods require extensive training data to generalize and produce accurate results. Seeking inspiration from human perception, it has been found using prior knowledge about related tasks helps in learning new tasks. Transfer Learning is based on this natural learning process and can help to reproduce the remarkable human capability of recognizing objects from just one single view. In this manuscript we explored transfer learning techniques along with state-of-the-art object recognition models to discover improved ways of performing Visual Object Recognition for Object categories with limited image data. A small data set was built from scratch having images for four local object categories and then a model was developed using pre trained Inception-v3 model for classifying them. The results were compared with stock 3 Layer Convolutional model. The proposed model obtained a respectable accuracy result of around 90% while the stock model had an accuracy of 70%. Considering the fact that the dataset used is very tiny (training portion of the data set had only 320 images for all categories, 80 per category) the results obtained are encouraging. Thus, this work further strengthens the fact that transfer-based techniques can be utilized for computer vision tasks with limited data.","PeriodicalId":281877,"journal":{"name":"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)","volume":"73 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETCI51973.2021.9574047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Learning to recognize object categories is a challenging task for computers and the task becomes more difficult if the image data is small in size. Traditional machine learning methods require extensive training data to generalize and produce accurate results. Seeking inspiration from human perception, it has been found using prior knowledge about related tasks helps in learning new tasks. Transfer Learning is based on this natural learning process and can help to reproduce the remarkable human capability of recognizing objects from just one single view. In this manuscript we explored transfer learning techniques along with state-of-the-art object recognition models to discover improved ways of performing Visual Object Recognition for Object categories with limited image data. A small data set was built from scratch having images for four local object categories and then a model was developed using pre trained Inception-v3 model for classifying them. The results were compared with stock 3 Layer Convolutional model. The proposed model obtained a respectable accuracy result of around 90% while the stock model had an accuracy of 70%. Considering the fact that the dataset used is very tiny (training portion of the data set had only 320 images for all categories, 80 per category) the results obtained are encouraging. Thus, this work further strengthens the fact that transfer-based techniques can be utilized for computer vision tasks with limited data.
基于迁移学习的有限图像数据下克什米尔局部物体的视觉识别
学习识别对象类别对计算机来说是一项具有挑战性的任务,如果图像数据很小,任务就会变得更加困难。传统的机器学习方法需要大量的训练数据来概括和产生准确的结果。从人类感知中寻求灵感,人们发现利用相关任务的先验知识有助于学习新任务。迁移学习是基于这种自然的学习过程,可以帮助重现人类从单一视角识别物体的非凡能力。在本文中,我们探索了迁移学习技术以及最先进的对象识别模型,以发现对有限图像数据的对象类别执行视觉对象识别的改进方法。我们从零开始构建了一个小数据集,其中包含四个局部对象类别的图像,然后使用预训练的Inception-v3模型开发了一个模型来对它们进行分类。结果与现有的3层卷积模型进行了比较。该模型的准确率在90%左右,而股票模型的准确率为70%。考虑到使用的数据集非常小(数据集的训练部分对于所有类别只有320张图像,每个类别80张),获得的结果是令人鼓舞的。因此,这项工作进一步加强了这样一个事实,即基于转移的技术可以用于有限数据的计算机视觉任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信