Visual Recognition of Local Kashmiri Objects with Limited Image Data using Transfer Learning

2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI) Pub Date : 2021-08-25 DOI:10.1109/ICETCI51973.2021.9574047

Asrar Nehvi, Rayees M. Dar, Assif Assad

{"title":"Visual Recognition of Local Kashmiri Objects with Limited Image Data using Transfer Learning","authors":"Asrar Nehvi, Rayees M. Dar, Assif Assad","doi":"10.1109/ICETCI51973.2021.9574047","DOIUrl":null,"url":null,"abstract":"Learning to recognize object categories is a challenging task for computers and the task becomes more difficult if the image data is small in size. Traditional machine learning methods require extensive training data to generalize and produce accurate results. Seeking inspiration from human perception, it has been found using prior knowledge about related tasks helps in learning new tasks. Transfer Learning is based on this natural learning process and can help to reproduce the remarkable human capability of recognizing objects from just one single view. In this manuscript we explored transfer learning techniques along with state-of-the-art object recognition models to discover improved ways of performing Visual Object Recognition for Object categories with limited image data. A small data set was built from scratch having images for four local object categories and then a model was developed using pre trained Inception-v3 model for classifying them. The results were compared with stock 3 Layer Convolutional model. The proposed model obtained a respectable accuracy result of around 90% while the stock model had an accuracy of 70%. Considering the fact that the dataset used is very tiny (training portion of the data set had only 320 images for all categories, 80 per category) the results obtained are encouraging. Thus, this work further strengthens the fact that transfer-based techniques can be utilized for computer vision tasks with limited data.","PeriodicalId":281877,"journal":{"name":"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)","volume":"73 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETCI51973.2021.9574047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Learning to recognize object categories is a challenging task for computers and the task becomes more difficult if the image data is small in size. Traditional machine learning methods require extensive training data to generalize and produce accurate results. Seeking inspiration from human perception, it has been found using prior knowledge about related tasks helps in learning new tasks. Transfer Learning is based on this natural learning process and can help to reproduce the remarkable human capability of recognizing objects from just one single view. In this manuscript we explored transfer learning techniques along with state-of-the-art object recognition models to discover improved ways of performing Visual Object Recognition for Object categories with limited image data. A small data set was built from scratch having images for four local object categories and then a model was developed using pre trained Inception-v3 model for classifying them. The results were compared with stock 3 Layer Convolutional model. The proposed model obtained a respectable accuracy result of around 90% while the stock model had an accuracy of 70%. Considering the fact that the dataset used is very tiny (training portion of the data set had only 320 images for all categories, 80 per category) the results obtained are encouraging. Thus, this work further strengthens the fact that transfer-based techniques can be utilized for computer vision tasks with limited data.

查看原文本刊更多论文

基于迁移学习的有限图像数据下克什米尔局部物体的视觉识别

学习识别对象类别对计算机来说是一项具有挑战性的任务，如果图像数据很小，任务就会变得更加困难。传统的机器学习方法需要大量的训练数据来概括和产生准确的结果。从人类感知中寻求灵感，人们发现利用相关任务的先验知识有助于学习新任务。迁移学习是基于这种自然的学习过程，可以帮助重现人类从单一视角识别物体的非凡能力。在本文中，我们探索了迁移学习技术以及最先进的对象识别模型，以发现对有限图像数据的对象类别执行视觉对象识别的改进方法。我们从零开始构建了一个小数据集，其中包含四个局部对象类别的图像，然后使用预训练的Inception-v3模型开发了一个模型来对它们进行分类。结果与现有的3层卷积模型进行了比较。该模型的准确率在90%左右，而股票模型的准确率为70%。考虑到使用的数据集非常小(数据集的训练部分对于所有类别只有320张图像，每个类别80张)，获得的结果是令人鼓舞的。因此，这项工作进一步加强了这样一个事实，即基于转移的技术可以用于有限数据的计算机视觉任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)

自引率

0.00%

发文量