基于零射击学习的未知物体预测与完整案例分析

IF 0.6 Q3 SOCIAL SCIENCES, INTERDISCIPLINARY

Interdisciplinary Description of Complex Systems Pub Date : 2022-08-31 DOI:10.7906/indecs.20.4.10

S. Chakravarthy, Jatin V. R. Arutla

{"title":"基于零射击学习的未知物体预测与完整案例分析","authors":"S. Chakravarthy, Jatin V. R. Arutla","doi":"10.7906/indecs.20.4.10","DOIUrl":null,"url":null,"abstract":"Generally, for a machine learning model to perform well, the data instances on which the model is being trained have to be relevant to the use case. In the case of relevant samples not being available, Zero-shot learning can be used to perform classification tasks. Zero-shot learning is the process of solving a problem when there are no examples of that problem in the phase of training. It lets us classify target classes on which the deep learning model has not been trained. In this article, Zero-shot learning is used to classify food dish classes through an object recognition model. First, the data is collected from Google Images and Kaggle. The image attributes are then extracted using a VGG16 model. The image attributes belonging to the training categories are then used to train a custom-built deep learning model. Various hypermeters of the model are tuned and the results are analyzed in order to get the best possible performance. The image attributes extracted from the zero-shot learning categories are used to test the model after the training process is completed. The predictions are made by comparing the vectors of the target class with the training classes in the Word2Vec space. The metric used to evaluate the model is Top-5 accuracy which indicates whether the expected result is present in the predictions. A Top-5 accuracy of 92% is achieved by implementing zero-shot learning for the classification of unseen food dish images.","PeriodicalId":54126,"journal":{"name":"Interdisciplinary Description of Complex Systems","volume":"14 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prognostication of Unseen Objects using\\n Zero-Shot Learning with a Complete Case Analysis\",\"authors\":\"S. Chakravarthy, Jatin V. R. Arutla\",\"doi\":\"10.7906/indecs.20.4.10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generally, for a machine learning model to perform well, the data instances on which the model is being trained have to be relevant to the use case. In the case of relevant samples not being available, Zero-shot learning can be used to perform classification tasks. Zero-shot learning is the process of solving a problem when there are no examples of that problem in the phase of training. It lets us classify target classes on which the deep learning model has not been trained. In this article, Zero-shot learning is used to classify food dish classes through an object recognition model. First, the data is collected from Google Images and Kaggle. The image attributes are then extracted using a VGG16 model. The image attributes belonging to the training categories are then used to train a custom-built deep learning model. Various hypermeters of the model are tuned and the results are analyzed in order to get the best possible performance. The image attributes extracted from the zero-shot learning categories are used to test the model after the training process is completed. The predictions are made by comparing the vectors of the target class with the training classes in the Word2Vec space. The metric used to evaluate the model is Top-5 accuracy which indicates whether the expected result is present in the predictions. A Top-5 accuracy of 92% is achieved by implementing zero-shot learning for the classification of unseen food dish images.\",\"PeriodicalId\":54126,\"journal\":{\"name\":\"Interdisciplinary Description of Complex Systems\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2022-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Interdisciplinary Description of Complex Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7906/indecs.20.4.10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"SOCIAL SCIENCES, INTERDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Interdisciplinary Description of Complex Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7906/indecs.20.4.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"SOCIAL SCIENCES, INTERDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

通常，为了使机器学习模型表现良好，训练模型的数据实例必须与用例相关。在没有相关样本的情况下，可以使用Zero-shot学习来执行分类任务。零射击学习是指在训练阶段没有任何问题的例子时解决问题的过程。它让我们对深度学习模型尚未训练过的目标类进行分类。在本文中，Zero-shot学习通过物体识别模型来分类食物的菜肴类别。首先，数据是从Google Images和Kaggle收集的。然后使用VGG16模型提取图像属性。然后使用属于训练类别的图像属性来训练定制的深度学习模型。为了获得最佳的性能，对模型的各种超尺度进行了调整，并对结果进行了分析。在训练过程完成后，使用从零射击学习类别中提取的图像属性对模型进行测试。通过比较目标类的向量与Word2Vec空间中的训练类来进行预测。用于评估模型的度量是Top-5精度，它指示预期结果是否出现在预测中。通过实施零射击学习对未见过的食物图像进行分类，达到92%的Top-5准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Prognostication of Unseen Objects using Zero-Shot Learning with a Complete Case Analysis

Generally, for a machine learning model to perform well, the data instances on which the model is being trained have to be relevant to the use case. In the case of relevant samples not being available, Zero-shot learning can be used to perform classification tasks. Zero-shot learning is the process of solving a problem when there are no examples of that problem in the phase of training. It lets us classify target classes on which the deep learning model has not been trained. In this article, Zero-shot learning is used to classify food dish classes through an object recognition model. First, the data is collected from Google Images and Kaggle. The image attributes are then extracted using a VGG16 model. The image attributes belonging to the training categories are then used to train a custom-built deep learning model. Various hypermeters of the model are tuned and the results are analyzed in order to get the best possible performance. The image attributes extracted from the zero-shot learning categories are used to test the model after the training process is completed. The predictions are made by comparing the vectors of the target class with the training classes in the Word2Vec space. The metric used to evaluate the model is Top-5 accuracy which indicates whether the expected result is present in the predictions. A Top-5 accuracy of 92% is achieved by implementing zero-shot learning for the classification of unseen food dish images.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Interdisciplinary Description of Complex Systems SOCIAL SCIENCES, INTERDISCIPLINARY-

自引率

0.00%

发文量

审稿时长

3 weeks