Automatic Image Annotation Using Global and Local Features

2011 Sixth International Workshop on Semantic Media Adaptation and Personalization Pub Date : 2011-12-01 DOI:10.1109/SMAP.2011.14

M. Bieliková, Eduard Kuric

{"title":"Automatic Image Annotation Using Global and Local Features","authors":"M. Bieliková, Eduard Kuric","doi":"10.1109/SMAP.2011.14","DOIUrl":null,"url":null,"abstract":"Automatic image annotation methods require a quality training image dataset, from which annotations for target images are obtained. At present, the main problem with these methods is their low effectiveness and scalability if a large-scale training dataset is used. Current methods use only global image features for search. We proposed a method to obtain annotations for target images, which is based on a novel combination of local and global features during search stage. We are able to ensure the robustness and generalization needed by complex queries and significantly eliminate irrelevant results. In our method, in analogy with text documents, the global features represent words extracted from paragraphs of a document with the highest frequency of occurrence and the local features represent key words extracted from the entire document. We are able to identify objects directly in target images and for each obtained annotation we estimate the probability of its relevance. During search, we retrieve similar images containing the correct keywords for a given target image. For example, we prioritize images where extracted objects of interest from the target images are dominant as it is more likely that words associated with the images describe the objects. We tailored our method to use large-scale image training datasets and evaluated it with the Corel5K corpus which consists of 5000 images from 50 Corel Stock Photo CDs.","PeriodicalId":346975,"journal":{"name":"2011 Sixth International Workshop on Semantic Media Adaptation and Personalization","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Sixth International Workshop on Semantic Media Adaptation and Personalization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMAP.2011.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Automatic image annotation methods require a quality training image dataset, from which annotations for target images are obtained. At present, the main problem with these methods is their low effectiveness and scalability if a large-scale training dataset is used. Current methods use only global image features for search. We proposed a method to obtain annotations for target images, which is based on a novel combination of local and global features during search stage. We are able to ensure the robustness and generalization needed by complex queries and significantly eliminate irrelevant results. In our method, in analogy with text documents, the global features represent words extracted from paragraphs of a document with the highest frequency of occurrence and the local features represent key words extracted from the entire document. We are able to identify objects directly in target images and for each obtained annotation we estimate the probability of its relevance. During search, we retrieve similar images containing the correct keywords for a given target image. For example, we prioritize images where extracted objects of interest from the target images are dominant as it is more likely that words associated with the images describe the objects. We tailored our method to use large-scale image training datasets and evaluated it with the Corel5K corpus which consists of 5000 images from 50 Corel Stock Photo CDs.

查看原文本刊更多论文

使用全局和局部特征的自动图像注释

自动图像标注方法需要一个高质量的训练图像数据集，从中获得目标图像的标注。目前，这些方法的主要问题是在使用大规模训练数据集时，它们的有效性和可扩展性较低。目前的方法仅使用全局图像特征进行搜索。提出了一种基于局部特征和全局特征相结合的目标图像标注方法。我们能够确保复杂查询所需的鲁棒性和泛化，并显著消除不相关的结果。在我们的方法中，与文本文档类似，全局特征表示从文档中出现频率最高的段落中提取的词，局部特征表示从整个文档中提取的关键词。我们能够直接识别目标图像中的物体，并且对于每个获得的注释，我们估计其相关的概率。在搜索过程中，我们为给定的目标图像检索包含正确关键字的相似图像。例如，我们对从目标图像中提取的感兴趣的物体进行优先排序，因为与图像相关的单词更有可能描述物体。我们将我们的方法定制为使用大规模图像训练数据集，并使用Corel5K语料库对其进行评估，该语料库由来自50张Corel Stock Photo cd的5000张图像组成。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 Sixth International Workshop on Semantic Media Adaptation and Personalization

自引率

0.00%

发文量