Combining words and object-based visual features in image retrieval
Akihiko Nakagawa, Andrea Kutics, Kiyotaka Tanaka, Masaomi Nakajima
12th International Conference on Image Analysis and Processing, 2003. Proceedings.
Published: 2003-09-17 · DOI: 10.1109/ICIAP.2003.1234075 · Cited by: 8
The paper presents a novel approach to image retrieval that combines textual and object-based visual features in order to reduce the gap between the user's subjective interpretation of similarity and the results produced by objective similarity models. A multi-scale segmentation framework is proposed to detect prominent image objects. These objects are clustered according to their visual features and mapped to related words determined by psychophysical studies. Furthermore, a hierarchy of words expressing higher-level meaning is built on the basis of natural language processing and user evaluation. Experiments conducted on a large set of natural images showed that this two-layer word association, together with support for various query specifications and options, yields higher retrieval precision in capturing the user's retrieval semantics.
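The abstract gives no implementation details, but the two-layer word association it describes can be sketched roughly as follows: segmented objects are grouped by visual features, each group is mapped to a first-layer word, and first-layer words roll up into higher-level words used at query time. All feature tuples, word mappings, and function names below are invented for illustration and are not taken from the paper.

```python
# Hypothetical sketch of two-layer word association for object-based retrieval.
# The real system derives its word mappings from psychophysical studies and NLP;
# the dictionaries here are toy placeholders.

from dataclasses import dataclass, field

@dataclass
class ImageObject:
    image_id: str
    feature: tuple                       # e.g. a quantized color/texture descriptor
    words: set = field(default_factory=set)

# First layer: visual-feature cluster -> word (assumed mapping).
CLUSTER_TO_WORD = {
    ("green", "textured"): "grass",
    ("blue", "smooth"): "sky",
    ("gray", "rough"): "rock",
}

# Second layer: word -> higher-level word (assumed hierarchy).
WORD_TO_CONCEPT = {
    "grass": "nature",
    "sky": "outdoor",
    "rock": "nature",
}

def annotate(obj: ImageObject) -> None:
    """Attach first- and second-layer words to a segmented object."""
    word = CLUSTER_TO_WORD.get(obj.feature)
    if word:
        obj.words.add(word)
        concept = WORD_TO_CONCEPT.get(word)
        if concept:
            obj.words.add(concept)

def retrieve(query_words: set, objects: list) -> list:
    """Rank images by how many query words their objects match."""
    scores = {}
    for obj in objects:
        scores[obj.image_id] = scores.get(obj.image_id, 0) + len(query_words & obj.words)
    return sorted((i for i, s in scores.items() if s > 0), key=lambda i: -scores[i])

if __name__ == "__main__":
    db = [ImageObject("img1", ("green", "textured")),
          ImageObject("img1", ("blue", "smooth")),
          ImageObject("img2", ("gray", "rough"))]
    for obj in db:
        annotate(obj)
    print(retrieve({"nature"}, db))      # both images match via the higher-level word
```

In this toy version a query for a higher-level word such as "nature" matches images whose objects were only labeled with first-layer words like "grass" or "rock", which is the kind of semantic bridging the paper attributes to its two-layer association.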