使用基于直方图的分数计算算法扩展并消除重复图像来检索有效分类、重新排序的图像

International Conference for Convergence for Technology-2014 Pub Date : 2014-04-06 DOI:10.1109/I2CT.2014.7092036

Parag Shinde, A. Manjrekar

{"title":"使用基于直方图的分数计算算法扩展并消除重复图像来检索有效分类、重新排序的图像","authors":"Parag Shinde, A. Manjrekar","doi":"10.1109/I2CT.2014.7092036","DOIUrl":null,"url":null,"abstract":"Internet Image search is a day to day activity performed by user. User enters a keyword in search engines like Google, Yahoo, Bing etc for retrieval of keyword related images, where millions of images are retrieved through search engines. The problem with a keyword search is that keywords entered by user are very short and ambiguous, hence images which are retrieved are of different categories and some of them are irrelevant. Visual information is used in order to solve the ambiguity in text based image retrieval. User only has to click on one query image. The query image is categorized based on textual features like image title, image URL, context, where a metadata corresponding to every image is extracted and also some visual features like histogram distance computation, SIFT, region based features are extracted. The query image selected by the user is first classified into a particular category and the images related to the query image are then retrieved by matching the class of query image and the class of other images. Using image clustering, classified images are clustered to group highly relevant images into one cluster and the keywords corresponding to the image clusters are extracted. The original keyword is extended by appending the extracted keyword with highest frequency. This gives more detail idea about user's search intention. The images are then re-ranked using visual and textual similarity metrics. Duplicate images which are retrieved in search results are detected and eliminated by using SURF(Speeded Up Robust Feature) technique. The system is tested on variety of categories like person, scenery images at semantic level and other general categories like general objects, objects with simple background etc. The system is totally web based and works dynamically on any keyword given as a input by user.","PeriodicalId":384966,"journal":{"name":"International Conference for Convergence for Technology-2014","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Retrieval of efficiently classified, re-ranked images using histogram based score computation algorithm extended with the elimination of duplicate images\",\"authors\":\"Parag Shinde, A. Manjrekar\",\"doi\":\"10.1109/I2CT.2014.7092036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Internet Image search is a day to day activity performed by user. User enters a keyword in search engines like Google, Yahoo, Bing etc for retrieval of keyword related images, where millions of images are retrieved through search engines. The problem with a keyword search is that keywords entered by user are very short and ambiguous, hence images which are retrieved are of different categories and some of them are irrelevant. Visual information is used in order to solve the ambiguity in text based image retrieval. User only has to click on one query image. The query image is categorized based on textual features like image title, image URL, context, where a metadata corresponding to every image is extracted and also some visual features like histogram distance computation, SIFT, region based features are extracted. The query image selected by the user is first classified into a particular category and the images related to the query image are then retrieved by matching the class of query image and the class of other images. Using image clustering, classified images are clustered to group highly relevant images into one cluster and the keywords corresponding to the image clusters are extracted. The original keyword is extended by appending the extracted keyword with highest frequency. This gives more detail idea about user's search intention. The images are then re-ranked using visual and textual similarity metrics. Duplicate images which are retrieved in search results are detected and eliminated by using SURF(Speeded Up Robust Feature) technique. The system is tested on variety of categories like person, scenery images at semantic level and other general categories like general objects, objects with simple background etc. The system is totally web based and works dynamically on any keyword given as a input by user.\",\"PeriodicalId\":384966,\"journal\":{\"name\":\"International Conference for Convergence for Technology-2014\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference for Convergence for Technology-2014\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I2CT.2014.7092036\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference for Convergence for Technology-2014","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I2CT.2014.7092036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

互联网图像搜索是用户的日常活动。用户在b谷歌，雅虎，必应等搜索引擎中输入关键字，检索与关键字相关的图像，其中数百万图像通过搜索引擎检索。关键字搜索的问题是，用户输入的关键字非常短和模糊，因此检索到的图像是不同的类别，其中一些是不相关的。在基于文本的图像检索中，利用视觉信息来解决歧义问题。用户只需要点击一个查询图像。根据图像标题、图像URL、上下文等文本特征对查询图像进行分类，提取每个图像对应的元数据，同时提取直方图距离计算、SIFT、基于区域的特征等视觉特征。首先将用户选择的查询图像分类到特定的类别中，然后通过匹配查询图像的类别和其他图像的类别来检索与查询图像相关的图像。利用图像聚类，对分类后的图像进行聚类，将高度相关的图像聚到一个聚类中，并提取图像聚类对应的关键词。通过附加提取的频率最高的关键字来扩展原始关键字。这样可以更详细地了解用户的搜索意图。然后使用视觉和文本相似性度量对图像进行重新排序。利用SURF(提速鲁棒特征)技术对检索结果中的重复图像进行检测和消除。该系统在语义层面对人物、风景图像等多种类别以及一般对象、简单背景对象等一般类别进行了测试。该系统是完全基于web的，并动态工作在任何关键字上给出的输入由用户。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Retrieval of efficiently classified, re-ranked images using histogram based score computation algorithm extended with the elimination of duplicate images

Internet Image search is a day to day activity performed by user. User enters a keyword in search engines like Google, Yahoo, Bing etc for retrieval of keyword related images, where millions of images are retrieved through search engines. The problem with a keyword search is that keywords entered by user are very short and ambiguous, hence images which are retrieved are of different categories and some of them are irrelevant. Visual information is used in order to solve the ambiguity in text based image retrieval. User only has to click on one query image. The query image is categorized based on textual features like image title, image URL, context, where a metadata corresponding to every image is extracted and also some visual features like histogram distance computation, SIFT, region based features are extracted. The query image selected by the user is first classified into a particular category and the images related to the query image are then retrieved by matching the class of query image and the class of other images. Using image clustering, classified images are clustered to group highly relevant images into one cluster and the keywords corresponding to the image clusters are extracted. The original keyword is extended by appending the extracted keyword with highest frequency. This gives more detail idea about user's search intention. The images are then re-ranked using visual and textual similarity metrics. Duplicate images which are retrieved in search results are detected and eliminated by using SURF(Speeded Up Robust Feature) technique. The system is tested on variety of categories like person, scenery images at semantic level and other general categories like general objects, objects with simple background etc. The system is totally web based and works dynamically on any keyword given as a input by user.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference for Convergence for Technology-2014

自引率

0.00%

发文量