A non-parametric unsupervised approach for content based image retrieval and clustering

ACM/IEEE international workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Stream Pub Date : 2013-10-21 DOI:10.1145/2510650.2510656

Konstantinos Makantasis, A. Doulamis, N. Doulamis

{"title":"A non-parametric unsupervised approach for content based image retrieval and clustering","authors":"Konstantinos Makantasis, A. Doulamis, N. Doulamis","doi":"10.1145/2510650.2510656","DOIUrl":null,"url":null,"abstract":"Nowadays, there are available extremely large collections of images located on distributed and heterogeneous platforms over the web. The proliferation of billions of shared photos has outpaced the current technology for browsing such collections, but at the same time it spurred the emergence of new image retrieval techniques based not only on photos' visual information, but on geo-location tags and camera exif data. Although, additional image information may be proven very useful for preliminary image retrieval, the final retrieved result is necessary to be refined by exploiting visual information.\n In this paper we present a process for refining image retrieval results by exploiting and fusing two unsupervised clustering techniques: DBSCAN and spectral clustering. DBSCAN algorithm is used to remove outliers from the initially retrieved image set, and spectral clustering finalizes retrieval process by clustering together visually similar images. However, DBSCAN and spectral clustering require manual tunning of their parameters, which usually requires a priori knowledge of the dataset. To overcome this problem we developed a tuning mechanism that automatically tunes the parameters of both algorithms. For the evaluation of the proposed approach we used thousands of images from Flickr downloaded using text queries for well known cultural heritage monuments.","PeriodicalId":360789,"journal":{"name":"ACM/IEEE international workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Stream","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IEEE international workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Stream","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2510650.2510656","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

Nowadays, there are available extremely large collections of images located on distributed and heterogeneous platforms over the web. The proliferation of billions of shared photos has outpaced the current technology for browsing such collections, but at the same time it spurred the emergence of new image retrieval techniques based not only on photos' visual information, but on geo-location tags and camera exif data. Although, additional image information may be proven very useful for preliminary image retrieval, the final retrieved result is necessary to be refined by exploiting visual information. In this paper we present a process for refining image retrieval results by exploiting and fusing two unsupervised clustering techniques: DBSCAN and spectral clustering. DBSCAN algorithm is used to remove outliers from the initially retrieved image set, and spectral clustering finalizes retrieval process by clustering together visually similar images. However, DBSCAN and spectral clustering require manual tunning of their parameters, which usually requires a priori knowledge of the dataset. To overcome this problem we developed a tuning mechanism that automatically tunes the parameters of both algorithms. For the evaluation of the proposed approach we used thousands of images from Flickr downloaded using text queries for well known cultural heritage monuments.

查看原文本刊更多论文

基于内容的图像检索和聚类的非参数无监督方法

如今，在网络上的分布式和异构平台上有大量可用的图像集合。数以亿计的共享照片的激增超过了目前浏览这些照片的技术，但与此同时，它刺激了新的图像检索技术的出现，这些技术不仅基于照片的视觉信息，还基于地理位置标签和相机出口数据。虽然额外的图像信息可能被证明对初步的图像检索非常有用，但最终的检索结果需要通过利用视觉信息来改进。本文通过利用和融合两种无监督聚类技术:DBSCAN和谱聚类，提出了一种改进图像检索结果的方法。使用DBSCAN算法从初始检索的图像集中去除异常值，光谱聚类通过将视觉上相似的图像聚类在一起来完成检索过程。然而，DBSCAN和谱聚类需要手动调优它们的参数，这通常需要对数据集有先验的了解。为了克服这个问题，我们开发了一种调优机制，可以自动调整两种算法的参数。为了评估所提出的方法，我们使用了从Flickr下载的数千张图片，这些图片使用了众所周知的文化遗产纪念碑的文本查询。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM/IEEE international workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Stream

自引率

0.00%

发文量