A non-parametric unsupervised approach for content based image retrieval and clustering

Konstantinos Makantasis, A. Doulamis, N. Doulamis
{"title":"A non-parametric unsupervised approach for content based image retrieval and clustering","authors":"Konstantinos Makantasis, A. Doulamis, N. Doulamis","doi":"10.1145/2510650.2510656","DOIUrl":null,"url":null,"abstract":"Nowadays, there are available extremely large collections of images located on distributed and heterogeneous platforms over the web. The proliferation of billions of shared photos has outpaced the current technology for browsing such collections, but at the same time it spurred the emergence of new image retrieval techniques based not only on photos' visual information, but on geo-location tags and camera exif data. Although, additional image information may be proven very useful for preliminary image retrieval, the final retrieved result is necessary to be refined by exploiting visual information.\n In this paper we present a process for refining image retrieval results by exploiting and fusing two unsupervised clustering techniques: DBSCAN and spectral clustering. DBSCAN algorithm is used to remove outliers from the initially retrieved image set, and spectral clustering finalizes retrieval process by clustering together visually similar images. However, DBSCAN and spectral clustering require manual tunning of their parameters, which usually requires a priori knowledge of the dataset. To overcome this problem we developed a tuning mechanism that automatically tunes the parameters of both algorithms. For the evaluation of the proposed approach we used thousands of images from Flickr downloaded using text queries for well known cultural heritage monuments.","PeriodicalId":360789,"journal":{"name":"ACM/IEEE international workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Stream","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IEEE international workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Stream","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2510650.2510656","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

Nowadays, there are available extremely large collections of images located on distributed and heterogeneous platforms over the web. The proliferation of billions of shared photos has outpaced the current technology for browsing such collections, but at the same time it spurred the emergence of new image retrieval techniques based not only on photos' visual information, but on geo-location tags and camera exif data. Although, additional image information may be proven very useful for preliminary image retrieval, the final retrieved result is necessary to be refined by exploiting visual information. In this paper we present a process for refining image retrieval results by exploiting and fusing two unsupervised clustering techniques: DBSCAN and spectral clustering. DBSCAN algorithm is used to remove outliers from the initially retrieved image set, and spectral clustering finalizes retrieval process by clustering together visually similar images. However, DBSCAN and spectral clustering require manual tunning of their parameters, which usually requires a priori knowledge of the dataset. To overcome this problem we developed a tuning mechanism that automatically tunes the parameters of both algorithms. For the evaluation of the proposed approach we used thousands of images from Flickr downloaded using text queries for well known cultural heritage monuments.
基于内容的图像检索和聚类的非参数无监督方法
如今,在网络上的分布式和异构平台上有大量可用的图像集合。数以亿计的共享照片的激增超过了目前浏览这些照片的技术,但与此同时,它刺激了新的图像检索技术的出现,这些技术不仅基于照片的视觉信息,还基于地理位置标签和相机出口数据。虽然额外的图像信息可能被证明对初步的图像检索非常有用,但最终的检索结果需要通过利用视觉信息来改进。本文通过利用和融合两种无监督聚类技术:DBSCAN和谱聚类,提出了一种改进图像检索结果的方法。使用DBSCAN算法从初始检索的图像集中去除异常值,光谱聚类通过将视觉上相似的图像聚类在一起来完成检索过程。然而,DBSCAN和谱聚类需要手动调优它们的参数,这通常需要对数据集有先验的了解。为了克服这个问题,我们开发了一种调优机制,可以自动调整两种算法的参数。为了评估所提出的方法,我们使用了从Flickr下载的数千张图片,这些图片使用了众所周知的文化遗产纪念碑的文本查询。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信