Towards automated large scale discovery of image families

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Pub Date : 2009-06-20 DOI:10.1109/CVPRW.2009.5204177

M. Aly, P. Welinder, Mario E. Munich, P. Perona

引用次数: 23

Abstract

Gathering large collections of images is quite easy nowadays with the advent of image sharing Web sites, such as flickr.com. However, such collections inevitably contain duplicates and highly similar images, what we refer to as image families. Automatic discovery and cataloguing of such similar images in large collections is important for many applications, e.g. image search, image collection visualization, and research purposes among others. In this work, we investigate this problem by thoroughly comparing two broad approaches for measuring image similarity: global vs. local features. We assess their performance as the image collection scales up to over 11,000 images with over 6,300 families. We present our results on three datasets with different statistics, including two new challenging datasets. Moreover, we present a new algorithm to automatically determine the number of families in the collection with promising results.

查看原文本刊更多论文

朝着自动大规模发现图像族的方向发展

随着像flickr.com这样的图片共享网站的出现，收集大量的图片变得非常容易。然而，这样的集合不可避免地包含重复和高度相似的图像，我们称之为图像族。在大型集合中自动发现和编目这些相似的图像对许多应用程序都很重要，例如图像搜索，图像集合可视化和研究目的等。在这项工作中，我们通过全面比较测量图像相似性的两种广泛方法来研究这个问题:全局特征与局部特征。当图像收集扩展到超过11,000张图像，超过6,300个家庭时，我们评估了它们的性能。我们在三个具有不同统计数据的数据集上展示了我们的结果，包括两个新的具有挑战性的数据集。此外，我们提出了一种新的算法来自动确定集合中的家庭数量，结果很有希望。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

自引率

0.00%

发文量