Optimizing progressive query-by-example over pre-clustered large image databases

Computer Vision meets Databases Pub Date : 2005-06-17 DOI:10.1145/1160939.1160946

A. Choupo, Laure Berti-Équille, A. Morin

{"title":"Optimizing progressive query-by-example over pre-clustered large image databases","authors":"A. Choupo, Laure Berti-Équille, A. Morin","doi":"10.1145/1160939.1160946","DOIUrl":null,"url":null,"abstract":"The typical mode for querying in an image content-based information system is query-by-example, which allows the user to provide an image as a query and to search for similar images (i.e., the nearest neighbors) based on one or a combination of low-level multidimensional features of the query example. Off-lime, this requires the time-consuming pre-computing of the whole set of visual descriptors over the image database. On-line, one major drawback is that multidimensional sequential NN-search is usually exhaustive over the whole image set face to the user who has a very limited patience. In this paper, we propose a technique for improving the performance of image query-by-example execution strategies over multiple visual features. This includes first, the pre-clustering of the large image database and then, the scheduling of the processing of the feature clusters before providing progressively the query results (i.e., intermediate results are sent continuously before the end of the exhaustive scan over the whole database). A cluster eligibility criterion and two filtering rules are proposed to select the most relevant clusters to a query-by-example. Experiments over more than 110 000 images and five MPEG-7 global features show that our approach significantly reduces the query time in two experimental cases: the query time is divided by 4.8 for 100 clusters per descriptor type and by 7 for 200 clusters per descriptor type compared to a \"blind\" sequential NN-search with keeping the same final query result. This constitutes a promising perspective for optimizing image query-by-example execution.","PeriodicalId":346313,"journal":{"name":"Computer Vision meets Databases","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision meets Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1160939.1160946","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

The typical mode for querying in an image content-based information system is query-by-example, which allows the user to provide an image as a query and to search for similar images (i.e., the nearest neighbors) based on one or a combination of low-level multidimensional features of the query example. Off-lime, this requires the time-consuming pre-computing of the whole set of visual descriptors over the image database. On-line, one major drawback is that multidimensional sequential NN-search is usually exhaustive over the whole image set face to the user who has a very limited patience. In this paper, we propose a technique for improving the performance of image query-by-example execution strategies over multiple visual features. This includes first, the pre-clustering of the large image database and then, the scheduling of the processing of the feature clusters before providing progressively the query results (i.e., intermediate results are sent continuously before the end of the exhaustive scan over the whole database). A cluster eligibility criterion and two filtering rules are proposed to select the most relevant clusters to a query-by-example. Experiments over more than 110 000 images and five MPEG-7 global features show that our approach significantly reduces the query time in two experimental cases: the query time is divided by 4.8 for 100 clusters per descriptor type and by 7 for 200 clusters per descriptor type compared to a "blind" sequential NN-search with keeping the same final query result. This constitutes a promising perspective for optimizing image query-by-example execution.

查看原文本刊更多论文

在预聚类的大型图像数据库上优化逐例查询

在基于图像内容的信息系统中，查询的典型模式是按例查询，它允许用户提供一个图像作为查询，并根据查询示例的一个或多个低层次多维特征来搜索相似的图像(即最近邻居)。通常，这需要对图像数据库上的整个视觉描述符集进行耗时的预计算。在线搜索的一个主要缺点是，对于耐心有限的用户来说，多维顺序nn搜索通常是对整个图像集的穷尽搜索。在本文中，我们提出了一种技术，用于提高对多个视觉特征的图像按例查询执行策略的性能。这包括首先对大型图像数据库进行预聚类，然后在逐步提供查询结果之前对特征聚类的处理进行调度(即在对整个数据库进行穷举扫描结束之前连续发送中间结果)。提出了一个聚类资格准则和两个过滤规则来选择与实例查询最相关的聚类。对超过11万张图像和5个MPEG-7全局特征进行的实验表明，我们的方法在两个实验案例中显著减少了查询时间:与保持相同最终查询结果的“盲”顺序nn搜索相比，每个描述符类型100个簇的查询时间除以4.8，每个描述符类型200个簇的查询时间除以7。这为优化图像按示例查询的执行提供了一个很有前景的视角。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Vision meets Databases

自引率

0.00%

发文量