Optimizing progressive query-by-example over pre-clustered large image databases

A. Choupo, Laure Berti-Équille, A. Morin
{"title":"Optimizing progressive query-by-example over pre-clustered large image databases","authors":"A. Choupo, Laure Berti-Équille, A. Morin","doi":"10.1145/1160939.1160946","DOIUrl":null,"url":null,"abstract":"The typical mode for querying in an image content-based information system is query-by-example, which allows the user to provide an image as a query and to search for similar images (i.e., the nearest neighbors) based on one or a combination of low-level multidimensional features of the query example. Off-lime, this requires the time-consuming pre-computing of the whole set of visual descriptors over the image database. On-line, one major drawback is that multidimensional sequential NN-search is usually exhaustive over the whole image set face to the user who has a very limited patience. In this paper, we propose a technique for improving the performance of image query-by-example execution strategies over multiple visual features. This includes first, the pre-clustering of the large image database and then, the scheduling of the processing of the feature clusters before providing progressively the query results (i.e., intermediate results are sent continuously before the end of the exhaustive scan over the whole database). A cluster eligibility criterion and two filtering rules are proposed to select the most relevant clusters to a query-by-example. Experiments over more than 110 000 images and five MPEG-7 global features show that our approach significantly reduces the query time in two experimental cases: the query time is divided by 4.8 for 100 clusters per descriptor type and by 7 for 200 clusters per descriptor type compared to a \"blind\" sequential NN-search with keeping the same final query result. This constitutes a promising perspective for optimizing image query-by-example execution.","PeriodicalId":346313,"journal":{"name":"Computer Vision meets Databases","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision meets Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1160939.1160946","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

The typical mode for querying in an image content-based information system is query-by-example, which allows the user to provide an image as a query and to search for similar images (i.e., the nearest neighbors) based on one or a combination of low-level multidimensional features of the query example. Off-lime, this requires the time-consuming pre-computing of the whole set of visual descriptors over the image database. On-line, one major drawback is that multidimensional sequential NN-search is usually exhaustive over the whole image set face to the user who has a very limited patience. In this paper, we propose a technique for improving the performance of image query-by-example execution strategies over multiple visual features. This includes first, the pre-clustering of the large image database and then, the scheduling of the processing of the feature clusters before providing progressively the query results (i.e., intermediate results are sent continuously before the end of the exhaustive scan over the whole database). A cluster eligibility criterion and two filtering rules are proposed to select the most relevant clusters to a query-by-example. Experiments over more than 110 000 images and five MPEG-7 global features show that our approach significantly reduces the query time in two experimental cases: the query time is divided by 4.8 for 100 clusters per descriptor type and by 7 for 200 clusters per descriptor type compared to a "blind" sequential NN-search with keeping the same final query result. This constitutes a promising perspective for optimizing image query-by-example execution.
在预聚类的大型图像数据库上优化逐例查询
在基于图像内容的信息系统中,查询的典型模式是按例查询,它允许用户提供一个图像作为查询,并根据查询示例的一个或多个低层次多维特征来搜索相似的图像(即最近邻居)。通常,这需要对图像数据库上的整个视觉描述符集进行耗时的预计算。在线搜索的一个主要缺点是,对于耐心有限的用户来说,多维顺序nn搜索通常是对整个图像集的穷尽搜索。在本文中,我们提出了一种技术,用于提高对多个视觉特征的图像按例查询执行策略的性能。这包括首先对大型图像数据库进行预聚类,然后在逐步提供查询结果之前对特征聚类的处理进行调度(即在对整个数据库进行穷举扫描结束之前连续发送中间结果)。提出了一个聚类资格准则和两个过滤规则来选择与实例查询最相关的聚类。对超过11万张图像和5个MPEG-7全局特征进行的实验表明,我们的方法在两个实验案例中显著减少了查询时间:与保持相同最终查询结果的“盲”顺序nn搜索相比,每个描述符类型100个簇的查询时间除以4.8,每个描述符类型200个簇的查询时间除以7。这为优化图像按示例查询的执行提供了一个很有前景的视角。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信