A case-study of scoring schemes for the PvS-index

Computer Vision meets Databases Pub Date : 2005-06-17 DOI:10.1145/1160939.1160953

Herwig Lejsek

{"title":"A case-study of scoring schemes for the PvS-index","authors":"Herwig Lejsek","doi":"10.1145/1160939.1160953","DOIUrl":null,"url":null,"abstract":"Recently we have proposed a new indexing method for high-dimensional data, the PvS-index. It provides fast query processing in constant time and is well suited for doing similarity search in Image Retrieval Systems using local descriptors. It is based on projecting data points onto random lines and uses this information to segment them into appropriately sized buckets, which can be read in just one I/O operation. After this preprocessing step the search queries just three buckets per query descriptor and uses a recent rank aggregation method, OMEDRANK, in order to provide good approximate results for the nearest neighbour problem.We have recently shown that PvS-indexing works well for large collections of real image data. In that work, however, we used a simple scoring scheme and collected few nearest neighbours for each query descriptor. In this study we examine how much the actual number of nearest neighbours, gathered for each local descriptor, influences the final query result, when searching a PvS-index. Based on the results we propose two new alternative scoring schemes, which improve the retrieval quality and stabilise the results, making the search less affected by the actual number of nearest neighbours accumulated.","PeriodicalId":346313,"journal":{"name":"Computer Vision meets Databases","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision meets Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1160939.1160953","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Recently we have proposed a new indexing method for high-dimensional data, the PvS-index. It provides fast query processing in constant time and is well suited for doing similarity search in Image Retrieval Systems using local descriptors. It is based on projecting data points onto random lines and uses this information to segment them into appropriately sized buckets, which can be read in just one I/O operation. After this preprocessing step the search queries just three buckets per query descriptor and uses a recent rank aggregation method, OMEDRANK, in order to provide good approximate results for the nearest neighbour problem.We have recently shown that PvS-indexing works well for large collections of real image data. In that work, however, we used a simple scoring scheme and collected few nearest neighbours for each query descriptor. In this study we examine how much the actual number of nearest neighbours, gathered for each local descriptor, influences the final query result, when searching a PvS-index. Based on the results we propose two new alternative scoring schemes, which improve the retrieval quality and stabilise the results, making the search less affected by the actual number of nearest neighbours accumulated.

查看原文本刊更多论文

pvs指数评分方案的案例研究

最近，我们提出了一种新的高维数据索引方法——pvs索引。它在恒定时间内提供了快速的查询处理，非常适合在使用局部描述符的图像检索系统中进行相似度搜索。它基于将数据点投射到随机线上，并使用这些信息将它们分割成适当大小的桶，这些桶可以在一个I/O操作中读取。在这个预处理步骤之后，搜索每个查询描述符只查询三个桶，并使用最近的秩聚合方法omedrunk，以便为最近邻问题提供良好的近似结果。我们最近的研究表明，pvs索引可以很好地处理大量真实图像数据。然而，在这项工作中，我们使用了一个简单的评分方案，并为每个查询描述符收集了几个最近的邻居。在本研究中，我们研究了在搜索pvs索引时，为每个局部描述符收集的最近邻居的实际数量对最终查询结果的影响程度。在此基础上，我们提出了两种新的评分方案，提高了检索质量，稳定了检索结果，使搜索不受实际累积近邻数量的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Vision meets Databases

自引率

0.00%

发文量